shift
instruction of Orthotope Machine is displace the array by a constant vector. Because every array has a finite size, the users must set the boundary condition to specify how the out-of-bound access are treated.Current Paraiso supports two boundary conditions; open boundary and cyclic boundary.
You can choose boundary condition for every dimension independently
by setting a value of the type vector of
Condition
in boundary
field of the Setup
data.Of the two, the cyclic boundary condition is simple. Out-of-bound accesses see the other side of the array.
The open boundary condition is rather complicated. First, the size of margin regions are determined so that any Paraiso kernel can cover the region you said you wanted to calculate. You have specified the region
0 <= i < localSize
in the Setup. Next, each kernel start the calculation from the maximum region including the margins, and calculates as large region as possible. The array values out of that region are undefined. Also, the reduction operators always reduce over the entire margined region. You need to manually mask the margins if you want to.Please look at the sample program to understand how Paraiso deals with the boundary conditions. This program has three kernels:
myKernels :: [Named (Builder Vec1 Int Annotation ())] myKernels = ["init" `isNameOf` do store table $ loadIndex (Axis 0) ,"increment" `isNameOf` do store table $ 1 + load table ,"calculate" `isNameOf` do center <- bind $ load table right <- bind $ shift (Vec :~ (-1)) center left <- bind $ shift (Vec :~ ( 1)) center ret <- bind $ 10000 * left + 100 * center + right store table ret store total $ reduce Reduce.Sum ret ]The first kernel initializes the array elements with their respective indices. The second adds 1 to every element. The third reads adjacent cells and perform some calculations. Let's give it a go:
maker.init(); dump(); maker.increment(); dump(); maker.calculate(); dump(); cout << "total: " << maker.total() << endl; cout << endl;Say
make
and two program will be generated. They are the same but for the boundary conditions. Under the cyclic boundary condition, you can read across the boundary, as follows:$ ./main-cyclic.out
index: 0 1 2 3 4 5 6 7
value: 0 1 2 3 4 5 6 7
index: 0 1 2 3 4 5 6 7
value: 1 2 3 4 5 6 7 8
index: 0 1 2 3 4 5 6 7
value: 80102 10203 20304 30405 40506 50607 60708 70801
total: 363636
On the other hand if you use the open boundary condition, Paraiso adds the margin regions for you, and you can access indices beyond 0 and localSize
.$ ./main-open.out
index: -1 0 1 2 3 4 5 6 7 8
value: -1 0 1 2 3 4 5 6 7 8
index: -1 0 1 2 3 4 5 6 7 8
value: 0 1 2 3 4 5 6 7 8 9
index: -1 0 1 2 3 4 5 6 7 8
value: 0 102 10203 20304 30405 40506 50607 60708 70809 0
total: 283644
No comments:
Post a Comment