Data Parallelism

The parallel hardware targeted by Project Sumatra can readily be programmed in the data parallel style.

Data parallel programs tend to exhibit these features:

large-scale SIMD: one stream (or a few streams) of instructions applied in parallel to many processors
arrays or tables: homogeneous data easily split across processors (easily = sub-linear work to decide partitions)
FORALL loops: large-scale program structure of: partition input; do local processing; collect results (and repeat)
reduction: inter-processor communication via meshes or trees, producing reductions or scans

Useful links:

Java parallel collections: http://cr.openjdk.java.net/~briangoetz/lambda/collections-overview.html
90's era survey of data-parallel languages: http://www.cs.cmu.edu/~scandal/parallel-lang/reading-list/node1.html
Haskell-based work at Microsoft: http://www.haskell.org/haskellwiki/Parallel_GHC_Project

Nested Data Parallelism

Flat data parallel languages are not very flexible. In practice programmers prefer to request lots of small tasks rather than one big SIMD task. The limit case is lots of random loops performing scalar operations, which is hard to convert back to any data-parallel form. A compromise is to allow the programmer to specify groupings or nestings on data sets for which an underlying homogeneity can be recovered and exploited by the system.

References (more recent first):

Recent MSL results: http://research.microsoft.com/en-us/um/people/simonpj/papers/ndp/vectorisation-avoidance-hw12.pdf
NDP challenges: http://research.microsoft.com/en-us/um/people/simonpj/papers/ndp/NdpSlides.pdf
NESL/VCODE: http://www.cs.cmu.edu/afs/cs.cmu.edu/project/scandal/public/papers/vcode-frontiers90.html

Child pages

Data Parallelism

Nested Data Parallelism