by Jaspal Subhlok, Bwolen Yang
In Proceedings of the Sixth ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
http://www-cgi.cs.cmu.edu/afs/cs.cmu.edu/user/bwolen/Web/papers/ppopp97-task.ps
Add To MetaCart
Abstract:
High Performance Fortran (HPF) has emerged as a standard language for data parallel computing. However, a wide variety of scientific applications are best programmed by a combination of task and data parallelism. Therefore, a good model of task parallelism is important for continued success of HPF for parallel programming. This paper presents a task parallelism model that is simple, elegant, and relatively easy to implement in an HPF environment. Task parallelism is exploited by mechanisms for dividing processors into subgroups and mapping computations and data onto processor subgroups. This model of task parallelism has been implemented in the Fx compiler at Carnegie Mellon University. The paper addresses the main issues in compiling integrated task and data parallel programs and reports on the use of this model for programming various flat and nested task structures. Performance results are presented for a set of programs spanning signal processing, image processing, computer vision and environment modeling. A variant of this task model is a new approved extension of HPF and this paper offers insight into the power of expression and ease of implementation of this extension.
Citations
|
963
|
Performance Fortran Forum. High Performance Fortran language specification version 1.0
– High
- 1993
|
|
389
|
The High Performance Fortran Handbook
– Koelbel, Loveman, et al.
- 1994
|
|
316
|
Compiling Fortran D for MIMD distributed-memory machines
– Hiranandani, Kennedy, et al.
- 1992
|
|
196
|
A multiple-baseline stereo
– �Okutomi, Kanade
- 1993
|
|
111
|
A hierarchical O(N log N) force calculation algorithm
– Barnes, Hut
- 1986
|
|
87
|
Task parallelism in a High Performance Fortran framework.IEEE Parallel
– GROSS, O'HALLARON, et al.
- 1994
|
|
74
|
Generating communication for array statements: Design, implementation, and evaluation
– Stichnoth, O'Hallaron, et al.
- 1994
|
|
61
|
An integrated runtime and compile-time approach for parallelizing structured and block structured applications
– Agrawal, Sussman, et al.
- 1995
|
|
60
|
The CMU task parallel program suite
– Dinda, Gross, et al.
- 1994
|
|
55
|
A Software Architecture for Multidisciplinary Applications: Integrating Task and Data Parallelism
– Chapman, Mehrotra, et al.
- 1994
|
|
55
|
A Compilation System that Integrates High Performance Fortran and Fortran M
– FOSTER, AVALANI, et al.
- 1994
|
|
33
|
Double Standards: Bringing Task Parallelism to HPF via the Message Passing Interface
– Foster, Kohr, et al.
- 1996
|
|
32
|
A convex programming approach for exploiting data and functional parallelism on distributed memory multiprocessors
– Ramaswamy, Sapatnekar, et al.
- 1994
|
|
29
|
An overview of the Opus language and runtime system
– Mehrotra, Haines
- 1994
|
|
29
|
Decoupling synchronization and data transfer in message passing systems of parallel computers
– Stricker, Stichnoth, et al.
- 1995
|
|
28
|
Communication and memory requirements as the basis for mapping task and data parallel programs
– Subhlok, O'Hallaron, et al.
- 1994
|
|
23
|
Optimal mapping of sequences of data parallel tasks
– Subhlok, Vondran
- 1995
|
|
19
|
Do&Merge: Integrating parallel loops and reductions
– YANG, WEBB, et al.
- 1993
|
|
18
|
CIT Photochemical Airshed Model - Systems
– McRae, Russell, et al.
- 1992
|
|
18
|
Latency and bandwidth consideration in parallel robotics image processing
– Webb
- 1993
|
|
12
|
Multiprocessors for radar signal processing
– SHAW, GABEL, et al.
- 1992
|
|
8
|
Using MPI: Portable parallel processing with the Message Passing Interface
– Gropp, Lusk, et al.
- 1994
|
|
8
|
Optimal latency-- throughput tradeoffs for data parallel pipelines
– SUBHLOK, VONDRAN
- 1996
|