by Jaspal Subhlok, Gary Vondran
In Proceedings of the Fifth ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
http://www.cs.uh.edu/~jaspal/papers/ppopp95.ps
Add To MetaCart
Abstract:
Many applications in a variety of domains including digital signal processing, image processing, and computer vision are composed of a sequence of tasks that act on a stream of input data sets in a pipelined manner. Recent research has established that these applications are best mapped to a massively parallel machine by dividing the tasks into modulesand assigninga subset of the available processors to each module. This paper addresses the problem of optimally mapping such applications onto a massively parallel machine. We formulate the problem of optimizing throughput in task pipelines and present two new solution algorithms. The formulation uses a general and realistic model for inter-task communication, takes memory constraints into account, and addresses the entire problem of mapping which includes clustering tasks into modules, assignment of processors to modules, and possible replication of modules. The first algorithm is based on dynamic programming and finds the optimal mapping of k tasks onto P processors in O(P 4
Citations
|
230
|
Partitioning and Scheduling Parallel Programs for Multiprocessors
– Sarkar
- 1989
|
|
93
|
Exploiting task and data parallelism on a multicomputer
– Subhlok, Stichnoth, et al.
- 1993
|
|
87
|
Task parallelism in a High Performance Fortran framework.IEEE Parallel
– GROSS, O'HALLARON, et al.
- 1994
|
|
60
|
The CMU task parallel program suite
– Dinda, Gross, et al.
- 1994
|
|
55
|
A Software Architecture for Multidisciplinary Applications: Integrating Task and Data Parallelism
– Chapman, Mehrotra, et al.
- 1994
|
|
55
|
A Compilation System that Integrates High Performance Fortran and Fortran M
– FOSTER, AVALANI, et al.
- 1994
|
|
32
|
A convex programming approach for exploiting data and functional parallelism on distributed memory multiprocessors
– Ramaswamy, Sapatnekar, et al.
- 1994
|
|
28
|
Communication and memory requirements as the basis for mapping task and data parallel programs
– Subhlok, O'Hallaron, et al.
- 1994
|
|
26
|
Scheduling and Code Generation for Parallel Architectures
– Yang
- 1993
|
|
25
|
Integrated support for task and data parallelism
– Chandy, Foster, et al.
- 1994
|
|
19
|
Do&Merge: Integrating parallel loops and reductions
– YANG, WEBB, et al.
- 1993
|
|
18
|
Latency and bandwidth consideration in parallel robotics image processing
– Webb
- 1993
|
|
15
|
Optimal processor assignment for a class of pipelined computations
– CHOUDHARY, NARAHARI, et al.
- 1994
|
|
6
|
The advantages of multiple parallelizations in combinatorial search
– Crowl, Crovella, et al.
- 1994
|
|
3
|
Optimization of latency, throughput and processors for pipelines of data parallel tasks
– Vondran
- 1995
|
|
2
|
AssignmentProblems in Parallel and Distributed Computing
– BOKHARI
- 1987
|