Download:
|
by David K. Lowenthal, David K. Lowenthal, Gregory R. Andrews, Gregory R. Andrews
University of Arizona
ftp://ftp.cs.arizona.edu/reports/1995/TR95-13.ps.Z
Add To MetaCart
Abstract:
Programming distributed-memory machines requires careful placement of data on the nodes. This is because achieving efficiency requires balancing the computational load among the nodes and minimizing excess data movement between the nodes. Most current approaches to data placement require the programmer or compiler to place data initially and then possibly to move it explicitly during a computation. This paper describes a new, adaptive approach to data placement. It is implemented in the Adapt system, which takes an initial data placement, efficiently monitors how well it performs, and changes the placement whenever the monitoring indicates that a different placement would perform better. Using Adapt can simplify the programming of parallel systems and simplify compilers for parallel languages such as HPF. In particular, Adapt frees the programmer from having to specify data placements, and it frees the compiler from having to do often complex analysis to determine a good placement. Moreover, Adapt supports a new "variable block " placement, which is especially useful for applications with nearest-neighbor communication but an imbalanced workload. For applications in which the best data placement varies dynamically, using Adapt can lead to much better performance than using any statically determined data placement. We present the performance of Adapt on three scientific applications that require different data placements.
Citations
|
705
|
SPLASH: Stanford Parallel Applications for Shared Memory
– Singh, Weber, et al.
- 1992
|
|
316
|
Compiling Fortran D for MIMD distributed-memory machines
– Hiranandani, Kennedy, et al.
- 1992
|
|
241
|
Global optimizations for parallelism and locality on scalable parallel machines
– Anderson, Lain
- 1993
|
|
179
|
Adaptive mesh refinement for hyperbolic partial differential equations
– Berger, Oliger
- 1984
|
|
179
|
SUPERB: A tool for semi-automatic MIMD/SIMD parallelization
– ZIMA, BAST, et al.
- 1988
|
|
172
|
A static performance estimator to guide data partitioning decisions
– BALASUNDARAM, Fox, et al.
- 1991
|
|
140
|
The Livermore Fortran kernels: a computer test of the numerical performance range
– McMahon
- 1986
|
|
129
|
Data optimization: Allocation of arrays to reduce communication on SIMD machines
– Knobe, Lukas, et al.
- 1990
|
|
117
|
The data alignment phase in compiling programs for distributed-memory machines
– Li, Chen
- 1991
|
|
115
|
Index Domain Alignment: Minimizing Cost of Cross{referencing between Distributed Arrays
– Li, Chen
- 1990
|
|
115
|
RunTime Parallelization and Scheduling of Loops
– Saltz, Mirchandaney, et al.
- 1991
|
|
83
|
The DINO parallel programming language
– Rosing, Schnabel, et al.
- 1990
|
|
75
|
Automatic Data Layout for High Performance Fortran
– Kremer
- 1995
|
|
73
|
Distributed Filaments: Efficient Fine-Grain Parallelism on a Cluster of Workstations
– Freeh, Lowenthal, et al.
- 1994
|
|
72
|
Automatic data mapping for distributed-memory parallel computers
– WHOLEY
- 1992
|
|
41
|
Performance FORTRAN Language Specification
– High
- 1993
|
|
36
|
Compiler techniques for data partitioning of sequentially iterated parallel loops
– Hudak, Abraham
- 1990
|
|
28
|
Runtime Array Redistribution in HPF Programs
– Thakur, Choudhary, et al.
- 1994
|
|
16
|
Vectorization of a particle simulation method for hypersonic rarified flow,” presented at
– McDonald, Baganoff
- 1988
|
|
15
|
Eric Schauser. Active Messages: a Mechanism for Integrated Communication and Computation
– Eicken, Culler, et al.
- 1992
|
|
13
|
The particle-in-cell computing method for fluid dynamics
– Harlow
- 1964
|
|
11
|
Supporting Fine-Grain Computation on Distributed Memory Parallel Computers
– Socha
- 1991
|
|
10
|
PARADIGM: A compiler for automated data distribution on multicomputers
– Gupta, Banerjee
- 1993
|
|
8
|
Compiling functional parallelism on distributed-memory systems
– Pande, Agrawal, et al.
- 1994
|