MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  Adaptive Data Placement for Distributed-Memory Machines (1995) [5 citations — 2 self]

Download:
Download as a PDF | Download as a PS
by David K. Lowenthal, David K. Lowenthal, Gregory R. Andrews, Gregory R. Andrews
University of Arizona
ftp://ftp.cs.arizona.edu/reports/1995/TR95-13.ps.Z
Add To MetaCart

Abstract:

Programming distributed-memory machines requires careful placement of data on the nodes. This is because achieving efficiency requires balancing the computational load among the nodes and minimizing excess data movement between the nodes. Most current approaches to data placement require the programmer or compiler to place data initially and then possibly to move it explicitly during a computation. This paper describes a new, adaptive approach to data placement. It is implemented in the Adapt system, which takes an initial data placement, efficiently monitors how well it performs, and changes the placement whenever the monitoring indicates that a different placement would perform better. Using Adapt can simplify the programming of parallel systems and simplify compilers for parallel languages such as HPF. In particular, Adapt frees the programmer from having to specify data placements, and it frees the compiler from having to do often complex analysis to determine a good placement. Moreover, Adapt supports a new "variable block " placement, which is especially useful for applications with nearest-neighbor communication but an imbalanced workload. For applications in which the best data placement varies dynamically, using Adapt can lead to much better performance than using any statically determined data placement. We present the performance of Adapt on three scientific applications that require different data placements.

Citations

705 SPLASH: Stanford Parallel Applications for Shared Memory – Singh, Weber, et al. - 1992
316 Compiling Fortran D for MIMD distributed-memory machines – Hiranandani, Kennedy, et al. - 1992
241 Global optimizations for parallelism and locality on scalable parallel machines – Anderson, Lain - 1993
179 Adaptive mesh refinement for hyperbolic partial differential equations – Berger, Oliger - 1984
179 SUPERB: A tool for semi-automatic MIMD/SIMD parallelization – ZIMA, BAST, et al. - 1988
172 A static performance estimator to guide data partitioning decisions – BALASUNDARAM, Fox, et al. - 1991
140 The Livermore Fortran kernels: a computer test of the numerical performance range – McMahon - 1986
129 Data optimization: Allocation of arrays to reduce communication on SIMD machines – Knobe, Lukas, et al. - 1990
117 The data alignment phase in compiling programs for distributed-memory machines – Li, Chen - 1991
115 Index Domain Alignment: Minimizing Cost of Cross{referencing between Distributed Arrays – Li, Chen - 1990
115 RunTime Parallelization and Scheduling of Loops – Saltz, Mirchandaney, et al. - 1991
83 The DINO parallel programming language – Rosing, Schnabel, et al. - 1990
75 Automatic Data Layout for High Performance Fortran – Kremer - 1995
73 Distributed Filaments: Efficient Fine-Grain Parallelism on a Cluster of Workstations – Freeh, Lowenthal, et al. - 1994
72 Automatic data mapping for distributed-memory parallel computers – WHOLEY - 1992
41 Performance FORTRAN Language Specification – High - 1993
36 Compiler techniques for data partitioning of sequentially iterated parallel loops – Hudak, Abraham - 1990
28 Runtime Array Redistribution in HPF Programs – Thakur, Choudhary, et al. - 1994
16 Vectorization of a particle simulation method for hypersonic rarified flow,” presented at – McDonald, Baganoff - 1988
15 Eric Schauser. Active Messages: a Mechanism for Integrated Communication and Computation – Eicken, Culler, et al. - 1992
13 The particle-in-cell computing method for fluid dynamics – Harlow - 1964
11 Supporting Fine-Grain Computation on Distributed Memory Parallel Computers – Socha - 1991
10 PARADIGM: A compiler for automated data distribution on multicomputers – Gupta, Banerjee - 1993
8 Compiling functional parallelism on distributed-memory systems – Pande, Agrawal, et al. - 1994