by Steve G. Steinberg, Jun Yang, Katherine Yelick
In Proceedings of the 10th International Parallel Processing Symposium
http://HTTP.CS.Berkeley.EDU/~yelick/steve/ipps96.ps
Add To MetaCart
Abstract:
We present a case study in the use of performance modeling for parallel application development, with a biological cell simulation as our target application. We show that a simple performance model is adequate for determining data layout for arrays and linked structures, and validate our model against experimental results for some application kernels. We quantify the importance of optimizing across program components using information about machine performance and input characteristics. The cell simulation application has two phases, one regular and one irregular. The model closely predicts actual performance within the regular phase and allows for qualitative design comparisons in the irregular one. The resulting application is written in Split-C and runs on multiple platforms. 1.
Citations
|
241
|
Global optimizations for parallelism and locality on scalable parallel machines
– Anderson, Lain
- 1993
|
|
154
|
Parallel programming in split-c
– Culler, Dusseau, et al.
- 1993
|
|
137
|
Communication optimizations for irregular scientific computations on distributed memory architectures
– Das, Uysal, et al.
- 1994
|
|
91
|
Automatic array alignment in data-parallel programs
– Chatterjee, Gilbert, et al.
- 1993
|
|
75
|
Automatic Data Layout for High Performance Fortran
– Kremer
- 1995
|
|
54
|
An Integrated Compilation and Performance Analysis Environment for Data Parallel Programs," presented at Supercomputing '95
– Adve, Mellor-Crummey, et al.
- 1995
|
|
27
|
Automatic performance prediction to support parallelization of Fortran programs for massively parallel systems
– Fahringer, Blasko, et al.
- 1992
|
|
24
|
Automatic selection of Dynamic Data Partitioning Schemes for Distributed-Memory Multicomputers
– Palermo, Banerjee
- 1995
|
|
19
|
Interprocedural compilation of irregular applications for distributed memory machines
– Agrawal, Saltz
- 1995
|
|
8
|
Parallelizing a Global Atmospheric Chemical Tracer Model
– Demmel, Smith
- 1994
|
|
5
|
Automatic data layout using 0--1 integer programming
– Kremer
- 1994
|
|
4
|
Truncated Newton methods and the modeling of complex immersed elastic structures
– Fauci, Fogelson
- 1993
|
|
3
|
Parallelizing a cell simulation: Analysis, abstraction, and portability
– Steinberg
- 1994
|
|
2
|
Two--dimensional fft algorithms on hypercube and mesh machines
– Angelopoulos, Pitas
- 1993
|
|
2
|
Cardiac fluid dynamics
– McQueen
- 1992
|
|
1
|
High-level optimization via automated satistical modeling
– Brewer
- 1995
|
|
1
|
Three-dimensional fluid dynamics in a twodimensional amount of central memory
– Greenberg
- 1987
|
|
1
|
Notochord morphogenesis in xernopus laevis: simulation of cell behavior underlying tissue convergence and extension
– Weliky
- 1991
|