Download:
by Jeffrey S. Vetter, Michael O. Mccracken
http://www.llnl.gov/CASC/people/vetter/pubs/ppopp01_scal_analysis.pdf
Add To MetaCart
Abstract:
Current trends in high performance computing suggest that users will soon have widespread access to clusters of multiprocessors with hundreds, if not thousands, of processors. This unprecedented degree of parallelism will undoubtedly expose scalability limitations in existing applications, where scalability is the ability of a parallel algorithm on a parallel architecture to effectively utilize an increasing number of processors. Users will need precise and automated techniques for detecting the cause of limited scalability. This paper addresses this dilemma. First, we argue that users face numerous challenges in understanding application scalability: managing substantial amounts of experiment data, extracting useful trends from this data, and reconciling performance information with their application’s design. Second, we propose a solution to automate this data analysis problem by applying fundamental statistical techniques to scalability experiment data. Finally, we evaluate our operational prototype on several applications, and show that statistical techniques offer an effective strategy for assessing application scalability. In particular, we find that non-parametric correlation of the number of tasks to the ratio of the time for communication operations to overall communication time provides a reliable measure for identifying communication operations that scale poorly. 1
Citations
|
739
|
Using MPI: Portable Parallel Programming with the Message Passing Interface, 2nd edition
– Gropp, Lusk, et al.
- 1999
|
|
376
|
Parallel Computer Architecture: A Hardware/Software Approach
– Culler, Singh
- 1998
|
|
344
|
Readings in Information Visualization: Using Vision to Think
– Card, Mackinlay, et al.
- 1999
|
|
230
|
Applied Multivariate Statistical Analysis
– RA, DW
- 1982
|
|
82
|
Analyzing scalability of parallel algorithms and architectures
– Kumar, Gupta
- 1994
|
|
71
|
Designing and Building Parallel Programs, Concepts and Tools for Parallel Software Engineering
– Foster
- 1994
|
|
62
|
et al. Numerical Recipes: The Art of Scientific Computing
– Press
- 2002
|
|
36
|
Solution of the FirstOrder Form of the 3-D Discrete Ordinates Equation on a Massively Parallel
– Koch, Baker, et al.
- 1992
|
|
34
|
Semicoarsening multigrid on distributed memory machines
– Brown, Falgout, et al.
- 2000
|
|
21
|
Parallel Performance Visualization: From Practice to Theory
– Heath, Malony, et al.
- 1995
|
|
18
|
Performance Analysis of Distributed Applications using Automatic Classification of Communication Inefficiencies
– Vetter
- 2000
|
|
15
|
Grama et al., Introduction to parallel computing: design and analysis of algorithms
– Kumar
- 1994
|
|
15
|
Real-Time Statistical Clustering for Event Trace Reduction
– REED, NICKOLAYEV, et al.
- 1997
|
|
11
|
Managing performance analysis with dynamic statistical projection pursuit
– VETTER, REED
- 1999
|
|
9
|
Aydt et al., “An Overview of the Pablo Performance Analysis Environment
– Reed, A
- 1992
|
|
7
|
et al., “A Users' Guide to PICL - A Portable Instrumented Communication
– Geist, Heath
- 1991
|
|
7
|
et al., “Very High Resolution Simulation of Compressible Turbulence
– Mirin, Cohen
- 1999
|
|
5
|
Barszcz et al., “NAS parallel benchmark results
– Bailey, E
- 1994
|
|
5
|
Malony et al., “Portable profiling and tracing for parallel, scientific applications using C
– Shende, D
- 1998
|
|
4
|
et al., Eds., Software Visualization: Programming as a Multimedia Experience
– Stasko, Domingue
- 1998
|
|
2
|
et al., “Architectural Requirements and Scalability
– Wong, Martin
- 1999
|
|
1
|
et al., “High-Performance Reactive Fluid Flow Simulations Using Adaptive Mesh Refinement on
– Calder, Curtis
- 2000
|
|
1
|
et al., “Parallel performance evaluation: the Medea tool
– Calzarossa, Massari
- 1996
|