by Shamik D. Sharma, Mark D. Hill, James R. Larus, Anne Rogers, Joel Saltz
ftp://ftp.cs.wisc.edu/wwt/ppopp95_irregular.ps
Add To MetaCart
Abstract:
Irregular computation problems underlie many important scientific applications. Although these problems are computationally expensive, and so would seem appropriate for parallel machines, their irregular and unpredictable run-time behavior makes this type of parallel program difficult to write and adversely affects run-time performance. This paper explores three issues---partitioning, mutual exclusion, and data transfer---crucial to the efficient execution of irregular problems on distributed-memory machines. Unlike previous work, we studied the same programs running in three alternative systems on the same hardware base (a Thinking Machines CM-5): the CHAOS irregular application library, Transparent Shared Memory (TSM), and eXtensible Shared Memory (XSM). CHAOS and XSM performed equivalently for all three applications. Both systems were somewhat (13%) to significantly faster (991%) than TSM.
Citations
|
414
|
Partitioning sparse matrices with eigenvectors of graphs
– Pothen, Simon, et al.
- 1990
|
|
323
|
Tempest and Typhoon: User-Level Shared Memory
– Reinhardt, Larus, et al.
- 1994
|
|
263
|
CHARMM: a program for macromolecular energy, minimization and dynamics calculations
– Brooks, Buccoleri, et al.
- 1983
|
|
175
|
Supporting Compiling Global Name-Space Parallel Loops for Distributed Execution
– Koelbel, Mehrotra
- 1991
|
|
166
|
et al., “The Stanford FLASH Multiprocessor
– Kuskin
- 1994
|
|
166
|
Fine-Grain Access Control for Distributed Shared Memory
– Schoinas, Falsafi, et al.
- 1994
|
|
137
|
Communication optimizations for irregular scientific computations on distributed memory architectures
– Das, Uysal, et al.
- 1994
|
|
115
|
RunTime Parallelization and Scheduling of Loops
– Saltz, Mirchandaney, et al.
- 1991
|
|
111
|
A hierarchical O(N log N) force calculation algorithm
– Barnes, Hut
- 1986
|
|
100
|
Integrating Message-Passing and Shared-Memory: Early Experience
– Kranz, Johnson, et al.
- 1993
|
|
89
|
ApplicationSpecific Protocols for User-Level Shared Memory
– Falsafi, Lebeck, et al.
- 1994
|
|
75
|
Software Versus Hardware Shared-Memory Implementation: a Case Study
– Cox, Dwarkadas, et al.
- 1994
|
|
60
|
Run-time and Compile-time Support for Adaptive Irregular Problems
– Sharma, Ponnusamy, et al.
- 1994
|
|
49
|
Slicing Analysis and Indirect Accesses to Distributed Arrays
– Das, Saltz, et al.
- 1993
|
|
48
|
Implementing an irregular application on a distributed memory multiprocessor
– Chakrabarti, Yelick
- 1993
|
|
45
|
Tempest: A substrate for portable parallel programs
– Hill, Larus, et al.
- 1995
|
|
44
|
A Manual for the CHAOS Runtime Library
– Saltz, Ponnusamy, et al.
- 1995
|
|
36
|
Adaptive Runtime Support for Direct Simulation Monte Carlo Methods on Distributed Memory Architectures
– Moon, Saltz
- 1994
|
|
32
|
Improving the Performance of DSM Systems via Compiler Involvement
– Mirchandaney, Hiranandani, et al.
- 1994
|
|
20
|
APartitioning Strategy for PDEs across Multiprocessors
– Berger, Bokhari
- 1985
|
|
20
|
Support for Distributed Dynamic Data Structures in C
– Chang, Sussman, et al.
- 1995
|
|
19
|
Data and Program Restructuring of Irregular Applications for CacheCoherent Multiprocessors
– Tomko, Abraham
- 1994
|
|
18
|
Tempest Interface Specification (Revision 1.2.1
– Reinhardt
- 1995
|
|
15
|
Direct Simulation Monte Carlo Analysis of Rare ed Flows on Parallel Processors
– Wilmoth
- 1991
|