Data prefetching, data forwarding, message passing, compiler algorithms, cache coherent shared memory multiprocessors, execution-driven simulation, Perfect Benchmarks.
|
455
|
Design and evaluation of a compiler algorithm for prefetching
– Mowry, Lam, et al.
- 1992
|
|
363
|
The Stanford Dash Multiprocessor
– Lenoski, Laudon, et al.
- 1992
|
|
316
|
Compiling Fortran D for MIMD distributed-memory machines
– Hiranandani, Kennedy, et al.
- 1992
|
|
274
|
Lockup-free instruction fetch/prefetch cache organisation
– Kroft
- 1981
|
|
264
|
Tolerating Latency Through SoftwareControlled Prefetching in Shared-Memory Multiprocessors
– Mowry, Gupta
- 1991
|
|
240
|
Software prefetching
– Callahan, Kennedy, et al.
- 1991
|
|
213
|
The Perfect Club Benchmarks: Effective Performance Evaluation of Supercomputers
– Berry, Chen, et al.
- 1989
|
|
143
|
Efficient Synchronization Primitives for Large-Scale Cache-Coherent Multiprocessors
– Goodman, Vernon, et al.
- 1989
|
|
130
|
The performance of multistage interconnection networks for multiprocessors
– Kruskal, Snir
- 1983
|
|
107
|
Analysis of cache invalidation patterns in multiprocessors
– Weber, Gupta
- 1989
|
|
61
|
Effective Cache Prefetching on BusBased Multiprocessors
– Tullsen, Eggers
- 1995
|
|
60
|
Reducing Memory and Traffic Requirements for Scalable Directory-Based Cache Coherence Schemes
– Gupta, Weber, et al.
- 1990
|
|
51
|
An efficient data dependence analysis for parallelizing compilers
– Li, Yew, et al.
- 1990
|
|
36
|
Data prefetching for high-performance processors
– Chen
- 1993
|
|
24
|
Execution-Driven Tools for Parallel Simulation of Parallel Architectures and Applications
– Poulsen, Yew
- 1993
|
|
23
|
The Cedar system and an initial performance study
– Kuck
- 1993
|
|
21
|
Limitations of Cache Prefetching on a Bus-Based Multiprocessor
– Tullsen, Eggers
- 1993
|
|
18
|
ªThe Potential of Compile-Time Analysis to Adapt the Cache Coherence Enforcement Strategy to the Data Sharing Characteristics,º
– Mounes-Toussi, Lilja
- 1995
|
|
16
|
Architectural primitives for a scalable shared memory multiprocessor
– Lee, Ramachandran
- 1991
|
|
15
|
Data prefetching and data forwarding in shared memory multiprocessors
– Poulsen, Yew
- 1994
|
|
14
|
Memory Latency Reduction via Data Prefetching and Data Forwarding in Shared Memory Multiprocessors
– Poulsen
- 1994
|
|
13
|
Notification and multicast networks for synchronization and coherence
– Andrews, Beckmann, et al.
- 1992
|
|
7
|
A hybrid shared memory / message passing parallel machine
– Frank, Vernon
- 1993
|
|
5
|
et al. Cooperative shared memory: software and hardware for scalable multiprocessors
– Hill
- 1993
|
|
4
|
The Cedar Fortran Project
– Padua
- 1992
|
|
4
|
EPG source code instrumentation tools - user manual
– Poulsen, Yew
- 1994
|
|
4
|
Efficient doacross synchronization on distributed shared-memory multiprocessors
– Su, Yew
- 1991
|
|
3
|
et al. Parafrase-2: an environment for parallelizing, partitioning, synchronizing and scheduling programs on multiprocessors
– Polychronopoulos
- 1989
|
|
2
|
et al. Data forwarding in scalable shared-memory multiprocessors
– Koufaty
- 1995
|