MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  y

Download:
Download as a PDF | Download as a PS
by Hock-beng Lim, Pen-chung Yew
http://www.cs.umn.edu/Research/Agassiz/Paper/lim.ipdps00.ps.Z
Add To MetaCart

Abstract:

Cache coherence enforcement and memory latency reduction and hiding are very important and challenging problems in the design of large-scale distributed sharedmemory (DSM) multiprocessors. We propose an integrated framework to solve these problems through a compilerdirected cache coherence scheme called the Cache Coherence with Data Prefetching (CCDP) scheme. The CCDP scheme enforces cache coherence by prefetching the potentially stale references in a parallel program. It also prefetches the nonstale references to hide their memory latencies. To optimize the performance of the CCDP scheme, some prefetch hardware support is provided to efficiently handle these two forms of data prefetching operations. We also developed the compiler techniques utilized by the CCDP scheme for stale reference detection, prefetch target analysis and prefetch scheduling. We evaluated the performance of the CCDP scheme via execution-driven simulations of several applications from the SPEC CFP95 and the Perfect benchmark suites. The simulation results show that the CCDP scheme provides significant performance improvements for the applications studied.

Citations

264 Tolerating Latency Through Software-Controlled Data Prefetching – Mowry - 1994
131 Improving Locality and Parallelism in Nested Loops – Wolf - 1992
130 The performance of multistage interconnection networks for multiprocessors – Kruskal, Snir - 1983
60 An Economical Solution to the Cache Coherence Problem – Archibald, Baer - 1984
47 Polaris: A New-Generation Parallelizing Compiler for MPPs – Padua, Eigenmann, et al. - 1993
30 Compiler techniques for data prefetching on the PowerPC – Bernstein, Cohen, et al.
24 Execution-Driven Tools for Parallel Simulation of Parallel Architectures and Applications – Poulsen, Yew - 1993
13 Cache Design and Performance in a Large-Scale Shared-Memory Multiprocessor System – Chen - 1993
13 Compile time analysis for data prefetching – Gornish - 1989
11 Hardware and Compiler Support for Cache Coherence in Large-Scale Multiprocessors – Choi - 1996
5 Techniques for Compiler-Directed Cache Coherence – Choi, Lim, et al. - 1996
2 Maintaining cache coherence through compiler-directed data prefetching – Lim, Yew - 1998