Compiler Techniques for Software Prefetching on Cache-Coherent Shared-Memory Multiprocessors (1997)  (1 citation)
Nathaniel McIntosh, Katherine Fletcher, Keith Cooper, Ken Kennedy...



View or download:
unc.edu/pub/users/sc/ppopp97/20.ps.Z
rice.edu/pub/CRPC...PCTR96675S.ps.gz

From:  unc.edu
From:  rice.edu/CRPC/softli...TRs_online

Abstract: In this paper we present a comprehensive compiler framework for improving the efficiency of compiler-directed software prefetching on cache-coherent distributed shared-memory multiprocessors. The key component of our work is a form of global data-flow analysis that predicts at compile-time the sets of array references that are likely to cause coherence activity at run-time. The data-flow framework accurately analyzes the cache behavior in a parallel program by combining array section analysis...

Context of citations to this paper:

.... by using information about the data layout) Such an approach has been discussed in other studies for software and hardware prefetching [8, 13, 21]. Our applications, however, did not show much benefit with this modification. To appear in Proceedings of ISCA 24 (June,...

Cited by:
The Interaction of Software Prefetching with ILP.. - Ranganathan, Pai.. (1997)

Similar documents (at the sentence level):
39.6%:   Compiler Support for Software Prefetching - McIntosh (1998)
29.1%:   Compiler Techniques for Software Prefetching on.. - Cooper, Kennedy..

Active bibliography (related documents):
0.2:   Communication Generation for Data-Parallel Languages - Sethi (1996)
0.1:   Cross-loop Reuse Analysis and its Application to Cache.. - Keith Cooper (1996)
0.1:   Compiler Support for Machine-Independent Parallelization of.. - von Hanxleden (1994)

Similar documents based on text:
0.3:   Design and Evaluation of a Compiler Algorithm for Prefetching - Mowry, Lam, Gupta (1992)
0.3:   Guided Region Prefetching: A Cooperative.. - Wang, Burger.. (2003)
0.1:   Unknown - (2001)

Related documents from co-citation:
2:   Tolerating latency through software-controlled prefetching in shared-memory mult.. - Mowry, Gupta - 1991

BibTeX entry:

N. McIntosh, K. Fletcher, K. Cooper, and K. Kennedy. Compiler Techniques for Software Prefetching on Cache-Coherent Shared-Memory Multiprocessors. Center for Research on Parallel Computation, Rice University, CRPC-TR96675-S, 1997. http://citeseer.ist.psu.edu/mcintosh97compiler.html

@phdthesis{ mcintosh97compiler,
    author = "Nathaniel McIntosh",
    title = "Compiler Support for Software Prefetching",
    address = "Houston, TX",
    year = "1997",
    url = "citeseer.ist.psu.edu/mcintosh97compiler.html" }
Citations (may not include all citations):
1399   Compilers: Principles - Aho, Sethi et al. - 1986
835   High Performance Fortran language specification - Fortran - 1993
362   The Stanford FLASH multiprocessor - Kuskin, Ofelt et al. - 1994
344   Design and evaluation of a compiler algorithm for prefetchin.. - Mowry, Lam et al. - 1992
249   Tolerating latency through software-controlled prefetching i.. - Mowry, Gupta - 1991
249   Tolerating Latency Through Software Controlled Data Prefetch.. - Mowry - 1994
212   The MIT Alewife machine: Architecture and performance - Agarwal, Bianchini et al. - 1995
149   An implementation of interprocedural bounded regular section.. - Havlak, Kennedy - 1991
121   Using processor affinity in loop scheduling on shared-memory.. - Markatos, LeBlanc - 1994
82   A Low Overhead Coherence Solution for Multiprocessors with P.. - Papamarcos, Patel - 1984
79   Interprocedural dependence analysis and parallelization - Burke, Cytron - 1986
58   A mechanism for keeping useful internal information in paral.. - Balasundaram - 1990
55   A survey of data flow analysis techniques - Kennedy - 1981
50   Structured dataflow analysis for arrays and its use in an op.. - Gross, Steenkiste - 1990
40   Limitations of cache prefetching on a bus-based multiprocess.. - Tullsen, Eggers - 1993
38   A unified data-flow framework for optimizing communication - Gupta, Schonberg et al. - 1994
36   Detecting redundant accesses to array data - Granston, Veidenbaum - 1991
33   KAP User's Guide - Associates - 1988
25   SPEC Benchmark Suite: Designed for today's advanced systems - Uniejewski - 1989
21   Efficient interprocedural analysis for program restructuring.. - Li, Yew - 1988
20   A framework for exploiting data availability to optimize com.. - Gupta, Schonberg - 1993
19   Compiler Support for Machine-Independent Parallelization of .. - Hanxleden - 1994
14   Exemplar Architecture - Corporation - 1993
13   Execution-driven simulation of multiprocessors: Address and .. - Dwarkadas, Jump et al. - 1994
11   Cross-loop reuse analysis and its application to cache optim.. - Cooper, Kennedy et al. - 1996

Documents on the same site (http://fermivista.math.jussieu.fr/ftp/ftp.cs.unc.edu.html):
Managing Latency in Complex Augmented Reality Systems - Jacobs, Livingston, State (1997)
Automated Deduction Techniques for Classification in.. - Paramasivam, Plaisted (1995)
N-body Methods on MIMD Supercomputers: Astrophysics on.. - Fullagar, Quinn.. (1992)

CiteSeer.IST - Copyright Penn State and NEC