See this document in CiteSeerX!

Estimating Cache Performance for Sequential and Data Parallel Programs (1997)  (Make Corrections)  (8 citations)
Thomas Fahringer
HPCN Europe



  Home/Search   Context   Related

Links:   ACM   DBLP

 
View or download:
vcpc.univie.ac.at/...oratr199701.ps.gz
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  vcpc.univie.ac.at/aurora/publi... (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: This paper introduces an analytical model that enables automatic estimation of the cache performance for both sequential and data parallel Fortran programs. The estimation is based on a classification of array accesses with respect to cache reuse at the source code level. An estimated upper bound of the number of distinct cache lines accessed inside of a loop is statically computed. Based on this estimate the number of cache misses for loops, procedures and the entire program can be... (Update)

Context of citations to this paper:   More

...there is enough computation contained in a loop, thus parallelizing the loop may be effective. ffl Cache misses It is well known [27, 81, 55, 52, 33] that inefficient memory access patterns and data mapping into the memory hierarchy (data locality problem) of a single processor...

...and loop nests L 2 LE , is defined as ctE(E) X s2SE ctS(s) X l2LE ctL(l) 4. 4 Number of Cache Misses It is well known [18, 45, 36, 33, 24] that inefficient memory access patterns and data mapping into the memory hierarchy (data locality problem) of a single processor cause...

Cited by:   More
P³T+: A Performance Estimator for Distributed and.. - Fahringer, Pozgaj (1999)   (Correct)
P³T+: A Performance Estimator for Distributed and.. - Fahringer, Pozgaj (2001)   (Correct)
Estimating Cache Misses and Locality Using Stack Distances - Cascaval, Padua (2003)   (Correct)

Similar documents (at the sentence level):
33.1%:   PłT+: A Performance Estimator for Distributed and Parallel.. - Pozgaj, Fahringer (2000)   (Correct)

Active bibliography (related documents):   More   All
0.0:   Estimating Cache Performance for Sequential and Data Parallel.. - Fahringer (1997)   (Correct)
0.0:   A Unified Symbolic Evaluation Framework for Parallelizing.. - Fahringer, Scholz (1999)   (Correct)
0.0:   Efficient Symbolic Analysis for Parallelizing Compilers and.. - Fahringer (1998)   (Correct)

Similar documents based on text:   More   All
0.2:   Symbolic Cache Analysis for Real-Time Systems - Blieberger, Fahringer, Scholz (1999)   (Correct)
0.1:   SCALEA-G: a Unified Monitoring and Performance Analysis.. - Truong, Fahringer (2003)   (Correct)
0.1:   From Web Services to OGSA: Experiences in Implementing an.. - Prodan, al. (2003)   (Correct)

Related documents from co-citation:   More   All
6:   VFC: The Vienna Fortran Compiler (context) - Benkner - 1998
6:   On estimating and enhancing cache effectiveness (context) - Ferrante, Sarkar et al. - 1991
6:   A Data Locality Optimizing Algorithm (context) - Wolf, Lam - 1991

BibTeX entry:   (Update)

T. Fahringer. Estimating cache performance for sequential and data parallel programs. Technical Report TR 97-9, Institute for Software Technology and Parallel Systems, Univ. of Vienna, Vienna, Austria, October 1997. http://citeseer.ist.psu.edu/fahringer97estimating.html   More

@inproceedings{ fahringer97estimating,
    author = "Thomas Fahringer",
    title = "Estimating Cache Performance for Sequential and Data Parallel Programs",
    booktitle = "{HPCN} Europe",
    pages = "840-849",
    year = "1997",
    url = "citeseer.ist.psu.edu/fahringer97estimating.html" }
Citations (may not include all citations):
474   A data locality optimizing algorithm (context) - Wolf, Lam - 1991  ACM   DBLP
94   Optimizing for Parallelism and Data Locality - Kennedy, McKinley - 1992  ACM   DBLP
82   On estimating and enhancing cache effectiveness (context) - Ferrante, Sarkar et al. - 1991  ACM   DBLP
70   Automatic Performance Prediction of Parallel Programs (context) - Fahringer - 1996  ACM
39   Estimating and Optimizing Performance for Parallel Programs - Fahringer - 1995  ACM   DBLP
37   Vienna Fortran Compilation System - Version (context) - Benkner, Andel et al. - 1995



The graph only includes citing articles where the year of publication is known.


Documents on the same site (http://www.vcpc.univie.ac.at/aurora/publications/):   More
Dynamic Load Balancing on Heterogeneous Workstation.. - Hlavacs, Ueberhuber (1998)   (Correct)
Dynamic Asset Allocation under Uncertainty for Pension Fund .. - Pflug, Swietanowski (1998)   (Correct)
Estimating Cache Performance for Sequential and Data Parallel.. - Fahringer (1997)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC