Abstract: This paper introduces an analytical model that enables automatic estimation of
the cache performance for both sequential and data parallel Fortran programs. The
estimation is based on a classification of array accesses with respect to cache reuse at
the source code level. An estimated upper bound of the number of distinct cache lines
accessed inside of a loop is statically computed. Based on this estimate the number of
cache misses for loops, procedures and the entire program can be predicted....
Context of citations to this paper:
...there is enough computation contained in a loop, thus parallelizing the loop may be effective. • Cache misses: It is well known [27, 81, 55, 52, 33] that inefficient memory access patterns and data mapping into the memory hierarchy (data locality problem) of a single processor...
...and loop nests $L \in L_E$, is defined as $ct_E(E) = \sum_{s \in S_E} ct_S(s) + \sum_{l \in L_E} ct_L(l)$. 4.4 Number of Cache Misses: It is well known [18, 45, 36, 33, 24] that inefficient memory access patterns and data mapping into the memory hierarchy (data locality problem) of a single processor cause...
Cited by:
P³T+: A Performance Estimator for Distributed and.. - Fahringer, Pozgaj (1999)
P³T+: A Performance Estimator for Distributed and.. - Fahringer, Pozgaj (2001)
Estimating Cache Misses and Locality Using Stack Distances - Cascaval, Padua (2003)
Similar documents (at the sentence level):
34.4%: P³T+: A Performance Estimator for Distributed and Parallel.. - Pozgaj, Fahringer (2000)
5.0%: P³T: An Automatic Performance Estimator for Parallel Programs - Fahringer (1995)
Active bibliography (related documents):
0.0: Estimating Cache Performance for Sequential and Data Parallel.. - Fahringer (1997)
0.0: A Unified Symbolic Evaluation Framework for Parallelizing.. - Fahringer, Scholz (1999)
0.0: Efficient Symbolic Analysis for Parallelizing Compilers and.. - Fahringer (1998)
Similar documents based on text:
0.2: Estimating and Optimizing Performance for Parallel Programs - Fahringer (1995)
0.2: Implications of Memory Mappings on Cache Misses - Genius, Eisenbiegler (1998)
0.1: Integrated Range Comparison for Data-Parallel Compilation.. - Sun, Pantano, Fahringer (1997)
Related documents from co-citation:
6: VFC: The Vienna Fortran Compiler - Benkner - 1998
6: On estimating and enhancing cache effectiveness - Ferrante, Sarkar et al. - 1991
6: A Data Locality Optimizing Algorithm - Wolf, Lam - 1991
BibTeX entry:
T. Fahringer. Estimating cache performance for sequential and data parallel programs. Technical Report TR 97-9, Institute for Software Technology and Parallel Systems, Univ. of Vienna, Vienna, Austria, October 1997. http://citeseer.ist.psu.edu/article/fahringer97estimating.html
@inproceedings{ fahringer97estimating,
author = "Thomas Fahringer",
title = "Estimating Cache Performance for Sequential and Data Parallel Programs",
booktitle = "{HPCN} Europe",
pages = "840-849",
year = "1997",
url = "citeseer.ist.psu.edu/article/fahringer97estimating.html" }
Citations (may not include all citations):
474: A data locality optimizing algorithm - Wolf, Lam - 1991
94: Optimizing for Parallelism and Data Locality - Kennedy, McKinley - 1992
82: On estimating and enhancing cache effectiveness - Ferrante, Sarkar et al. - 1991
70: Automatic Performance Prediction of Parallel Programs - Fahringer - 1996
39: Estimating and Optimizing Performance for Parallel Programs - Fahringer - 1995
37: Vienna Fortran Compilation System - Version - Benkner, Andel et al. - 1995
Documents on the same site (http://www.par.univie.ac.at/publications/1996-1997/tr96-97_short.html):
Techniques and Optimizations for Developing Irregular.. - Peter Brezany (1996)
High-Level Management of Communication Schedules.. - Benkner.. (1997)
Distribution Assignment Placement: A New Aggressive Approach.. - Knoop, Mehofer (1997)