(Enter summary)
Abstract: As the speed gap widens between CPU and memory, memory
hierarchy performance has become the bottleneck for
most applications. This is due in part to the difficulty of
fully utilizing the deep and complex memory hierarchies
found on most modern machines. In the past, various tools
on performance tuning and prediction have been developed
to improve machine utilization. However, these tools are
not effective in practice because they either do not consider
memory hierarchy or do so with expensive... (Update)
Context of citations to this paper: More
.... is much simpler than latency based performance tools and is very effective for tuning and predicting performance for large applications[6]. Many architectural studies examined the memory bandwidth constraint. McCalpin [13] used the STREAM benchmark to demonstrate that...
Cited by: More
Scientific Computing Research Environments for the.. - Heinkenschloss.. (2001)
(Correct)
Memory Bandwidth Bottleneck and Its Amelioration by a Compiler - Ding, Kennedy (1999)
(Correct)
Similar documents (at the sentence level):
31.2%: Improving Effective Bandwidth through Compiler Enhancement of.. - Ding (2000)
(Correct)
5.0%: Memory-Bandwidth Based Performance Tuning and Prediction - Ding, Kennedy (1998)
(Correct)
Active bibliography (related documents): More All
0.3: The Memory Bandwidth Bottleneck and its - Amelioration By Compiler
(Correct)
0.1: Reducing Parallel Overheads Through Dynamic Serialization - Voss, Eigenmann
(Correct)
0.1: Basic Sparse Matrix Computations on Massively Parallel.. - Ferng, Wu, Petiton, Saad (1993)
(Correct)
Similar documents based on text: More All
0.2: Improving Effective Bandwidth through Compiler Enhancement of.. - Ding, Kennedy
(Correct)
0.2: Proceedings of the IASTED International Conference - Parallel And Distributed
(Correct)
0.2: A Method and a Genetic Algorithm for Deriving.. - El-Fakih, Yamaguchi.. (1999)
(Correct)
Related documents from co-citation: More All
3: Improving effective bandwidth through compiler enhancement of global and dynamic..
- Ding
2: Improving Memory Hierarchy Performance for Irregular Applications (context) - Mellor-Crummey, Whalley et al. - 1999
2: Improving cache performance in dynamic applications through data and computation..
- Ding, Kennedy - 1999
BibTeX entry: (Update)
C. Ding and K. Kennedy. Bandwidth-based performance tuning and prediction. In Proceedings of IASTED International Conference on Parallel Computing and Distributed Systems, November 1999. http://citeseer.ist.psu.edu/ding99bandwidthbased.html More
@misc{ ding99bandwidthbased,
author = "C. Ding and K. Kennedy",
title = "Bandwidth-based performance tuning and prediction",
text = "C. Ding and K. Kennedy. Bandwidth-based performance tuning and prediction.
In Proceedings of IASTED International Conference on Parallel Computing
and Distributed Systems, November 1999.",
year = "1999",
url = "citeseer.ist.psu.edu/ding99bandwidthbased.html" }
Citations (may not include all citations):
159
A static performance estimator to guide data partitioning de.. (context) - Balasundaram, Fox et al. - 1991
149
An implementation of interprocedural bounded regular section..
- Havlak, Kennedy - 1991
82
On estimating and enhancing cache effectiveness (context) - Ferrante, Sarkar et al. - 1991
69
Estimating interlock and improving balance for pipelined mac..
- Callahan, Cocke et al. - 1988
57
Improving cache performance in dynamic applications through ..
- Ding, Kennedy - 1999
50
Mtool: An Integrated System for Performance Debugging Shared.. (context) - Goldberg, Hennessy - 1993
35
Analytical Performance Prediction on Multicomputers
- Clement, Quinn - 1993
23
Sustainable memory bandwidth in current high performance com.. (context) - McCalpin - 1995
17
Memory bandwidth bottleneck and its amelioration by a compil..
- Ding, Kennedy - 1999
13
Inter-array data regrouping
- Ding, Kennedy - 1999
11
Analyzing and visualizing performance of memory hierarchies (context) - Callahan, Kennedy et al. - 1990
10
Compiler Support for Software Prefetching
- McIntosh - 1997
7
Performance Prediction for Parallel Numerical Algorithms (context) - Gallivan, Jalby et al. - 1991
4
Technical Report ut-cs (context) - Mucci, London et al. - 1998
Documents on the same site (http://www.crpc.rice.edu/CRPC/softlib/TRs_online.html): More
Dispersion-Induced Chromatographic Waves - Steven Bryant Clint
(Correct)
State Discovery: The Power of Ping - Van Wyck (1999)
(Correct)
Competition between Chemical and Physical Processes in.. - Wang, Wheeler, Bryant (1999)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC