Bandwidth-Based Performance Tuning and Prediction (1999)
Chen Ding, Ken Kennedy



View or download:
rice.edu/pub/CRPC...PCTR98742S.ps.gz
rochester.edu/~cding/Docum...pdcs99.pdf

From:  rice.edu/CRPC/softli...TRs_online

Abstract: As the speed gap widens between CPU and memory, memory hierarchy performance has become the bottleneck for most applications. This is due in part to the difficulty of fully utilizing the deep and complex memory hierarchies found on most modern machines. In the past, various tools on performance tuning and prediction have been developed to improve machine utilization. However, these tools are not effective in practice because they either do not consider memory hierarchy or do so with expensive...

Context of citations to this paper:

.... is much simpler than latency based performance tools and is very effective for tuning and predicting performance for large applications[6]. Many architectural studies examined the memory bandwidth constraint. McCalpin [13] used the STREAM benchmark to demonstrate that...

Cited by:
Scientific Computing Research Environments for the.. - Heinkenschloss.. (2001)
Memory Bandwidth Bottleneck and Its Amelioration by a Compiler - Ding, Kennedy (1999)

Similar documents (at the sentence level):
31.2%:   Improving Effective Bandwidth through Compiler Enhancement of.. - Ding (2000)
5.0%:   Memory-Bandwidth Based Performance Tuning and Prediction - Ding, Kennedy (1998)

Active bibliography (related documents):
0.3:   The Memory Bandwidth Bottleneck and its - Amelioration By Compiler
0.1:   Reducing Parallel Overheads Through Dynamic Serialization - Voss, Eigenmann
0.1:   Basic Sparse Matrix Computations on Massively Parallel.. - Ferng, Wu, Petiton, Saad (1993)

Similar documents based on text:
0.2:   Improving Effective Bandwidth through Compiler Enhancement of.. - Ding, Kennedy
0.2:   Proceedings of the IASTED International Conference - Parallel And Distributed
0.2:   A Method and a Genetic Algorithm for Deriving.. - El-Fakih, Yamaguchi.. (1999)

Related documents from co-citation:
3:   Improving effective bandwidth through compiler enhancement of global and dynamic.. - Ding
2:   Improving Memory Hierarchy Performance for Irregular Applications - Mellor-Crummey, Whalley et al. - 1999
2:   Improving cache performance in dynamic applications through data and computation.. - Ding, Kennedy - 1999

BibTeX entry:

C. Ding and K. Kennedy. Bandwidth-based performance tuning and prediction. In Proceedings of IASTED International Conference on Parallel Computing and Distributed Systems, November 1999. http://citeseer.ist.psu.edu/ding99bandwidthbased.html

@inproceedings{ding99bandwidthbased,
  author = "C. Ding and K. Kennedy",
  title = "Bandwidth-based performance tuning and prediction",
  booktitle = "Proceedings of IASTED International Conference on Parallel Computing
    and Distributed Systems",
  month = nov,
  year = "1999",
  url = "citeseer.ist.psu.edu/ding99bandwidthbased.html"
}

Citations (may not include all citations):
159   A static performance estimator to guide data partitioning de.. - Balasundaram, Fox et al. - 1991
149   An implementation of interprocedural bounded regular section.. - Havlak, Kennedy - 1991
82   On estimating and enhancing cache effectiveness - Ferrante, Sarkar et al. - 1991
69   Estimating interlock and improving balance for pipelined mac.. - Callahan, Cocke et al. - 1988
57   Improving cache performance in dynamic applications through .. - Ding, Kennedy - 1999
50   Mtool: An Integrated System for Performance Debugging Shared.. - Goldberg, Hennessy - 1993
35   Analytical Performance Prediction on Multicomputers - Clement, Quinn - 1993
23   Sustainable memory bandwidth in current high performance com.. - McCalpin - 1995
17   Memory bandwidth bottleneck and its amelioration by a compil.. - Ding, Kennedy - 1999
13   Inter-array data regrouping - Ding, Kennedy - 1999
11   Analyzing and visualizing performance of memory hierarchies - Callahan, Kennedy et al. - 1990
10   Compiler Support for Software Prefetching - McIntosh - 1997
7   Performance Prediction for Parallel Numerical Algorithms - Gallivan, Jalby et al. - 1991
4   Technical Report ut-cs - Mucci, London et al. - 1998
