Abstract: This paper presents a performance modeling methodology
that is faster than traditional cycle-accurate simulation,
more sophisticated than performance estimation based on
system peak-performance metrics, and is shown to be
effective on a class of High Performance Computing
benchmarks. The method yields insight into the factors
that affect performance on single-processor and parallel
computers.
Context of citations to this paper:
...Aggregate counts are frequently used in performance modeling to parameterize the models. For example, the methodology described in [12] generates a machine signature, which is a characterization of the rate at which a machine carries out fundamental operations...
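As a rough illustration of the idea in the abstract and the citation context above, the sketch below combines a machine signature (rates for fundamental operations) with an application profile (counts of those operations) to estimate a runtime. It is a deliberately simplified assumption, not the paper's actual procedure: the function name `predict_runtime`, the operation categories, and all numbers are hypothetical, and the model assumes that operation costs simply add, with no overlap between them.

```python
# Simplified, hypothetical sketch: names and numbers are illustrative
# assumptions and are not taken from the paper.

def predict_runtime(machine_signature, application_profile):
    """Estimate runtime in seconds as the sum of count / rate per operation type.

    machine_signature: operation type -> rate (operations per second)
    application_profile: operation type -> number of operations performed
    """
    total_seconds = 0.0
    for op, count in application_profile.items():
        rate = machine_signature[op]  # KeyError if the machine lacks this rate
        total_seconds += count / rate
    return total_seconds


# Hypothetical machine: 1e9 flops/s and 2e8 memory loads/s.
machine_signature = {"flop": 1.0e9, "mem_load": 2.0e8}

# Hypothetical application: 4e9 flops and 1e9 memory loads.
application_profile = {"flop": 4.0e9, "mem_load": 1.0e9}

predicted = predict_runtime(machine_signature, application_profile)
print(f"predicted runtime: {predicted:.1f} s")
```

Keeping the two inputs separate means the same application profile can be paired with signatures from different machines, which is the appeal of this style of model over rerunning a full simulation for each one.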
Cited by:
Performance Modeling and Analysis of Cache Blocking.. - Nishtala, Vuduc.. (2004)
Cross-Architecture Performance Predictions for Scientific.. - Marin, Mellor-Crummey (2004)
When Cache Blocking of Sparse Matrix Vector Multiply.. - Nishtala, Vuduc..
Active bibliography (related documents):
0.4: Scaling the Unscalable: A Case Study on the AlphaServer SC - Worley (2002)
0.3: A Comparative Study of Online Scheduling Algorithms .. - Arndt, Freisleben, .. (1998)
0.2: Communication Characteristics of Large-Scale Scientific.. - Vetter, Mueller (2002)
Similar documents based on text:
0.8: Security Models for NARA Electronic Record Management - Schroeder, Perrine (2001)
0.7: Evaluation of a Multithreaded Architecture for.. - Pfeiffer, Carter.. (1999)
0.7: Npaci Contacts - Leadership Team Thrust (1999)
Related documents from co-citation:
3: Modeling and improving locality for irregular problems: sparse matrix-vector pro.. - Heras, Perez et al. - 1999
3: Memory hierarchy performance prediction for sparse blocked algorithms - Fraguela, Doallo et al. - 1999
3: Optimizing the Performance of Sparse Matrix-Vector Multiply - Im - 2000
BibTeX entry:
A. Snavely, L. Carrington, and N. Wolter, "Modeling Application Performance by Convolving Machine Signatures with Application Profiles," Proc. IEEE Workshop on Workload Characterization, 2001. http://citeseer.ist.psu.edu/snavely01modeling.html
@misc{ snavely01modeling,
author = "A. Snavely and L. Carrington and N. Wolter",
title = "Modeling Application Performance by Convolving Machine Signatures with
Application Profiles",
text = "A. Snavely, L. Carrington, and N. Wolter, Modeling Application Performance
by Convolving Machine Signatures with Application Profiles, Proc. IEEE Workshop
on Workload Characterization, 2001.",
year = "2001",
url = "citeseer.ist.psu.edu/snavely01modeling.html" }
Citations (may not include all citations):
58: Cache Miss Equations: A Compiler Framework for Analyzing an.. - Ghosh, Martonosi et al. - 1999
52: Converting Thread-Level Parallelism to Instruction-Level Par.. - Lo, Egger et al. - 1997
46: An API for Runtime Code Patching - Buck, Hollingsworth - 2000
25: SvPablo: A Multi-Language Performance Analysis System - DeRose, Zhang et al. - 1998
14: Prediction and Adaptation in Active Harmony - Hollingsworth, Keleher - 1998
14: Integrated Compilation and Scalability Analysis for Parallel.. - Mendes, Reed - 1998
12: Performance Assertion Checking - Perl, Weihl - 1993
10: Toward Realistic Performance Bounds for Implicit CFD Codes - Gropp, Kaushik et al. - 1999
9: FLASH vs. (Simulated) FLASH: Closing the Simulation Loop - Gibson, Kunz et al. - 2000
9: Caches as Filters: A Framework for the Analysis of Caching S.. - Weikle, McKee et al. - 2000
6: Performance Analysis of Distributed Applications Using Autom.. - Vetter - 2000
6: Performance of Parallel Computers for Spectral Atmospheric M.. - Foster, Toonen et al.
4: Performance Evaluation of the IBM SP and the Compaq AlphaSer.. - Worley
3: Conventional Benchmarks as a Sample of the Performance Spect.. - Gustafson, Todi - 1998
1: Exact Analysis of Cache Misses in Nested Loops - Chatterjee, Parker et al.
1: Impact of Communication Protocol on Performance - Worley
1: Resource-Aware Meta-Computing - Hollingsworth, Keleher et al. - 2000
1: Pablo: A Multilanguage, Architecture-Independent Performance.. - DeRose, Reed - 1999
http://www.cepba.upc.es/tools_i.html
http://www.cs.virginia.edu/stream/
http://science.nas.nasa.gov/Software/NPB
Documents on the same site (http://www.sdsc.edu/~allans/papers.html):
Performance and Programming Experience on the Tera MTA - Larry Carter (1999)
Evaluation of a Multithreaded Architecture for.. - Pfeiffer, Carter.. (1999)
Explorations in Symbiosis on two Multithreaded.. - Snavely, Mitchell.. (1999)