(Enter summary)
Abstract: This paper focus on cost/effectiveness of these two
prefetching techniques in the field scientific of computing.
Typically theses codes exhibit a high utilization of vectors
and data streams with large data set. At first glance they
are ideally targeted for prefetching, but some performance
variations are induced by cache hierarchy, spatial locality
or bandwidth limitation (Update)
Context of citations to this paper: More
.... schemes bring in a non negligible instruction execution overhead [Chen and Baer, 1994] Benchmarking results on commercial systems [Acquaviva, 1999] seem to indicate that the overheads of software prefetching may actually sometimes hinder rather then aid performance, and, while...
Cited by: More
A Survey of prefetching techniques - Oren (2000)
(Correct)
Active bibliography (related documents): More All
0.5: Performance Evaluation of FFT Algorithms Using.. - Auer, Franchetti.. (1998)
(Correct)
0.3: Comparing the Performance of MPICH with Cray's MPI and - With Sgi's Mpi
(Correct)
0.3: Comparing the Performance of MPICH with Cray's MPI and with.. - Luecke, Ju, Kraeva
(Correct)
Similar documents based on text: More All
0.3: Integrating Fine-Grained Message Passing In Cache Coherent.. - Poulsen, Yew (1996)
(Correct)
0.3: Second-level Cache Organization for Data Prefetching - Kim, Veidenbaum
(Correct)
0.3: Maintaining Cache Coherence through Compiler-Directed Data.. - Lim, Yew (1998)
(Correct)
BibTeX entry: (Update)
Acquaviva, J. (1999). Data prefetching efficiency on two commercial systems. In Proceedings of the fifth European SGI/Cray MPP Workshop. http://citeseer.ist.psu.edu/acquaviva99data.html More
@misc{ acquaviva99data,
author = "J. Acquaviva",
title = "Data prefetching efficiency on two commercial systems",
text = "Acquaviva, J. (1999). Data prefetching efficiency on two commercial systems.
In Proceedings of the fifth European SGI/Cray MPP Workshop.",
year = "1999",
url = "citeseer.ist.psu.edu/acquaviva99data.html" }
Citations (may not include all citations):
249
Tolerating latency through software-controlled data prefetch..
- Mowry - 1994
222
The SGI Origin: A ccNUMA highly scalable server (context) - Laudon, Lenoski - 1997
136
superscalar microprocessor (context) - Yeager - 1996
106
Microprocessor User's Manual (context) - Technologies, MIPS - 1996
98
Evaluating stream buffers as secondary cache replacement (context) - Palacharla, Kessler - 1994
59
Performance Analysis Using the MIPS R10000 Performance Count..
- Zagha, Larson et al. - 1996
41
and Allan Porterfield (context) - Callahan, Kennedy - 1991
23
Sustainable memory bandwidth in current high performance com.. (context) - McCalpin - 1995
10
AlphaServer 4100 performance characterization
- Cvetanovic, Donaldson - 1996
8
Benchmarker's guide to single processor optimization Cray TE.. (context) - Anderson, Brooks et al. - 1997
8
CPU cache prefetching: timing evaluation of hardware impleme.. (context) - Tse, Smith - 1998
5
Alpha AXP Architecture Handbook (context) - Corporation - 1994
4
Origin servers (context) - Computer - 1997
3
Cray TE programming with coherent memory stream (context) - Research, programming et al. - 1996
2
Definition of MIPS R10000 Performance Counter (context) - technologies - 1997
1
MIPS IV instruction set (context) - Inc - 1995
1
Cray TE Optimization (context) - Research, Optimization - 1997
1
Conf'erence Simulation et T'eraflops (context) - Normand, Cray - 1995
1
Superscalar intruction execution in the 21164 Alpha micropro.. (context) - Edmondson, Rubinfeld et al. - 1995
1
Tango: a hardware-based prefetching technique for superscala.. (context) - Pinter, Yoaz - 1996
1
Livermore loops single-node code optimization for the Cray T.. (context) - Kessler - 1994
1
the use and performance of the E-registers in the Cray-T3E. .. (context) - Kessler - 1994
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC