(Enter summary)
Abstract: this paper focuses on
optimization techniques for enhancing cache performance (Update)
Cited by: More
Cache Performance Optimizations for Parallel Lattice.. - Wilke, Pohl..
(Correct)
Active bibliography (related documents): More All
1.3: Data Locality Optimizations for Multigrid Methods on Structured.. - Weiß
(Correct)
0.5: Unknown - Apparently There Is
(Correct)
0.4: Cache-aware Multigrid Methods for Solving Poisson's.. - Weiss, Kowarschik.. (1999)
(Correct)
Similar documents based on text: More All
0.4: Fixed and Adaptive Cache Aware Algorithms for.. - Douglas, Hu, Karl.. (2000)
(Correct)
0.4: Cache Optimization for Structured and Unstructured.. - Douglas, Hu.. (1999)
(Correct)
0.3: Memory Characteristics of Iterative Methods - Weiss, Karl, Kowarschik, Rüde (1999)
(Correct)
BibTeX entry: (Update)
M. Kowarschik and C. Wei. An Overview of Cache Optimization Techniques and Cache{Aware Numerical Algorithms. In Algorithms for Memory Hierarchies, volume 2625 of LNCS. Springer, 2003. http://citeseer.ist.psu.edu/kowarschik03overview.html More
@misc{ kowarschik03overview,
author = "M. Kowarschik and C. Wei",
title = "An Overview of Cache Optimization Techniques and Cache{Aware Numerical
Algorithms",
text = "M. Kowarschik and C. Wei. An Overview of Cache Optimization Techniques
and Cache{Aware Numerical Algorithms. In Algorithms for Memory Hierarchies,
volume 2625 of LNCS. Springer, 2003.",
year = "2003",
url = "citeseer.ist.psu.edu/kowarschik03overview.html" }
Citations (may not include all citations):
2441
John Hopkins University Press (context) - Golub, Van Loan - 1998
1575
Computer Architecture: A Quantitative Approach (context) - Hennessy, Patterson - 1996
532
LAPACK Users' Guide (context) - Anderson, Bai et al. - 1999
474
A Data Locality Optimizing Algorithm (context) - Wolf, Lam - 1991
408
Multigrid Methods and Applications (context) - Hackbusch - 1985
387
A Set of Level 3 Basic Linear Algebra Subprograms (context) - Dongarra, Croz et al. - 1990
376
The Cache Performance and Optimizations of Blocked Algorithm.. (context) - Lam, Rothberg et al. - 1991
372
Matrix Iterative Analysis (context) - Varga - 1962
297
A Multigrid Tutorial (context) - Briggs, Henson et al. - 2000
296
Free Software Foundation (context) - Fenlason, Stallman - 1998
249
Tolerating Latency Through Software{Controlled Data Prefetch..
- Mowry - 1994
234
Accuracy and Stability of Numerical Algorithms (context) - Higham - 2002
234
Cache Memories (context) - Smith - 1982
230
Compiler Transformations for High{ Performance Computing
- Bacon, Graham et al. - 1994
157
Automatically Tuned Linear Algebra Software
- Whaley, Dongarra - 1998
124
FFTW: An Adaptive Software Architecture for the FFT
- Frigo, Johnson - 1998
123
Optimizing Matrix Multiply using PHiPAC: A Portable
- Bilmes, Asanovic et al. - 1997
109
Advanced Compiler Design & Implementation (context) - Muchnick - 1997
108
Iterative Solution of Large Sparse Systems of Equations (context) - Hackbusch - 1993
106
Unifying Data and Control Transformations for Distributed Sh..
- Cierniak, Li - 1995
87
Complete Computer System Simulation: The SimOS Approach
- Rosenblum, Herrod et al. - 1995
82
To Copy or Not to Copy: A Compile{ Time Technique for Assess..
- Temam, Granston et al. - 1993
81
Reducing False Sharing on Shared Memory Multiprocessors thro..
- Jeremiassen, Eggers - 1995
77
Cache Miss Equations: An Analytical Representation of Cache ..
- Ghosh, Martonosi et al. - 1997
48
New Tiling Techniques to Improve Cache Temporal Locality
- Song, Li - 1999
48
Optimizing Compilers for Modern Architectures (context) - Allen, Kennedy - 2001
43
ATOM: A Flexible Interface for Building High Performance Pro.. (context) - Eustace, Srivastava - 1995
38
Locality of Reference in LU Decomposition with Partial Pivot..
- Toledo - 1997
34
The Cache Memory Book (context) - Handy - 1998
32
Cache Optimization for Structured and Unstructured Grid Mult..
- Douglas, Hu et al. - 2000
29
Shared Data Placement Optimizations to Reduce Multiprocessor.. (context) - Torrellas, Lam et al. - 1990
28
Caching in With Multigrid Algorithms: Problems in Two Dimens..
- Douglas - 1996
28
Analytical Modeling of Set{Associative Cache Behavior
- Harper, Kerbyson et al. - 1999
28
A Portable Programming Interface for Performance Evaluation ..
- Browne, Dongarra et al. - 2000
23
the Complexity of Loop Fusion
- Darte - 1999
14
Data Transformations for Eliminating Con ict Misses (context) - Rivera, Tseng - 1998
13
Continuous Pro ling: Where Have All the Cycles Gone (context) - Anderson, Berc et al. - 1997
12
ective Hardware Based Data Prefetching for High{ Performance.. (context) - Chen, Baer - 1995
11
Memory Characteristics of Iterative Methods
- Wei, Karl et al. - 1999
11
On Estimating and Enhancing Cache Eectiveness (context) - Ferrante, Sarkar et al. - 1991
8
Investigating Optimal Local Memory Performance
- Temam - 1998
6
Block Algorithms for Sparse Matrix Computations on High Perf.. (context) - Navarro, Garcia-Diego et al. - 1996
6
High Performance Parallel Implicit CFD
- Gropp, Kaushik et al. - 2001
5
Performance Optimization of Numerically Intensive Codes (context) - Goedecker, Hoisie - 2001
5
Temporal Locality Optimizations for Stencil Operations withi.. (context) - Bassetti, Davis et al. - 1998
5
ectiveness of Dynamic Caching for General{Purpose Microproce.. (context) - Burger, Goodman et al. - 1995
3
Tiling Optimizations for 3D Scienti c Computations (context) - Rivera, Tseng - 2000
3
The Performance Counter Library: A Common Interface to Acces.. (context) - Berrendorf, Mohr - 2000
3
Fully Adaptive Multigrid Methods
- ude - 1993
3
Ecient Simulation of Caches under Optimal Replacement with A.. (context) - Sugumar, Abraham - 1993
2
Data Prefetching Mechanisms (context) - Vanderwiel, Lilja - 2000
2
Ecient Memory Programming (context) - Loshin - 1998
1
Adaptive Multilevel Iteration (context) - otzbeyer, ude - 1997
1
Oriented Framework for Solving Partial Dierential Equations.. (context) - Brown, Henshaw et al. - 1998
1
Data Layout Optimizations for Variable Coecient Multigrid (context) - Kowarschik, Wei et al. - 2002
1
Multigrid on Hierarchical Memory Architectures (context) - Kowarschik, ude et al. - 2002
1
Perfomance Compilers for Parallel Computing (context) - Wolfe - 1996
1
Ecient Multigrid Algorithms (context) - Sellappa, Chatterjee - 2001
1
gridlib: Flexible and Ecient Grid Management for Simulation .. (context) - ulsemann, Kipfer et al. - 2002
1
Nested Loop Nests (context) - Ahmed, Mateev et al. - 2000
1
Data Locality Optimizations for Multigrid Methods on Structu.. (context) - Wei - 2001
1
the Realistic Performance of Linear Algebra Components in It..
- Altieri, Becker et al. - 1998
1
A Recursive Formulation of the Inversion of Symmetric Positi.. (context) - Andersen, Gunnels et al. - 2002
1
Centric Multi{Level Blocking (context) - Kodukula, Ahmed et al. - 1997
Documents on the same site (http://www10.informatik.uni-erlangen.de/Research/Projects/DiME/publications.html): More
Data Locality Optimizations for Multigrid Methods on Structured.. - Weiß
(Correct)
Cache Performance Optimizations for Parallel Lattice.. - Wilke, Pohl..
(Correct)
DiMEPACK - A Cache-Optimized Multigrid Library - Kowarschik, Weiß (2001)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC