|
676
|
A data locality optimizing algorithm
– Wolf, Lam
- 1991
|
|
537
|
Cache Memories
– Smith
- 1982
|
|
487
|
The cache performance and optimizations of blocked algorithms
– LAM, ROTHBERG, et al.
- 1991
|
|
361
|
A Loop Transformation Theory and an Algorithm to Maximize Parallelism
– Wolf, Lam
- 1991
|
|
296
|
Advanced compiler optimizations for supercomputers
– Padua, Wolfe
- 1986
|
|
293
|
Automatic Translation of FORTRAN Programs to Vector Form
– Allen, Kennedy
- 1987
|
|
251
|
Strategies for cache and local memory management by global program transformation
– Gannon, Jalby, et al.
- 1988
|
|
218
|
Dependence graphs and compiler optimizations
– Kuck, Kuhn, et al.
- 1981
|
|
200
|
Improving register allocation for subscripted variables
– Callahan, Carr, et al.
- 1990
|
|
172
|
Unimodular transformations of double loops
– Banerjee
- 1990
|
|
169
|
Scanning polyhedra with DO loops
– Ancourt, Irigoin
- 1991
|
|
74
|
Eliminating false data dependence using the omega test
– Pugh, Wonnacott
- 1992
|
|
55
|
Memory-hierarchy management
– Carr
- 1992
|
|
49
|
High-speed multiprocessors and compilation techniques
– Padua, Kuck, et al.
- 1980
|
|
47
|
On the problem of optimizing data transfers for complex memory systems
– Gallivan, Jalby, et al.
- 1988
|
|
33
|
Skewed-associative caches
– Seznec, Bodin
|
|
29
|
A quantitative algorithm for data locality optimization
– Bodin, Eisenbeis, et al.
- 1992
|
|
28
|
The use of BLAS3 in linear algebra on a parallel processor with a hierarchical memory
– Gallivan, Jalby, et al.
- 1987
|
|
26
|
Programmation mathe’matique, the’orie et algorithmes, tomes 1 et 2. Dunod
– Minoux
- 1983
|
|
24
|
Unified management of registers and cache using liveness and cache bypass
– Chi, Dietz
- 1989
|
|
23
|
A Case for Two-way Skewed Associative caches
– Seznec
- 1993
|
|
20
|
Program and Data Transformations for Efficient Execution on Distributed Memory Architectures
– O'Boyle
- 1992
|
|
13
|
A Strategy for Array Management
– Eisenbeis, Jalby, et al.
- 1991
|
|
11
|
An Integrated Hardware/Software Solution for Effective Management of Local Storage in High-Performance Systems
– Granston, Veidenbaum
- 1991
|
|
11
|
An algorithm to generate sequential and parallel code with improved data locality
– Wolf, Lain
- 1989
|
|
9
|
Compiler management of program locality
– Porterfield
- 1988
|
|
7
|
Interprocedural analysis and parallelization
– Burke, Cytron
- 1986
|
|
7
|
On Efficiently Characterizing the Solutions of Linear Diophantine Equations and Its Application to Data Dependence Analysis
– Eisenbeis, Temam, et al.
- 1992
|
|
7
|
Jalby "To Copy or Not to Copy : A Compile-Time Technique for Assessing When Data Copying Should be Used to Eliminate Cache Conflicts
– Temam, Granston, et al.
- 1993
|
|
7
|
Skewed associative caches
– Seznec, Bodin
- 1993
|
|
6
|
Sivaramakrishnan "Randomization and Associativity in the Design of Placement-Insensitive Caches
– Schlansker, Shaw, et al.
- 1993
|
|
5
|
Sekhar Sarukaiand Srivinas Narayana, Neelakantan Sundaresan, Daya Atapattu, and Francois Bodin. Sigma ii: a tool kit for building parallelizing compilers and performance analysis systems
– Gannon, Lee, et al.
- 1992
|
|
5
|
Optimisation de la localit'e de donn'ees et du parall'elisme `a grain fin dans les architectures hautes performances", th`ese, universit'e de Rennes 1
– Windheiser
- 1992
|
|
4
|
On the performance enhancement of paging system through program analysis and transformations
– W, Lawrie
- 1981
|
|
4
|
Automatic and Interactive Parallelization
– McKindley
- 1992
|
|
4
|
Evaluating the impact of cache interference on numerical codes
– Temam, Fricker, et al.
- 1993
|
|
3
|
Theory of Linear and Integer Programming", John Willey and sons
– Schrijver
- 1986
|
|
3
|
Optimizing Supercompilers for Supercomputers
– MJ
- 1982
|
|
2
|
Generalized unimodular loop transformations for distributed memory multiprocessors
– D, Basu
- 1991
|
|
1
|
Gotwals Suresh Srinivas, "Sage++: A Class Library for Building Fortran and C++ Restructuring Tools," To appear Object Oriented Numerics
– Gannon, Bodin, et al.
- 1994
|
|
1
|
Evaluating Associativity in CPU
– Hill
- 1989
|
|
1
|
Fechant C., "Implementing a two dimensionnal pore-scale flow model on different parallel machines", To appear
– Bernard, Bodin, et al.
- 1994
|