(Enter summary)
Abstract: We present a unified approach to locality optimization that employs both data and control transformations. Data transformations include changing the array layout in memory. Control transformations involve changing the execution order of programs. We have developed new techniques for compiler optimizations for distributed shared-memory machines, although the same techniques can be used for sequential machines with a memory hierarchy. Our compiler optimizations are based on an algebraic... (Update)
Cited by: More
Concurrency And Computation: Practice And Experience - Concurrency Computat Pract
(Correct)
Is Search Really Necessary to Generate High-Performance .. - Yotov, Li, Ren.. (2005)
(Correct)
Localizing Non-affine Array References - Mitchell, Carter, Ferrante (1999)
(Correct)
Similar documents (at the sentence level):
5.1%: Compiler Optimizations for Cache Locality and Coherence - Li (1994)
(Correct)
Active bibliography (related documents): More All
0.7: Unifying Data and Control Transformations for Distributed Shared .. - Cierniak (1994)
(Correct)
0.4: Compiler Cache Optimizations for Banded Matrix Problems - Li (1995)
(Correct)
0.1: Optimization and Parallelization of a Commodity Trade Model.. - Bergmark, Pottle (1994)
(Correct)
Similar documents based on text: More All
0.4: Uniform Execution Times - Ib Mcs Am
(Correct)
0.3: Practicing JUDO: Java™ under Dynamic Optimizations - Cierniak, Lueh, Stichnoth
(Correct)
0.3: Just-in-Time Optimizations for High-PerformanceJava Programs - Cierniak, Li (1997)
(Correct)
Related documents from co-citation: More All
60: A Data Locality Optimizing Algorithm (context) - Wolf, Lam - 1991
49: Strategies for cache and local memory management by global program transformaion.. (context) - Gannon, Jalby et al. - 1988
42: Data and computation transformations for multiprocessors
- Anderson, Amarasinghe et al. - 1995
BibTeX entry: (Update)
M. Cierniak and W. Li. Unifying data and control transformations for distributed shared memory machines. Technical Report TR-542, Department of Computer Science, University of Rochester, November 1994. http://citeseer.ist.psu.edu/cierniak94unifying.html More
@inproceedings{ cierniak95unifying,
author = "Michal Cierniak and Wei Li",
title = "Unifying Data and Control Transformations for Distributed Shared Memory Machines",
booktitle = "{SIGPLAN} Conference on Programming Language Design and Implementation",
pages = "205-217",
year = "1995",
url = "citeseer.ist.psu.edu/cierniak94unifying.html" }
Citations (may not include all citations):
1575
Computer Architecture: A Quantitative Approach (context) - Hennessy, Patterson - 1990
723
Memory coherence in shared virtual memory systems
- Li, Hudak - 1989
474
A data locality optimizing algorithm (context) - Wolf, Lam - 1991
422
Implementation and performance of Munin
- Carter, Bennett et al. - 1991
237
Global optimizations for parallelism and locality on scalabl..
- Anderson, Lam - 1993
216
Strategies for cache and local memory management by global p.. (context) - Gannon, Jalby et al. - 1988
202
Distributed shared memory: A survey of issues and algorithms (context) - Nitzberg, Lo - 1991
157
Data optimization: Allocation of arrays to reduce communicat.. (context) - Knobe, Lukas et al. - 1990
146
Demonstration of automatic data partitioning techniques for ..
- Gupta, Banerjee - 1992
142
Mint: a front end for efficient simulation of shared-memory ..
- Veenstra, Fowler - 1994
137
Compiler optimizations for improving data locality
- Carr, McKinley et al. - 1994
107
Software Methods for Improvement of Cache Performance on Sup.. (context) - Porterfield - 1989
106
Index domain alignment: Minimizing cost of cross-referencing.. (context) - Li, Chen - 1989
99
A singular loop transformation framework based on non-singul..
- Li, Pingali - 1994
93
Automatic array alignment in data-parallel programs (context) - Chatterjee, Gilbert et al. - 1993
87
Compile-time techniques for data distribution in distributed..
- Ramanujam, Sadayappan - 1991
87
The implementation of a coherent memory abstraction on a NUM.. (context) - Cox, Fowler - 1989
70
Simple but effective techniques for NUMA memory management
- Bolosky, Fitzgerald et al. - 1989
69
An interactive environment for data partitioning and distrib.. (context) - Balasundaram, Fox et al. - 1990
69
Access normalization: Loop restructuring for NUMA compilers
- Li, Pingali - 1993
65
Eliminating false sharing (context) - Eggers, Jeremiassen - 1991
61
Experimental comparison of memory management policies for NU..
- LaRowe, Ellis - 1991
57
The detection and elimination of useless misses in multiproc..
- Dubois, Skeppstedt et al. - 1993
46
Cache coherence in large-scale sharedmemory multiprocessors:..
- Lilja - 1993
45
Compiler techniques for data partitioning of sequentially it.. (context) - Hudak, Abraham - 1990
44
A strategy for array management in local memory
- Eisenbeis, Jalby et al. - 1990
39
False sharing and its effect on shared memory performance (context) - Bolosky, Scott - 1993
38
Solving alignment using elementary linear algebra
- Bau, Kodukula et al. - 1994
19
Compiler cache optimizations for banded matrix problems
- Li - 1995
16
Software caching on cache-coherent multiprocessors
- Bianchini, LeBlanc - 1992
6
Spatial price equilibrium models with discriminatory ad valo.. (context) - Nagurney, Nicholson et al. - 1995
4
On estimating and exchange cache effectiveness (context) - Ferrante, Sarkar et al. - 1991
The graph only includes citing articles where the year of publication is known.
Documents on the same site (http://www.cs.rochester.edu/u/cierniak/research/papers.html): More
Optimizing Java Bytecodes - Cierniak, Li (1997)
(Correct)
VM-Based Shared Memory on Low-Latency.. - Kontothanassis.. (1997)
(Correct)
Validity of Interprocedural Data Remapping - Cierniak, Li (1996)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC