107 citations found.
A. K. Porterfield, "Software methods for improvement of cache performance on supercomputer applications," Ph.D. dissertation, Department of Computer Science, Rice University, Technical Report Rice COMP TR88-93, May 1989. 156

This paper is cited by the following papers:

Data Locality Optimizations for Multigrid Methods on Structured.. - Weiß
An Automated Method for Software Controlled Cache Prefetching - Zucker, Lee, Flynn (1998)   (7 citations)
Impact of Memory Hierarchy on Program Partitioning and.. - Kaplow, Maniatty.. (1995)
A Selective Hardware/Compiler Approach for Improving.. - Memik, Kandemir..
Architecture And Arithmetic For Multimedia Enhanced Processors - Zucker (1997)   (5 citations)
Comparative Evaluation of Latency Reducing and.. - Gupta, Hennessy.. (1991)   (103 citations)
Design and Evaluation of a Compiler Algorithm for Prefetching - Mowry, Lam, Gupta (1992)   (320 citations)
Improving Memory Hierarchy Performance for Irregular .. - Mellor-Crummey.. (2001)   (6 citations)
Comparing and Combining Read Miss Clustering and Software.. - Pai, Adve (2001)   (1 citation)
Automatic Compiler-Inserted I/O Prefetching for.. - Mowry, Demke, Krieger (1996)   (43 citations)
Hardware and Software Cache Prefetching Techniques for MPEG .. - Zucker, Lee, Flynn (2000)   (1 citation)
A Matrix-Based Approach to Global Locality Optimization - Kandemir, Choudhary.. (1999)   (16 citations)
Fusion of Loops for Parallelism and Locality - Manjikian, Abdelrahman (1995)   (12 citations)
A Comparison of Hardware Prefetching Techniques For.. - Zucker, Flynn, Lee (1995)   (12 citations)
Maximizing Memory Bandwidth for Streamed Computations - McKee (1995)   (7 citations)
An Algebraic Approach to Cache Memory Characterization.. - Kumar, Huang, Sadayappan (1994)   (4 citations)
Masking Memory Access Latency with a Compiler-Assisted Data.. - VanderWiel (1998)
P³T+: A Performance Estimator for Distributed and Parallel.. - Pozgaj, Fahringer (2000)
A Survey of Data Prefetching Techniques - VanderWiel, Lilja (1996)
Cache Miss Equations: A Compiler Framework for Analyzing.. - Ghosh, Martonosi, Malik (1998)   (57 citations)
Accurate Data Distribution Into Blocks May Boost Cache.. - Truong, Bodin, Seznec (1997)   (1 citation)
Architectural And Software Support For Executing Numerical.. - Anik (1993)   (6 citations)
Improving Effective Bandwidth through Compiler Enhancement of.. - Ding, Kennedy   (10 citations)
Design and Performance of Multithreaded Architectures - Thekkath (1995)
Cache Profiling and the SPEC Benchmarks: A Case Study - Lebeck, Wood (1994)   (103 citations)
Run-time Spatial Locality Detection and Optimization - Johnson, Merten, Hwu (1997)   (16 citations)
Run-time Cache Hierarchy Management via Reference Analysis - Johnson, Hwu (1996)   (3 citations)
Branch Prediction, Instruction-Window Size, and.. - Skadron, Ahuja.. (1999)   (4 citations)
IPU/LTB: A Method for Reducing Effective Memory Latency - Jr., Appelbe, Das (1997)
Blocking Linear Algebra Codes For Memory Hierarchies - Carr, Kennedy (1989)   (23 citations)
Efficient Polynomial-Time Nested Loop Fusion with Full.. - Sha, O'Neil, Passos (1999)
Improving Register Allocation for Subscripted Variables - Callahan, Carr, Kennedy (1990)   (120 citations)
A Compiler-Blockable Algorithm for QR Decomposition - Carr, Lehoucq (1995)   (6 citations)
Impact of Memory Hierarchy on Program Partitioning and.. - Wesley Kaplow William (1995)
Characterizing and Removing Branch Mispredictions - Skadron (1999)
Maintaining Cache Coherence through Compiler-Directed Data.. - Lim, Yew (1998)
Data Prefetch Mechanisms - VanderWiel, Lilja   (19 citations)
Fusion of Loops for Parallelism and Locality - Naraig Manjikian (1995)   (12 citations)
Performance Characterization of Optimizing Compilers - Saavedra, Smith (1992)   (6 citations)
Memory Latency Reduction via Data Prefetching and Data Forwarding .. - Poulsen (1994)
P³T+: A Performance Estimator for Distributed and.. - Pozgaj, Fahringer (2000)
Estimating Cache Misses and Locality Using Stack Distances - Cascaval, Padua (2003)   (1 citation)
Improving Effective Bandwidth through Compiler Enhancement of.. - Ding (2000)   (10 citations)
Efficient and Accurate Analytical Modeling of Whole-Program Data .. - Xue, Vera (2003)
Dynamic Access Ordering for Symmetric Shared-Memory Multiprocessors - McKee (1994)
Nonlinear Array Layouts for Hierarchical Memory Systems - Chatterjee, Jain.. (1999)   (50 citations)
Push vs. Pull: Data Movement for Linked Data Structures - Yang, Lebeck (2000)   (2 citations)
