(Enter summary)
Abstract: A swap instruction, which exchanges a value in memory with a value in a register, is available on many architectures. The primary application of a swap instruction has been for process synchronization. This thesis shows that a swap instruction can often be used to coalesce loads and stores in a variety of applications. The thesis describes the analysis necessary to detect opportunities to exploit a swap and the transformation required to coalesce a load and a store into a swap instruction. The... (Update)
Context of citations to this paper: More
.... transformation to coalesce a load and a store into a swap instruction was accomplished using an algorithm described in detail elsewhere [4]. The algorithm finds a load followed by a store to the same address and coalesces the two memory references together into a single swap...
Cited by: More
Using a Swap Instruction to Coalesce Loads and Stores - Qasem, Whalley, Yuan, van..
(Correct)
Active bibliography (related documents): More All
0.1: Static Cache Simulation and its Applications - Mueller (1994)
(Correct)
0.1: Data-Specific Optimizations - Jinturkar (1996)
(Correct)
0.1: An Aggressive Approach to Loop Unrolling - Davidson, Jinturkar (1995)
(Correct)
Similar documents based on text: More All
0.8: On-Line Intrusion Detection Using Sequences of System Calls - Snyder (2001)
(Correct)
0.8: Avoiding Unconditional Jumps by Code Replication - Mueller (1992)
(Correct)
0.7: Effectively Exploiting Indirect Jumps - Uh (1997)
(Correct)
BibTeX entry: (Update)
A. Qasem, D. Whalley, X. Yuan, R. van Engelen, "Using a Swap Instruction to Coalesce Loads and Stores," Technical Report TR-010501, Computer Science Dept., Florida State University. 5 http://citeseer.ist.psu.edu/article/qasem01using.html More
@article{ qasem01using,
author = "Apan Qasem and David Whalley and Xin Yuan and Robert van Engelen",
title = "Using a Swap Instruction to Coalesce Loads and Stores",
journal = "Lecture Notes in Computer Science",
volume = "2150",
pages = "235--240",
year = "2001",
url = "citeseer.ist.psu.edu/article/qasem01using.html" }
Citations (may not include all citations):
1575
Computer Architecture: A Quantitative Approach (context) - Hennessy, Patterson - 1996 ACM
983
The Art of Scientific Computing (context) - Press, Teukolsky et al. - 1996
376
The Cache Performance and Optimizations of Blocked Algorithm.. (context) - Lam, Rothberg et al. - 1991 ACM DBLP
103
A Case for Direct--Mapped Caches (context) - Hill - 1988
79
A Portable Global Optimizer and Linker (context) - Benitez, Davidson - 1988 ACM DBLP
23
Memory Access Coalescing: A Technique for Eliminating Redund..
- Davidson, Jinturkar - 1994 ACM DBLP
21
A Design Environment for Addressing Architecture and Compile..
- Davidson, Whalley - 1991
11
Ease: An Environment for Architecture Study and Experimentat.. (context) - Davidson, Whalley - 1990 ACM DBLP
11
Digital Image Processing Algorithms and Applications (context) - Pitas - 2000 ACM
2
Simple Algorithms for Traversing a Tree without a Stack (context) - Dwyer - 1973
Documents on the same site (http://www.cs.fsu.edu/~whalley/research.html): More
A Design Environment for Addressing Architecture and.. - Davidson, Whalley (1991)
(Correct)
Quick Compilers Using Peephole Optimization - Davidson, Whalley (1989)
(Correct)
Fast Context Switches: Compiler and Architectural Support .. - Snyder, Whalley, Baker
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC