(Enter summary)
Abstract: With the advent of chip-multiprocessors (CMPs), Thread-Level Speculation (TLS) remains a
promising technique for exploiting this highly multithreaded hardware to improve the performance
of an individual program. However, with such speculatively-parallel execution the cache
locality once enjoyed by the original uniprocessor execution is significantly disrupted: for TLS
execution on a four-processor CMP, we find that the data-cache miss rates are nearly four-times
those of the uniprocessor case,... (Update)
Cited by: More
Tolerating Dependences Between Large Speculative.. - Christopher Colohan..
(Correct)
Active bibliography (related documents): More All
1.1: Compiler Optimization of Value Communication for Thread-Level.. - Zhai (2005)
(Correct)
0.6: Enhancing Software Reliability With Speculative Threads - And The Committee
(Correct)
0.5: Performance Implications Of Future-Generation Memory Systems.. - Fertig (2003)
(Correct)
Similar documents based on text: More All
0.2: Speculative Lock Reordering: Optimistic Out-of-Order.. - Rundberg, Stenström
(Correct)
0.1: Glenn M. Fung Moo - Gfung Cs Wisc
(Correct)
0.1: A Low-Overhead Software Approach to Thread-Level Data Dependence .. - Rundberg (2000)
(Correct)
BibTeX entry: (Update)
S. Fung and J. G. Steffan. Improving cache locality for thread-level speculation. In IPDPS 20, April 2006. http://citeseer.ist.psu.edu/fung05improving.html More
@misc{ fung06improving,
author = "S. Fung and J. Steffan",
title = "Improving cache locality for thread-level speculation",
text = "S. Fung and J. G. Steffan. Improving cache locality for thread-level speculation.
In IPDPS 20, April 2006.",
year = "2006",
url = "citeseer.ist.psu.edu/fung05improving.html" }
Citations (may not include all citations):
474
A data locality optimizing algorithm (context) - Wolf, Lam - 1991
344
Design and evaluation of a compiler algorithm for prefetchin..
- Mowry, Lam et al. - 1992
269
Multiscalar processors
- Breach, Sohi et al. - 1995
251
Simultaneous Multithreading: Maximizing On-Chip Parallelism
- Tullsen, Eggers et al. - 1995
249
Tolerating latency through software-controlled prefetching i..
- Mowry, Gupta - 1991
195
Department of Computer Sciences
- Franklin, Architecture - 1993
159
The LRPD Test: Speculative Run-Time Parallelization of Loops..
- Rauchwerger, Padua - 1995
137
Compiler optimizations for improving data locality
- Carr, McKinley et al. - 1994
136
superscalar microprocessor (context) - Yeager - 1996
104
Prefetching using markov predictors
- Joseph, Grunwald - 1997
94
Stride directed prefetching in scalar processors (context) - Fu, Patel et al. - 1992
74
Speculative versioning cache
- Gopal, Vijaykumar et al. - 1998
72
A dynamic multithreading processor
- Akkary, Driscoll - 1998
72
Data speculation support for a chip multiprocessor (context) - Hammond, Willey et al. - 1998
70
Adaptive cache coherency for detecting migratory shared data
- Cox, Fowler - 1993
67
Arb: A hardware mechanism for dynamic reordering of memory r..
- Franklin, Sohi - 1996
45
Dynamic self-invalidation: reducing coherence overhead in sh..
- Lebeck, Wood - 1995
41
Speculative data-driven multithreading (context) - Roth, Sohi - 2001
38
Fixed and adaptive sequential prefetching in shared-memory m.. (context) - Dahlgren, Dubois et al. - 1993
37
Tolerating memory latency through Software-Controlled Pre-Ex..
- Luk - 2001
35
Execution-based prediction using speculative slices
- Zilles, Sohi - 2001
30
Clustered speculative multithreaded processors
- Marcuello, Gonzalez - 1999
30
A chip multiprocessor architecture with speculative multithr..
- Krishnan, Torrellas - 1999
25
Compiling for the Multiscalar Architecture
- Vijaykumar - 1998
20
The stanford hydra cmp (context) - Hammond, Hubbert et al. - 2000
19
Coarse-Grained Speculative Execution in Shared-Memory Multip..
- at, Kazi et al. - 1998
17
An evaluation of fine-grain producer-initiated communication..
- Abdel-Shafi, Hall et al. - 1997
12
ective hardware-based data prefetching for highperformance p.. (context) - Baer, Chen - 1995
11
i and Babak Falsafi. Selective, accurate, and timely self-in.. (context) - La - 2000
9
Evaluating stream bu#ers as a secondary cache replacement (context) - Palacharla, Kessler - 1994
5
A study of three dynamic approaches to handle widely shared .. (context) - Kaxiras, Gjessing et al. - 1998
5
A quantitative framework for automated pre-execution thread .. (context) - Roth, Sohi - 2002
4
A scalable approach to thread-level speculation (context) - Ste, Christopher et al. - 2000
3
Improving value communication for thread-level speculation (context) - Ste, Christopher et al. - 2002
3
Tcp: Tag correlating prefetchers
- Hu, Martonosi et al. - 2003
3
Physical experimentation with prefetching helper threads on .. (context) - Kim, wei et al. - 2004
3
Hardware Support for Thread-Level Speculation (context) - Ste - 2003
2
Guided region prefetching cooperative hardwaresoftware appro..
- Doug, Reinhardt et al. - 2003
1
Quinn Jacobson and Jim Smith (context) - Eric - 1997
1
architecture: Compiler-assisted fine-grained multithreading (context) - Breach, Vijaykumar et al.
1
Compiler optimization of memory-resident value communication.. (context) - Ste, Zhai et al. - 2004
1
Speculative precomputation on chip multiprocessors (context) - ery, Brown et al. - 2001
1
Compiler optimization of scalar value communication between .. (context) - Ste, Zhai et al. - 2002
1
A hardware-base cache pollution filtering mechanism for aggr.. (context) - Zhuang, Lee - 2003
1
Speculative multithreaded processors (context) - Pedro, Jordi - 1998
http://www.spechbench.org
Documents on the same site (http://www.eecg.toronto.edu/~steffan/publications.html): More
Extending Cache Coherence to Support Thread-Level Data.. - Steffan, Colohan, Mowry (1998)
(Correct)
Generating Network Topologies That Obey Power Laws - Palmer, Steffan (2000)
(Correct)
Improving Value Communication for Thread-Level Speculation - Steffan, Colohan, Zhai.. (2002)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC