See this document in CiteSeerX!

Improving Cache Locality for Thread-Level Speculation Systems (2005)  (Make Corrections)  (1 citation)
Stanley Lap Chiu Fung



  Home/Search   Context   Related

 
View or download:
toronto.edu/~steff...ng_masc_thesis.pdf
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  toronto.edu/~steff...publications (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: With the advent of chip-multiprocessors (CMPs), Thread-Level Speculation (TLS) remains a promising technique for exploiting this highly multithreaded hardware to improve the performance of an individual program. However, with such speculatively-parallel execution the cache locality once enjoyed by the original uniprocessor execution is significantly disrupted: for TLS execution on a four-processor CMP, we find that the data-cache miss rates are nearly four-times those of the uniprocessor case,... (Update)

Cited by:   More
Tolerating Dependences Between Large Speculative.. - Christopher Colohan..   (Correct)

Active bibliography (related documents):   More   All
1.1:   Compiler Optimization of Value Communication for Thread-Level.. - Zhai (2005)   (Correct)
0.6:   Enhancing Software Reliability With Speculative Threads - And The Committee   (Correct)
0.5:   Performance Implications Of Future-Generation Memory Systems.. - Fertig (2003)   (Correct)

Similar documents based on text:   More   All
0.2:   Speculative Lock Reordering: Optimistic Out-of-Order.. - Rundberg, Stenström   (Correct)
0.1:   Glenn M. Fung Moo - Gfung Cs Wisc   (Correct)
0.1:   A Low-Overhead Software Approach to Thread-Level Data Dependence .. - Rundberg (2000)   (Correct)

BibTeX entry:   (Update)

S. Fung and J. G. Steffan. Improving cache locality for thread-level speculation. In IPDPS 20, April 2006. http://citeseer.ist.psu.edu/fung05improving.html   More

@misc{ fung06improving,
  author = "S. Fung and J. Steffan",
  title = "Improving cache locality for thread-level speculation",
  text = "S. Fung and J. G. Steffan. Improving cache locality for thread-level speculation.
    In IPDPS 20, April 2006.",
  year = "2006",
  url = "citeseer.ist.psu.edu/fung05improving.html" }
Citations (may not include all citations):
474   A data locality optimizing algorithm (context) - Wolf, Lam - 1991
344   Design and evaluation of a compiler algorithm for prefetchin.. - Mowry, Lam et al. - 1992
269   Multiscalar processors - Breach, Sohi et al. - 1995
251   Simultaneous Multithreading: Maximizing On-Chip Parallelism - Tullsen, Eggers et al. - 1995
249   Tolerating latency through software-controlled prefetching i.. - Mowry, Gupta - 1991
195   Department of Computer Sciences - Franklin, Architecture - 1993
159   The LRPD Test: Speculative Run-Time Parallelization of Loops.. - Rauchwerger, Padua - 1995
137   Compiler optimizations for improving data locality - Carr, McKinley et al. - 1994
136   superscalar microprocessor (context) - Yeager - 1996
104   Prefetching using markov predictors - Joseph, Grunwald - 1997
94   Stride directed prefetching in scalar processors (context) - Fu, Patel et al. - 1992
74   Speculative versioning cache - Gopal, Vijaykumar et al. - 1998
72   A dynamic multithreading processor - Akkary, Driscoll - 1998
72   Data speculation support for a chip multiprocessor (context) - Hammond, Willey et al. - 1998
70   Adaptive cache coherency for detecting migratory shared data - Cox, Fowler - 1993
67   Arb: A hardware mechanism for dynamic reordering of memory r.. - Franklin, Sohi - 1996
45   Dynamic self-invalidation: reducing coherence overhead in sh.. - Lebeck, Wood - 1995
41   Speculative data-driven multithreading (context) - Roth, Sohi - 2001
38   Fixed and adaptive sequential prefetching in shared-memory m.. (context) - Dahlgren, Dubois et al. - 1993
37   Tolerating memory latency through Software-Controlled Pre-Ex.. - Luk - 2001
35   Execution-based prediction using speculative slices - Zilles, Sohi - 2001
30   Clustered speculative multithreaded processors - Marcuello, Gonzalez - 1999
30   A chip multiprocessor architecture with speculative multithr.. - Krishnan, Torrellas - 1999
25   Compiling for the Multiscalar Architecture - Vijaykumar - 1998
20   The stanford hydra cmp (context) - Hammond, Hubbert et al. - 2000
19   Coarse-Grained Speculative Execution in Shared-Memory Multip.. - at, Kazi et al. - 1998
17   An evaluation of fine-grain producer-initiated communication.. - Abdel-Shafi, Hall et al. - 1997
12   ective hardware-based data prefetching for highperformance p.. (context) - Baer, Chen - 1995
11   i and Babak Falsafi. Selective, accurate, and timely self-in.. (context) - La - 2000
9   Evaluating stream bu#ers as a secondary cache replacement (context) - Palacharla, Kessler - 1994
5   A study of three dynamic approaches to handle widely shared .. (context) - Kaxiras, Gjessing et al. - 1998
5   A quantitative framework for automated pre-execution thread .. (context) - Roth, Sohi - 2002
4   A scalable approach to thread-level speculation (context) - Ste, Christopher et al. - 2000
3   Improving value communication for thread-level speculation (context) - Ste, Christopher et al. - 2002
3   Tcp: Tag correlating prefetchers - Hu, Martonosi et al. - 2003
3   Physical experimentation with prefetching helper threads on .. (context) - Kim, wei et al. - 2004
3   Hardware Support for Thread-Level Speculation (context) - Ste - 2003
2   Guided region prefetching cooperative hardwaresoftware appro.. - Doug, Reinhardt et al. - 2003
1   Quinn Jacobson and Jim Smith (context) - Eric - 1997
1   architecture: Compiler-assisted fine-grained multithreading (context) - Breach, Vijaykumar et al.
1   Compiler optimization of memory-resident value communication.. (context) - Ste, Zhai et al. - 2004
1   Speculative precomputation on chip multiprocessors (context) - ery, Brown et al. - 2001
1   Compiler optimization of scalar value communication between .. (context) - Ste, Zhai et al. - 2002
1   A hardware-base cache pollution filtering mechanism for aggr.. (context) - Zhuang, Lee - 2003
1   Speculative multithreaded processors (context) - Pedro, Jordi - 1998
http://www.spechbench.org

Documents on the same site (http://www.eecg.toronto.edu/~steffan/publications.html):   More
Extending Cache Coherence to Support Thread-Level Data.. - Steffan, Colohan, Mowry (1998)   (Correct)
Generating Network Topologies That Obey Power Laws - Palmer, Steffan (2000)   (Correct)
Improving Value Communication for Thread-Level Speculation - Steffan, Colohan, Zhai.. (2002)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC