(Enter summary)
Abstract: Speculative parallel execution may well be the best way of
speeding up codes whose dependences can not be analyzed by
the compiler. Supporting speculative parallelization in hardware
in Distributed Shared-Memory (DSM) multiprocessors
is challenging because of the long-latency of memory accesses
and the physical distribution of the memory system. Our approach
is to add extensions to the cache coherence protocol
hardware of the machine to automatically detect dependence
violations at run time.... (Update)
Context of citations to this paper: More
...to be exploited. For example, today s compilers are still limited in exploiting parallelism in most C programs. Speculative architectures [4 8] can potentially overcome these limitations. This paper makes two contributions towards our goal of developing advanced compiler...
Cited by: More
Compiling for Speculative Architectures - Kim, Eigenmann
(Correct)
Similar documents (at the sentence level):
78.1%: Hardware for Speculative Parallelization of.. - Zhang, Rauchwerger.. (1999)
(Correct)
23.4%: Hardware for Speculative Reduction Parallelization and .. - Zhang, Rauchwerger.. (1999)
(Correct)
8.0%: Speculative Parallel Execution of Loops with.. - Zhang, Rauchwerger.. (1997)
(Correct)
Active bibliography (related documents): More All
0.6: Architectural Support for Scalable Speculative.. - Cintra, Martinez.. (2000)
(Correct)
0.5: Constant Propagation Through Array Variables - Wonnacott (1999)
(Correct)
0.3: Software Logging under Speculative Parallelization - Garzaran, Prvulovic..
(Correct)
Similar documents based on text: More All
0.6: Removing Architectural Bottlenecks to the.. - Prvulovic..
(Correct)
0.4: Speculative Synchronization: Applying Thread-Level.. - Martinez, Torrellas (2002)
(Correct)
0.3: Scal-Tool: Pinpointing and Quantifying Scalability.. - Solihin, Lam, Torrellas (1999)
(Correct)
BibTeX entry: (Update)
Ye Zhang, Lawrence Rauchwerger, and Josep Torrellas. Hardware for speculative parallelization in high-end multiprocessors. The Third PetaFlop Workshop (TPF-3), February 1999. http://citeseer.ist.psu.edu/article/zhang99hardware.html More
@inproceedings{ zhang99hardware,
author = "Y. Zhang and Lawrence Rauchwerger and Josep Torrellas",
title = "Hardware for Speculative Parallelization of Partially-Parallel Loops in {DSM} Multiprocessors",
booktitle = "{HPCA}",
pages = "135-",
year = "1999",
url = "citeseer.ist.psu.edu/article/zhang99hardware.html" }
Citations (may not include all citations):
478
The Stanford Dash Multiprocessor (context) - Lenoski, Laudon et al. - 1992 ACM DBLP
197
Maximizing Multiprocessor Performance with the SUIF Compiler
- Hall, Anderson et al. - 1996 DBLP
159
The LRPD Test: Speculative Run-Time Parallelization of Loops..
- Rauchwerger, Padua - 1995
74
Speculative Versioning Cache
- Gopal, Vijaykumar et al. - 1998 ACM DBLP
72
Data Speculation Support for a Chip Multiprocessor (context) - Hammond, Willey et al. - 1998 ACM DBLP
65
The ParaScope Parallel Programming Environment
- Cooper, Hall et al. - 1993
50
Runtime Compilation Methods for Multicomputers (context) - Wu, Saltz et al. - 1991 DBLP
44
Hardware for Speculative Run-Time Parallelization in Distrib..
- Zhang, Rauchwerger et al. - 1998
43
Advanced Program Restructuring for High-Performance Computer..
- Blume, Doallo et al. - 1996
42
Improving the Performance of Runtime Parallelization
- Leung, Zahorjan - 1993
26
Hardware and Software Support for Speculative Execution of S..
- Krishnan, Torrellas - 1998 ACM DBLP
19
A Scalable Method for Run-Time Loop Parallelization
- Rauchwerger, Amato et al. - 1995
17
MINT: A Front End for Ecient Simulation of Shared-Memory Mul.. (context) - Veenstra, Fowler - 1994
13
The Potential for Using ThreadLevel Data Speculation to Faci.. (context) - Ste, Mowry - 1998
12
Hardware for Speculative Parallelization of Partially-Parall..
- Zhang, Rauchwerger et al. - 1999 ACM DBLP
8
HPF-2 Scope of Activities and Motivating Applications (context) - Du, Schreiber et al. - 1994
3
An Ecient Algorithm for the Run-Time Parallelization of Do-A.. (context) - Chen, Torrellas et al. - 1994
2
The Perfect Club Benchmarks: Eective Performance Evaluation.. (context) - Berry - 1989
2
Hardware for Speculative Parallelization in DSM Multiprocess.. (context) - Zhang - 1999
Documents on the same site (http://iacoma.cs.uiuc.edu/papers.html): More
Comprehensive Hardware and Software Support for Operating.. - Xia, Torrellas (1999)
(Correct)
Software Trace Cache - Ramírez, Larriba-Pey.. (1999)
(Correct)
An Efficient Algorithm for the Run-time Parallelization of .. - Chen, Torrellas, Yew (1994)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC