Abstract: While automatic parallelization of loops is generally based on compile-time analysis of data
dependences, sometimes the data dependences cannot be determined at compile time. This
often occurs, for example, when arrays are accessed via subscripted subscripts. In these cases, it
is necessary to use run-time parallelization algorithms. In this paper, we present and evaluate a
new run-time parallelization algorithm based on an inspector-executor pair. The scheme, called
CYT, handles all types of...
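To make the inspector-executor idea in the abstract concrete, here is a minimal sketch in Python. It is not the CYT algorithm, whose details are not given in this record. It is a generic wavefront scheme, and the loop body `a[writes[i]] = a[reads[i]] + 1` and the names `inspect`, `execute`, and `run_sequential` are assumptions chosen for illustration. The index arrays `reads` and `writes` play the role of the subscripted subscripts, so the dependences are only known once their contents are available at run time.

```python
from collections import defaultdict


def run_sequential(a, reads, writes):
    """Reference loop: a[writes[i]] = a[reads[i]] + 1 (subscripted subscripts)."""
    for r, w in zip(reads, writes):
        a[w] = a[r] + 1
    return a


def inspect(reads, writes):
    """Inspector (illustrative): assign each iteration to a stage (wavefront).

    An iteration is placed strictly after any earlier iteration it has a
    flow, anti, or output dependence on.
    """
    last_write = defaultdict(lambda: -1)  # stage of the latest writer of an element
    last_read = defaultdict(lambda: -1)   # latest stage in which an element is read
    stages = []
    for r, w in zip(reads, writes):
        stage = 1 + max(last_write[r],   # flow: read after an earlier write
                        last_write[w],   # output: write after an earlier write
                        last_read[w])    # anti: write after an earlier read
        stages.append(stage)
        last_read[r] = max(last_read[r], stage)
        last_write[w] = stage
    return stages


def execute(a, reads, writes, stages):
    """Executor (illustrative): run the stages in order.

    Iterations within one stage are mutually independent, so they could run
    in parallel. Here they run in reverse order to show that the order
    inside a stage does not matter.
    """
    by_stage = defaultdict(list)
    for i, s in enumerate(stages):
        by_stage[s].append(i)
    for s in sorted(by_stage):
        for i in reversed(by_stage[s]):
            a[writes[i]] = a[reads[i]] + 1
    return a


# Hypothetical index arrays, known only at run time.
reads = [0, 1, 2, 3, 1]
writes = [1, 2, 5, 1, 4]
stages = inspect(reads, writes)
staged = execute([0] * 6, reads, writes, stages)
sequential = run_sequential([0] * 6, reads, writes)
```

In this sketch the inspector runs sequentially, which is the simplest choice but not necessarily what an efficient scheme would do. The staged result should match the sequential one for any choice of index arrays.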
Context of citations to this paper:
...summarized in chapter 6. Chapter 2: Related Work. A variety of existing run time parallelization techniques, both software [11, 37, 38, 39, 54] and hardware based [34, 43, 53], can exploit coarse grained loop level parallelism from application programs that cannot be easily...
...During the past few years, techniques have been developed for the run time analysis and scheduling of loops [5, 9, 13, 17, 20, 23, 25, 26, 27, 28, 29, 30, 33, 34]. The majority of this work has concentrated on developing run time methods for constructing execution schedules...
Cited by:
The LRPD Test: Speculative Run-Time Parallelization of.. - Rauchwerger, Padua (1999)
Exploiting Locality in the Run-Time Parallelization.. - Martín, Singh.. (2002)
New OPENMP Directives for Irregular Data Access Loops - Labarta, Ayguadé.. (2000)
Similar documents (at the sentence level):
10.1%: An Efficient Algorithm for the Run-time Parallelization of .. - Chen, Torrellas, Yew (1994)
6.3%: Compiler Optimizations For Parallel Loops With Fine-Grained.. - Chen (1994)
Active bibliography (related documents):
0.6: Run-time parallelization of irregular DOACROSS loops - Thulasiraman, Krothapalli, .. (1995)
0.3: Software Logging under Speculative Parallelization - Garzaran, Prvulovic..
0.3: Implementation Of Run Time Techniques In The Polaris Fortran.. - Lawrence (1996)
Similar documents based on text:
0.2: iWatcher: Efficient Architectural Support for Software.. - Zhou, Qin, Liu, Zhou.. (2004)
0.1: Effects of Parallelism Degree on Run-Time Parallelization of Loops - Xu (1998)
0.1: On Effective Execution of Non-Uniform DOACROSS Loops - Chen, Yew (1996)
Related documents from co-citation:
26: A scheme to enforce data dependence on large multiprocessor systems - Zhu, Yew - 1987
20: Improving the performance of run-time parallelization - Leung, Zahorjan - 1993
18: Run-time parallelization and scheduling of loops - Saltz, Mirchandaney et al. - 1991
BibTeX entry:
D.-K. Chen, J. Torrellas and P.-C. Yew, An efficient algorithm for the run-time parallelization of DOACROSS Loops, Proceedings of Supercomputing, 1994. http://citeseer.ist.psu.edu/article/chen94efficient.html
@inproceedings{ chen94efficient,
author = "Ding-Kai Chen and Josep Torrellas and Pen-Chung Yew",
title = "An efficient algorithm for the run-time parallelization of {DOACROSS} loops",
booktitle = "Supercomputing",
pages = "518-527",
year = "1994",
url = "citeseer.ist.psu.edu/article/chen94efficient.html" }
Citations (may not include all citations):
299: Dependence Analysis for Supercomputing - Banerjee - 1988
217: The Perfect Club Benchmarks: Effective Performance Evaluatio.. - Berry - 1989
159: The LRPD Test: Speculative Run-Time Parallelization of Loops.. - Rauchwerger, Padua - 1995
94: Run-Time Parallelization and Scheduling of Loops - Saltz, Mirchandaney et al. - 1991
78: Compiler Algorithms for Synchronization - Midkiff, Padua - 1987
69: Runtime Compilation Techniques for Data Partitioning and Com.. - Ponnusamy, Saltz et al. - 1993
55: A Scheme to Enforce Data Dependence on Large Multiprocessor .. - Zhu, Yew - 1987
44: Optimizing Compilers for Supercomputers - Wolfe - 1982
44: The PRIVATIZING DOALL Test: A Run-Time Technique for DOALL L.. - Rauchwerger, Padua - 1994
42: Improving the Performance of Runtime Parallelization - Leung, Zahorjan - 1993
42: Loop Skewing: The Wavefront Method Revisited - Wolfe - 1986
30: The Cedar System and an Initial Performance Study - Kuck - 1993
25: Multiprocessors: Discussion of Some Theoretical and Practica.. - Padua - 1979
23: An Approach to Synchronization of Parallel Computing - Krothapalli, Sadayappan - 1988
21: Dependence Uniformization: A Loop Parallelization Technique - Tzen, Ni - 1993
21: Center for Supercomputing Research and Development - Eigenmann, Hoeflinger et al. - 1992
5: Advanced Loop Optimizations for Parallel Computers - Polychronopoulos - 1987
5: A Scheme for Effective Execution of Irregular DOACROSS Loops - Chen, Yew - 1992
Documents on the same site (http://iacoma.cs.uiuc.edu/papers.html):
Comprehensive Hardware and Software Support for Operating.. - Xia, Torrellas (1999)
Software Trace Cache - Ramírez, Larriba-Pey.. (1999)
Evaluating the Performance of Cache-Affinity Scheduling.. - Torrellas, Tucker, Gupta (1995)