Download:
|
by Kuang-chih Liu, Chung-ta King
Journal of System Architecture
ftp://pads1.cs.nthu.edu.tw/pub/papers/kcliu/jsa.ps.gz
Add To MetaCart
Abstract:
Run-time parallelization is a technique for solving problems whose data access patterns are difficult to analyze at compile time. In this paper we propose a worker-checker framework to classify different run-time parallelization schemes. Under the framework, operations performed during run-time parallelization are classified loosely into a worker and a checker. Different schemes are then cast into the framework based on the relative execution order of their worker and checker. From the framework, we identified several new run-time parallelization methods. In the second part of the paper we then examine the implementation of one such method derived from speculative parallelization [10]. The implementation is based on the idea of embedding hardware checkers inside memory controllers. We will present the design of the hardware checker and evaluate the effectiveness of the design on run-time parallelizing DOALL and DOACROSS loops. Keywords: Run-time parallelization, speculative parallelization, inspector-executor, irregular problem, smart memory 1
Citations
|
3148
|
Computer architecture: a quantitative approach
– Hennessy, Patterson
- 1990
|
|
801
|
How to Make a Multiprocessor Computer that Correctly Executes Multiprocess Programs
– Lamport
- 1979
|
|
157
|
The LRPD Test: Speculative Run-Time Parallelization of Loops with Privatization and Reduction Parallelization
– Rauchwerger, Padua
- 1995
|
|
115
|
RunTime Parallelization and Scheduling of Loops
– Saltz, Mirchandaney, et al.
- 1991
|
|
59
|
Runtime Support and Compilation Methods for User Specified Data
– Ponnusamy, Saltz, et al.
- 1993
|
|
58
|
A Scheme to Enforce Data Dependence on Large Multiprocessor Systems
– Zhu, Yew
- 1987
|
|
56
|
Improving the performance of runtime parallelization
– Leung, Zahorjan
- 1993
|
|
23
|
Augmint: A multiprocessor simulation environment for intel x86 architectures
– Sharma, Nguyen, et al.
- 1996
|
|
6
|
Josep Torrellas, and Pen-Chung Yew. An efficient algorithm for the run-time parallelization of doacross loops
– Chen
- 1994
|
|
6
|
Memory contention in scalable cache-coherent multiprocessors
– Bianchini, Crovella, et al.
- 1993
|
|
4
|
Joep Torrellas, Speeding up Irregular Applications in Shared-Memory Multiprocessors: Memory Binding and Group
– Zhang
- 1995
|
|
3
|
The PRIVATIZING DOALL Test: A Run-time Technique for
– Rauchwerger, Padua
- 1994
|
|
2
|
A Scalable Method for Run-Time
– Rauchwerger, Amato, et al.
- 1995
|
|
1
|
J.Carbajal, "Augmint --- A Multiprocessor Simulation Environment for Intel x86 Architecture", CSRD
– Sharma, Maged
- 1996
|