(Enter summary)
Abstract: We examine multiprocessor runtime support for fine-grained, irregular directed
acyclic graphs (DAGs) such as those that arise from sparse-matrix triangular solves. We
conduct our experiments on the CM-5, whose lower latencies and active-message support
allow us to achieve unprecedented speedups for a general multiprocessor. Where as
previous implementations have maximum speedups of less than 4 on even simple banded
matrices, we are able to obtain scalable performance on extremely small and ... (Update)
Context of citations to this paper: More
...with fine grain partitions. But the overhead for fine grain computation is high and it needs to be amortized over many iterations, [6]. These results clearly demonstrate that scheduling provides benefits in practice as long as the overhead is kept low. The paper is...
...memory machines. Recently impressive performance is obtained for sparse Cholesky factorization [19, 20] and sparse triangular solver [6]. We discuss how we can exploit task parallelism in sparse Cholesky factorization and LU without pivoting using our techniques in a...
Cited by: More
Symbolic Partitioning and Scheduling of Parameterized Task.. - Cosnard, Jeannot
(Correct)
Communication Optimization of Parallel Applications on Network of.. - Zhu
(Correct)
SLC: Symbolic Scheduling for Executing Parameterized Task.. - Michel Cosnard (1999)
(Correct)
Similar documents (at the sentence level):
59.1%: Multiprocessor Runtime Support for Fine-Grained, Irregular DAGs - Chong, Sharma, al. (1995)
(Correct)
9.9%: Parallelization of Fine-grained Irregular DAGs - Chong, Sharma, Brewer, Saltz
(Correct)
Active bibliography (related documents): More All
0.4: Parallel Sparse Triangular Solution with Partitioned.. - Chong, Schreiber (1994)
(Correct)
0.3: An Overview of Message Passing Environments - McBryan (1994)
(Correct)
0.3: The Implementation of Portable Programming Layers: A Case Study - van der Linden (1994)
(Correct)
Similar documents based on text: More All
0.1: Parallel Implementation of Triangular Solve - Thierry Joffrain May
(Correct)
0.1: UPC Implementation of the Sparse Triangular Solve and NAS FT - Bell, Nishtala (2004)
(Correct)
0.1: Stack And Queue Layouts Of Directed Acyclic Graphs: Part I - Heath, Pemmaraju, Trenk (1996)
(Correct)
Related documents from co-citation: More All
8: Partitioning and Scheduling Parallel Programs for Multiprocessor (context) - Sarkar - 1989
8: Scheduling of Structured and Unstructured Computation (context) - Gerasoulis, Jiao et al. - 1995
8: PYRROS: Static task scheduling and code generation for message-passing multiproc..
- Yang, Gerasoulis - 1992
BibTeX entry: (Update)
F. T. Chong, Shamik D. Sharma, Eric A. Brewer and Joel Saltz, Multiprocessor runtime support for fine-grained irregular DAGs, Draft, 1994. http://citeseer.ist.psu.edu/article/chong94multiprocessor.html More
@misc{ chong94multiprocessor,
author = "F. Chong and S. Sharma and E. Brewer and J. Saltz",
title = "Multiprocessor runtime support for fine-grained irregular DAGs",
text = "F. T. Chong, Shamik D. Sharma, Eric A. Brewer and Joel Saltz, Multiprocessor
runtime support for fine-grained irregular DAGs, Draft, 1994.",
year = "1994",
url = "citeseer.ist.psu.edu/article/chong94multiprocessor.html" }
Citations (may not include all citations):
595
Active messages: a mechanism for integrated communication an..
- Eicken - 1992
247
Partitioning and Scheduling Parallel Programs for Execution .. (context) - Sarkar - 1989
94
Run-time parallelization and scheduling of loops (context) - Saltz, Mirchandaney et al. - 1991
88
User's guide for the Harwell-Boeing sparse matrix collection (context) - Duff, Grimes et al. - 1992
85
CM-5 Technical Summary (context) - Corp, MA - 1993
76
A comparison of clustering heuristics for scheduling directe.. (context) - Gerasoulis, Yang - 1992
69
The network architecture of the connection machine CM-5 (context) - Leiserson - 1992
35
Optimal parallel solution of sparse triangular systems (context) - Alvarado, Schreiber - 1992
33
A parallel solution method for large sparse systems of equat.. (context) - Lucas, Blank et al. - 1987
19
Performance of the iPSC/860 node architecture (context) - Moyer - 1991
18
Distributed solution of sparse linear systems
- Heath, Raghavan - 1993
13
Assessing the benefits of fine-grained parallelism in datafl.. (context) - Arvind, Culler - 1988
13
How to get good performance from the CM-5 data network (context) - Brewer, Kuszmaul - 1994
8
Aggregation methods for solving sparse triangular systems on.. (context) - Saltz - 1990
6
Parallel sparse triangular solution with partitioned inverse..
- Chong, Schreiber - 1995
4
Strata: A high-performance communications library (context) - Brewer, Blumofe - 1994
3
CMMD Reference Manual V (context) - Corp, MA - 1993
The graph only includes citing articles where the year of publication is known.
Documents on the same site (http://www.cs.rutgers.edu/hpcd/Area_III.3/all_html_files/sched.html):
Parallel Sparse Triangular Solution with Partitioned.. - Chong, Schreiber (1994)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC