Automatic Performance Tuning and Analysis of Sparse Triangular Solve (2002)  (2 citations)
Richard Vuduc, Shoaib Kamil, Jen Hsu, Rajesh Nishtala, James W. Demmel, Katherine A. Yelick



View or download:
lsu.edu/jxr/pohll02/papers...vuduc.pdf

From:  lsu.edu/jxr/ics02workshop

Abstract: In this paper, we consider the solution of the sparse lower triangular system Lx = y for a single dense vector x, given the lower triangular sparse matrix L and dense vector y.
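
To make the problem in the abstract concrete, the following is a minimal sketch of the untuned baseline: forward substitution for Lx = y with L stored in compressed sparse row (CSR) form. The function name `sparse_lower_solve`, the CSR argument names (`indptr`, `indices`, `data`), and the small 3x3 example are assumptions made for illustration. This is not the tuned algorithm the paper develops, only the plain kernel that such tuning would start from.

```python
def sparse_lower_solve(indptr, indices, data, y):
    """Solve L x = y by forward substitution.

    L is a sparse lower triangular matrix in CSR form (hypothetical
    argument names): row i owns entries data[indptr[i]:indptr[i + 1]],
    whose column indices are indices[indptr[i]:indptr[i + 1]].
    """
    n = len(y)
    x = [0.0] * n
    for i in range(n):
        acc = y[i]
        diag = None
        for k in range(indptr[i], indptr[i + 1]):
            j = indices[k]
            if j < i:
                # Subtract the contribution of an already computed unknown.
                acc -= data[k] * x[j]
            elif j == i:
                diag = data[k]
            else:
                raise ValueError("entry above the diagonal: L is not lower triangular")
        if diag is None or diag == 0.0:
            raise ZeroDivisionError(f"zero or missing diagonal in row {i}")
        x[i] = acc / diag
    return x


# Illustrative system (made up for this sketch):
#     [2 0 0]       [ 2]
# L = [1 3 0],  y = [ 7]
#     [0 4 5]       [23]
indptr = [0, 1, 3, 5]
indices = [0, 0, 1, 1, 2]
data = [2.0, 1.0, 3.0, 4.0, 5.0]
x = sparse_lower_solve(indptr, indices, data, [2.0, 7.0, 23.0])
print(x)  # [1.0, 2.0, 3.0]
```

Each row reads earlier entries of x through the indirect index `indices[k]`, so the memory access pattern depends on the sparsity structure of L.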

Cited by:
Memory Hierarchy Optimizations and Performance Bounds .. - Vuduc, Gyulassy.. (2003)
UPC Implementation of the Sparse Triangular Solve and NAS FT - Bell, Nishtala (2004)

Similar documents (at the sentence level):
5.0%:   Automatic Performance Tuning and Analysis of Sparse .. - Vuduc, Kamil, Hsu, .. (2002)

Active bibliography (related documents):
1.1:   Performance Optimizations and Bounds for Sparse.. - Vuduc, Demmel.. (2002)
0.9:   Performance Modeling and Analysis of Cache Blocking.. - Nishtala, Vuduc.. (2004)
0.8:   Memory Hierarchy Optimizations and Performance Bounds.. - Vuduc, Gyulassy.. (2003)

Similar documents based on text:
0.7:   Performance Tuning and Analysis of Sparse Triangular Solve .. - Richie Bebop Computer (2002)
0.5:   Automatic Performance Tuning of Sparse Matrix Kernels - Vuduc (2003)
0.4:   When Cache Blocking of Sparse Matrix Vector Multiply.. - Nishtala, Vuduc..

Related documents from co-citation:
2:   SPARSKIT: A Basic Tool Kit for Sparse Matrix Computation - Saad - 1990

BibTeX entry:

Richard Vuduc, Shoaib Kamil, Jen Hsu, Rajesh Nishtala, James W. Demmel, and Katherine A. Yelick. Automatic performance tuning and analysis of sparse triangular solve. In ICS 2002. http://citeseer.ist.psu.edu/article/vuduc02automatic.html

@misc{ vuduc02automatic,
  author = "R. Vuduc and S. Kamil and J. Hsu and R. Nishtala and J. Demmel and K. Yelick",
  title = "Automatic performance tuning and analysis of sparse triangular solve",
  text = "Richard Vuduc, Shoaib Kamil, Jen Hsu, Rajesh Nishtala, James W. Demmel,
    and Katherine A. Yelick. Automatic performance tuning and analysis of sparse
    triangular solve. In ICS 2002.",
  year = "2002",
  url = "citeseer.ist.psu.edu/article/vuduc02automatic.html" }

Citations (may not include all citations):
474   A data locality optimizing algorithm - Wolf, Lam - 1991
417   Templates for the Solution of Linear Systems: Building Block.. - Barrett, Berry et al. - 1994
165   SPARSKIT: A basic toolkit for sparse matrix computations - Saad - 1994
162   Improving data locality with loop transformations - McKinley, Carr et al. - 1996
157   Automatically tuned linear algebra software - Whaley, Dongarra - 1998
123   Optimizing matrix multiply using PHiPAC: a portable - Bilmes, Asanovic et al. - 1997
84   Compiler blockability of numerical algorithms - Carr, Kennedy - 1992
75   A supernodal approach to sparse partial pivoting - Demmel, Eisenstat et al. - 1999
63   An unsymmetric-pattern multifrontal method for sparse LU fac.. - Davis, Duff - 1997
58   Cache miss equations: a compiler framework for analyzing and.. - Ghosh, Martonosi et al. - 1999
35   Optimal parallel solution of sparse triangular systems - Alvarado, Schreiber - 1993
33   Exact analysis of the cache behavior of nested loops - Chatterjee, Parker et al. - 2001
32   A fully asynchronous multifrontal solver using distributed d.. - Amestoy, Duff et al. - 2001
29   SPOOLES: An object-oriented sparse matrix library - Ashcraft, Grimes - 1999
27   Characterizing the behavior of sparse algorithms on caches - Temam, Jalby - 1992
25   A parallel triangular solver for a distributed-memory multip.. - Li, Coleman - 1998
23   Optimizing the performance of sparse matrix-vector multiplic.. - Im - 2000
22   A scalable cross-platform infrastructure for application per.. - Browne, Dongarra et al. - 2000
15   Optimizing sparse matrix computations for register reuse in .. - Im, Yelick - 2001
15   NIST Sparse BLAS: User's Guide - Remington, Pozo - 1996
13   CPU Performance Evaluation and Execution Time Prediction Usi.. - Saavedra-Barrera - 1992
13   A Relational Approach to the Automatic Generation of Sequent.. - Stodghill - 1997
12   Automatic nonzero structure analysis - Bik, Wijshoff - 1999
10   Performance optimizations and bounds for sparse matrix-vecto.. - Vuduc, Demmel et al. - 2002
10   Memory hierarchy performance prediction for sparse blocked a.. - Fraguela, Doallo et al. - 1999
9   Modeling and improving locality for irregular problems: spar.. - Heras, Perez et al. - 1999
8   Parallel ICCG on a hierarchical memory multiprocessor---addr.. - Rothberg, Gupta - 1992
7   Parallel algorithms for forward elimination and backward sub.. - Gupta, Kumar - 1995
6   Algorithms for sparse matrix computations on high-performanc.. - Navarro, García et al. - 1996
6   Document for the Basic Linear Algebra Subprograms - Blackford, Corliss et al. - 2001
5   LAWRA--Linear Algebra With Recursive Algorithms - Andersen, Gustavson et al. - 1999
5   A high performance two dimensional scalable parallel algorit.. - Joshi, Karypis et al. - 1997
5   PSBLAS: A library for parallel linear algebra computation on.. - Filippone, Colajanni - 2000
4   Alternatives for solving sparse triangular systems on distri.. - Rothberg - 1995
4   Solving triangular linear systems in parallel using substitu.. - Santos - 1995
3   The performance of parallel sparse triangular solution - Heath, Raghavan - 1998
2   Efficient parallel triangular solution using selective inversion - Raghavan - 1998
2   Andersson, R. Bell, J. Hague, H. Holthoff, P. Mayes, J. Nakano,.. - 1998
2   Efficient and scalable parallel sparse direct solver - Joshi, Karypis et al. - 1999
http://www.cs.virginia.edu/stream

Documents on the same site (http://www.ece.lsu.edu/jxr/ics02workshop.html):
The Science of Programming High-Performance Linear Algebra.. - Paolo Bientinesi John (2002)
Compiler Support for Optimizing Tensor Contraction.. - Baumgartner, Cociorva, ..
A Component Architecture for High-Performance Computing - Bernholdt, Elwasif, Kohl, al. (2002)
