See this document in CiteSeerX!

Improving Performance of Sparse Matrix-Vector Multiplication (1999)  (Make Corrections)  (5 citations)
Ali Pinar, Michael T. Heath



  Home/Search   Context   Related

Links:   ACM

 
View or download:
uiuc.edu/~alipinar/papers/sc99.ps
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  uiuc.edu/~alipinar/pubs (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: Sparse matrix-vector multiplication (SpMxV) is one of the most important computational kernels in scientific computing. It often suffers from poor cache utilization and extra load operations because of memory indirections used to exploit sparsity. We propose alternative data structures, along with reordering algorithms to increase effectiveness of these data structures, to reduce the number of memory indirections. Toledo proposed handling the 1x2 blocks of a matrix separately, doing only one... (Update)

Context of citations to this paper:   More

...due to their particular sparsity patterns, exhibit more conflicts or reduced spatial locality. Some form of matrix reordering [30, 19, 24, 15], or the use of multiple rc block sizes are likely to be the most e#ective way to address this performance issue. On the Power3,...

...of numerous efforts. Some of them [32,33] target convergence properties of iterative solvers in a parallel setting while others [34 38] optimize the most common numerical kernels or provide intelligent algorithmic choices for the entire problem solv ing process [39,40] In...

Cited by:   More
Performance Optimizations and Bounds for Sparse.. - Vuduc, Demmel, Yelick (2002)   (Correct)
Memory Hierarchy Optimizations and Performance Bounds.. - Vuduc, Gyulassy.. (2003)   (Correct)
Self Adapting Software for Numerical Linear Algebra.. - Chen, Dongarra.. (2003)   (Correct)

Active bibliography (related documents):   More   All
0.2:   Improving Memory-System Performance of Sparse Matrix-Vector.. - Toledo (1997)   (Correct)
0.1:   Optimizing Sparse Matrix-Vector Product Computations.. - Mellor-Crummey, Garvin (2003)   (Correct)
0.1:   Self-adapting Numerical Software for Next Generation.. - Dongarra, Eijkhout (2002)   (Correct)

Similar documents based on text:   More   All
0.3:   Implementing Efficient Indirections - Maisonneuve, Shapiro (1993)   (Correct)
0.3:   Efficient Resolution of Sparse Indirections in Data-Parallel.. - Ujaldon, Zapata (1995)   (Correct)
0.1:   Design and Implementation of a Parallel I/O.. - No, Park.. (1998)   (Correct)

Related documents from co-citation:   More   All
4:   Improving memory-system performance of sparse matrix-vector multiplication - Toledo - 1997
3:   Optimizing the Performance of Sparse Matrix-Vector Multiply (context) - Im - 2000
3:   Automatic nonzero structure analysis (context) - Bik, Wijshoff - 1999

BibTeX entry:   (Update)

Pinar, A. and M. T. Heath, Improving Performance of Sparse Matrix-Vector Multiplication, Proceedings of Supercomputing 99, Portland, OR, November 1999. http://citeseer.ist.psu.edu/pinar99improving.html   More

@inproceedings{ pinar99improving,
    author = "Ali Pinar and Michael T. Heath",
    title = "Improving Performance of Sparse Matrix-vector Multiplication",
    year = "1999",
    url = "citeseer.ist.psu.edu/pinar99improving.html" }
Citations (may not include all citations):
70   The design and implementation of a parallel unstructured Eul.. - Das, Mavriplis et al. - 1994
46   Lecture Notes in Computer Science (context) - The, salesman et al. - 1994
27   Characterizing the behavior of sparse algorithms on caches - Temam, Jalby - 1992  ACM   DBLP
25   Improving Memory-System Performance of Sparse Matrix-Vector .. - Toledo - 1997  DBLP
11   A high performance algorithm using pre-processing for sparse.. (context) - Agarwal, Gustavson et al.
10   Renumbering unstructured grids to improve the performance of.. - Burgess, Giles - 1995  ACM
1   The traveling salesman problem: the significance of initial .. (context) - Perttunen - 1989

Documents on the same site (http://www.cse.uiuc.edu/~alipinar/pubs.html):   More
On Identifying Strongly Connected Components in Parallel - Fleischer, Hendrickson, Pinar (2000)   (Correct)
Sparse Matrix Decomposition with Optimal Load Balancing - Pinar, Aykanat (1997)   (Correct)
Decomposing Linear Programs for Parallel Solution - Pinar, Catalyurek, Ayakanat.. (1996)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC