10 citations found. Retrieving documents...
R. Vuduc, J. W. Demmel, K. A. Yelick, S. Kamil, R. Nishtala, and B. Lee. Performance optimizations and bounds for sparse matrix-vector multiply. In Proceedings of Supercomputing, Baltimore, MD, USA, November 2002.

 Home/Search   Document Details and Download   Summary   Related Articles   Check  

This paper is cited in the following contexts:
Self Adaptivity in Grid Computing - Vadhiyar, Dongarra   (Correct)

....the computing environment is to build self adaptive software that studies the characteristics of the computing environments and chooses the software parameters to achieve high efficiency on the computing environment. Recently, a number of self adaptive software have been designed and implemented [13, 27, 24, 26, 7, 3]. Some of the software apply adaptivity to the computational processors [13, 27] some are tuned for communication networks [24] some are This work is supported in part by the National Science Foundation contract GRANT #EIA 9975020, SC #R36505 29200099 and GRANT #EIA 9975015 intended for ....

....contract GRANT #EIA 9975020, SC #R36505 29200099 and GRANT #EIA 9975015 intended for workstation clusters [7] and some have been developed for computational Grids [3] The adaptive software also differ in terms of the time when adaptivity is performed. Some perform adaptivity at installation time [27,24, 26] while some perform adaptivity at the run time [7, 3] There are very few self adaptive software that dynamically adapts to changes in the load characteristics of the resources on computational Grids. Computa tional Grids [11] involve large system dynamics that the ability to migrate ex ecuting ....

[Article contains additional citation context not shown here]

R. Vuduc, J. W. Demmel, K. A. Yelick, S. Kamil, R. Nishtala, and B. Lee. Performance Optimizations and Bounds for Sparse Matrix-Vector Multiply. In Proceedings of Supercomputing, Baltimore, MD, USA, November 2002.


Performance Modeling and Analysis of Cache Blocking.. - Nishtala, Vuduc.. (2004)   Self-citation (Vuduc Demmel Yelick Nishtala)   (Correct)

No context found.

R. Vuduc, J. W. Demmel, K. A. Yelick, S. Kamil, R. Nishtala, and B. Lee. Performance optimizations and bounds for sparse matrix-vector multiply. In Proceedings of Supercomputing, Baltimore, MD, USA, November 2002.


Automatic Performance Tuning and Analysis of Sparse .. - Vuduc, Kamil, Hsu, .. (2002)   Self-citation (Vuduc Demmel Yelick Kamil Nishtala)   (Correct)

No context found.

R. Vuduc, J. W. Demmel, K. A. Yelick, S. Kamil, R. Nishtala, and B. Lee. Performance optimizations and bounds for sparse matrix-vector multiply. In Proceedings of Supercomputing, Baltimore, MD, 2002. (To appear.).


Memory Hierarchy Optimizations and Performance Bounds .. - Vuduc, Gyulassy.. (2003)   Self-citation (Vuduc Demmel Yelick)   (Correct)

No context found.

R. Vuduc, J. W. Demmel, K. A. Yelick, S. Kamil, R. Nishtala, and B. Lee. Performance optimizations and bounds for sparse matrix-vector multiply. In Proceedings of Supercomputing, Baltimore, MD, USA, November 2002.


Performance Modeling and Analysis of Cache Blocking.. - Nishtala, Vuduc.. (2004)   Self-citation (Vuduc Demmel Yelick Nishtala)   (Correct)

No context found.

R. Vuduc, J. W. Demmel, K. A. Yelick, S. Kamil, R. Nishtala, and B. Lee. Performance optimizations and bounds for sparse matrix-vector multiply. In Proceedings of Supercomputing, Baltimore, MD, USA, November 2002.


When Cache Blocking of Sparse Matrix Vector Multiply.. - Nishtala, Vuduc..   Self-citation (Vuduc Demmel Yelick Nishtala)   (Correct)

No context found.

R. Vuduc, J. W. Demmel, K. A. Yelick, S. Kamil, R. Nishtala, and B. Lee. Performance optimizations and bounds for sparse matrix-vector multiply. In Proceedings of Supercomputing, Baltimore, MD, USA, November 2002.


Automatic Performance Tuning and Analysis of Sparse .. - Vuduc, Kamil, Hsu, .. (2002)   Self-citation (Vuduc Demmel Yelick Kamil Nishtala)   (Correct)

....sparse data structures. Consequently, it is not unusual to see kernels such as SpTS run at under 10 of peak uniprocessor floating point performance. Our approach to automatic tuning of SpTS builds on prior experience with building tuning systems for sparse matrix vector multiply (SpMV) [21, 22, 40], and dense matrix kernels [8, 41] In particular, we adopt the two step methodology of previous approaches: 1) we identify and generate a set of reasonable candidate implementations, and (2) search this set for the fastest implementation by some combination of performance modeling and actually ....

....furthermore, we observe speedups of up to 1.8x when both register blocking and switch to dense optimizations are applied. We also present preliminary results confirming that our heuristics choose reasonable values for the tuning parameters. These results support our prior findings with SpMV [40], suggesting two new directions for performance enhancements: 1) the use of higher level matrix structures (e.g. matrix reordering and multiple register block sizes) and (2) optimizing kernels with more opportunities for data reuse (e.g. multiplication and solve with multiple vectors, ....

[Article contains additional citation context not shown here]

R. Vuduc, J. W. Demmel, K. A. Yelick, S. Kamil, R. Nishtala, and B. Lee. Performance optimizations and bounds for sparse matrix-vector multiply. In Proceedings of Supercomputing, Baltimore, MD, 2002. (To appear.).


Adaptive History-Based Memory Schedulers - Hur, Lin   (Correct)

No context found.

R. Vuduc, J. W. Demmel, K. A. Yelick, S. Kamil, R. Nishtala, and B. Lee. Performance optimizations and bounds for sparse matrix-vector multiply. In Proceedings of the 2002.


Improving Geographical Locality of Data for Shared.. - Löf, Nordén.. (2004)   (Correct)

No context found.

Richard Vuduc, James W. Demmel, Katherine A. Yelick, Shoaib Kamil, Rajesh Nishtala, and Benjamin Lee. Performance optimizations and bounds for sparse matrix-vector multiply. In Proceedings of the 2002.


Optimizing Sparse Matrix-Vector Product Computations.. - Mellor-Crummey, Garvin (2003)   (Correct)

No context found.

R. Vuduc, J. Demmel, K. Yelick, S. Kamil, R. Nishtala, and B. Lee. Performance optimizations and bounds for sparse matrix-vector multiply. In Proceedings of SC'02: High Performance Networking and Computing, Nov. 2002.

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC