| C.C. Douglas, M. Heroux, G. Slishman, and R.M. Smith. GEMMV: A Portable Level 3 BLAS Winograd Variant of Strassen's Matrix-Matrix Multiply Algorithm. J. Comp. Physics, 110:1--10, 1994. |
....(GEMV) Morover, the GEMM based approach provides possibilities to invoke parallelism, for example, by using parallel versions of the underlying routines. It is also possible to create a level 3 BLAS library based on fast algorithms for the GEMM operation, e.g. Strassen s or Winograd s algorithms [25, 26, 15, 11]. Our contribution is two fold. First, the model implementations in Fortran 77 of the GEMM based level 3 BLAS, which are structured to effectively reduce data traffic in a memory hierarchy. Second, the GEMM based level 3 BLAS performance evaluation benchmark, which is a tool for evaluating and ....
....in underlying BLAS routines) ffl Parallelism, through automatic parallelization by a compiler, or by using parallel underlying BLAS kernels. ffl A level 3 BLAS library based on unconventional underlying matrix multiply algorithms like, for example, Strassen s or Winograd s algorithms (e.g. see [25, 26, 15, 11]) We have also contributed with the GEMM based level 3 BLAS performance evaluation benchmark. This program package facilitates the evaluation and comparison between different level 3 BLAS libraries. The benchmark compares a user specified level 3 BLAS library (e.g. a vendor supplied library) ....
C.C. Douglas, M. Heroux, G. Slishman, and R.M. Smith. GEMMV: A Portable Level 3 BLAS Winograd Variant of Strassen's Matrix-Matrix Multiply Algorithm. J. Comp. Physics, 110:1--10, 1994.
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC