| B. Kagstrom, P. Ling, and C. Van Loan. GEMM--Based Level--3 BLAS: Algorithms for the Model Implementations. Report UMINF-94.13, Department of Computing Science, Umea University, S-901 87 Umea, Sweden, December 1994. |
....Fortran 77 and designed to be highly efficient on machines with a memory hierarchy. All routines are effectively structured to reduce data traffic in a memory hierarchy. A detailed description of the algorithms used in our model implementation for the different level 3 operations is presented in [10]. The design principles, including blocking strategy, use of local arrays and alternative code sections performing the same task are discussed in [10] and to some extent in [8] It is beyond the scope of this extended abstract to present any design principles in detail. The user supplies ....
....a memory hierarchy. A detailed description of the algorithms used in our model implementation for the different level 3 operations is presented in [10] The design principles, including blocking strategy, use of local arrays and alternative code sections performing the same task are discussed in [10] (and to some extent in [8] It is beyond the scope of this extended abstract to present any design principles in detail. The user supplies underlying routines, the level 3 BLAS routine GEMM and some level 1 and level 2 BLAS kernels. If they are efficiently optimized for the target machine, the ....
B. Kagstrom, P. Ling, and C. Van Loan. GEMM--Based Level--3 BLAS: Algorithms for the Model Implementations. Report UMINF-94.13, Department of Computing Science, Umea University, S-901 87 Umea, Sweden, December 1994.
....5 High Performance Model Implementations The model implementations are written in Fortran 77 and are structured to effectively reduce data traffic in a memory hierarchy. A detailed description of the algorithms used in our model implementations for the different level 3 operations is presented in [19]. These descriptions include block partitionings and associated GEMM based templates for different options of the operations. Since these descriptions are very spacedemanding we only give a brief description of the GEMM based implementations here. This includes the characteristics of each complete ....
B. Kagstrom, P. Ling, and C. Van Loan. GEMM-Based Level 3 BLAS: Algorithms for the Model Implementations. Report UMINF-94.13, Department of Computing Science, Umea University, S-901 87 Umea, Sweden, December 1994.
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC