3 citations found. Retrieving documents...
IBM. Engineering and Scientific Subroutine Library, Guide and Reference, 2 edition, 1992.

 Home/Search   Document Not in Database   Summary   Related Articles   Check  

This paper is cited in the following contexts:
Generic Programming for High Performance Numerical Linear.. - Siek, Lumsdaine, Lee (1998)   (9 citations)  (Correct)

....C compiler and the Fortran BLAS (obtained from Netlib) were compiled with the Solaris Fortran 77 compiler. All possible compiler optimization flags were used in all cases. To demonstrate portability across different architectures and compilers, Fig. 4 also compares the performance of MTL with ESSL [9] on an IBM RS 6000 590. In this case, the MTL executable was compiled with the KCC and IBM xlc compilers. To demonstrate genericity across different data structures and data types, Fig. 5 shows performance results obtained using the same generic matrix vector multiplication algorithm for dense and ....

IBM. Engineering and Scientific Subroutine Library, Guide and Reference, 2 edition, 1992.


GEMM-Based Level 3 BLAS: High-Performance Model.. - Kågström, Ling, Van Loan (1995)   (9 citations)  (Correct)

....and fi in HER2K are defined to be real and not complex. 6 GEMM Based Level 3 BLAS Benchmark Many people have put a lot of effort into developing fast level 3 BLAS since the specification was published in 1990 [9, 10] Some vendors provide highly optimized BLAS for their machines, see for example [2, 1, 16, 4, 24], while others provide optimized versions of some or none of the routines. Vendor independent groups have also developed tuned level 3 kernels for different machines, for example [23, 17, 13, 6, 14] where some are based on the GEMM based concept [17, 6, 14] Today different implementations with ....

....data are very similar. 7.1 Performance results of the GEMM based model implementations In the first set of tables we compare the performance of the GEMM based routines with optimized vendor supplied level 3 BLAS. Tables 9 and 10 show GEMM ratios for level 3 BLAS provided in the IBM ESSL library [16]. The results are obtained on IBM RS6000 250 and IBM RS6000 530H (Table 9) and on a thin and wide node, respectively, of the scalable IBM SP2 system (Table 10) The underlying routines of the GEMM based library are from ESSL, except for the results on IBM RS6000, where we used our own developed ....

IBM. Engineering and Scientific Subroutine Library, Guide and Reference, January 1994. SC23--0526--01.


Parallelization of the two-dimensional Ising Model on a.. - Altevogt, Linke   (Correct)

....auxiliary lattices if necessary. In detail: Allocates storage (an array) for random numbers (one for each spin of a line) For each line, starting with the upper line: Initializes the array with random numbers between 0 and 1. using the surand( subroutine of the ESSL subroutine library [31] and For all spins in a line, starting with the leftmost spin: Calculates the sum of the neighbouring spins (this is done in parallel for ten spins with the help of the Multispin Coding technique) For all spins in a word: Uses the above sum as an index in the look up table for e ....

Engineering and Scientific Subroutine Library (Version 2), Guide and Reference, IBM Corporation (1992).

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC