Download:
|
by Chih-yung Chang, Jiann-yuan Tzeng, Jang-ping Sheu
http://axp2.csie.ncu.edu.tw/ftp/pub/tech-report/1996/./johny.ps.gz
Add To MetaCart
Abstract:
In this paper, we present the design and implementation of source-to-source High Performance Fortran assistant Tool (HPFT) in DEC 3000 workstations. For a given sequential program written in Fortran 77, HPFT generates a vectorized, reuse exploited, and/or parallelized version for vector computers. Several new compilation schemes in vectorization, reuse exploitation, and multi-threading are designed in HPFT. Performance evaluator is developed for measuring the system performance. The user interface is also designed for programmer to capture the information related to the compilation and execution of program. Experimental results based on the Convex C3840 vector computer show that the developed HPFT enhances the system performance and usually reduces the program execution time.
Citations
|
676
|
A data locality optimizing algorithm
– Wolf, Lam
- 1991
|
|
640
|
Depth-First Search and Linear Graph Algorithms
– Tarjan
- 1972
|
|
401
|
Supercompilers for Parallel and Vector Computers
– Zima, Chapman
- 1991
|
|
296
|
Advanced compiler optimizations for supercomputers
– Padua, Wolfe
- 1986
|
|
200
|
Improving register allocation for subscripted variables
– Callahan, Carr, et al.
- 1990
|
|
179
|
SUPERB: A tool for semi-automatic MIMD/SIMD parallelization
– ZIMA, BAST, et al.
- 1988
|
|
87
|
Parafrase-2: An environment for parallelizing, partitioning, synchronizing, and scheduling programs on multiprocessors
– Polychronopoulos, Girkar, et al.
- 1989
|
|
67
|
PFC: A Program to Convert Fortran to Parallel Form
– Allen, Kennedy
- 1984
|
|
63
|
Automatic decomposition of scientific programs for parallel execution
– Allen, Callahan, et al.
- 1987
|
|
23
|
On the parallelism of nested for-loops using index shift method
– Liu, Ho, et al.
- 1990
|
|
18
|
Motif programming manual: O'Reilly & Associates
– Heller, Ferguson, et al.
- 1994
|
|
14
|
An Introduction to Numerical Computations
– Yakowitz, Szidarovszky
- 1989
|
|
13
|
Squeezing the most out of an algorithm in Cray FORTRAN
– Dongarra, Eisenstat
- 1984
|
|
9
|
Automatic translation of data parallel programs to message passing programs
– Brandes
- 1993
|
|
4
|
To Compute Numerically: Concepts and Strategies
– Pizer, Wallace
- 1983
|
|
2
|
Synthesizing nested loop algorithms using nonlinear transformation method
– Sheu, Chang
- 1991
|
|
1
|
register allocation
– Vector
- 1992
|
|
1
|
Dependence Analysis for Supercomputing, Kuwer
– Banerjee
- 1988
|
|
1
|
Vectorizing do-loops and exploiting data reuse for vector computers
– Chang, Sheu
- 1993
|
|
1
|
Research Inc., CF77 volumn 4: parallel processing guide
– Cray
- 1981
|