Download:
|
by Cevdet Aykanat, Argun Dervis
http://www.cs.bilkent.edu.tr/~aykanat/papers/94paa.ps
Add To MetaCart
Abstract:
In this work, we propose parallel FFT algorithms, for medium-to-coarse grain hypercubeconnected multicomputers, which are more elegant and efficient than the existing ones. The proposed algorithms achieve perfect load-balance for the efficient simplified-butterfly scheme, minimize the communication overhead by decreasing both the number and the volume of concurrent communications. Communication and computation cannot be overlapped easily due to the strong data dependencies in the FFT algorithm. In this paper, we propose a restructuring for the FFT algorithm which enables overlapping each communication with one fifth of the local computations involved in a stage. Two of the proposed parallel FFT algorithms achieve overlapping by exploiting this restructuring while using the efficient table-lookup scheme for complex coefficients. The proposed algorithms are implemented on an Intel's 32-node iPSC/2 hypercube multicomputer. High efficiency values are obtained even for small size FFT problems.
Citations
|
158
|
Schafer Discrete-Time Signal Processing
– Oppenheim, W
- 1989
|
|
63
|
Multiprocessor FFTs
– Swarztrauber
- 1987
|
|
46
|
On Computing the Discrete Fourier Transform
– Winograd
- 1978
|
|
25
|
Benchmarking the iPSC/2 hypercube multiprocessor. Concurrency: Practice and Experience
– Bomans, Roose
- 1989
|
|
24
|
Iterative algorithms for solution of large sparse systems of linear equations on hypercubes
– Aykanat, Ozguner, et al.
- 1988
|
|
21
|
FFT algorithms for vector computers
– Swarztrauber
- 1984
|
|
20
|
An algorithm for the evaluation of finite trigonometric series
– Goertzel
- 1958
|
|
14
|
A parallel FFT on an MIMD machine
– Averbuch, Gabber, et al.
- 1990
|
|
13
|
Tukey, "An Algorithm for the Machine Calculation of the Complex Fourier Series
– Cooley, W
- 1965
|
|
8
|
Multiprocessor FFT methods
– Briggs, Hart, et al.
- 1987
|
|
8
|
Portable Programming within a Message-Passing Model: the FFT as an Example
– Walker
- 1988
|
|
5
|
A Vector Implementation of the Fast Fourier Transform Algorithm
– Fornberg
- 1981
|
|
5
|
The relationship between two fast Fourier transforms
– Good
- 1971
|
|
4
|
An Efficient FFT algorithm on Multiprocessors with Distributed Memory
– Zhu
- 1990
|
|
2
|
Performance of the One-Dimensional Fast Fourier Transform on the Hypercube
– Walton
- 1986
|