See this document in CiteSeerX!

Low Communication FFTs (2002)  (Make Corrections)  
Franz Franchetti, Juergen Lorenz, Christoph W. Ueberhuber



  Home/Search   Context   Related

 
View or download:
vcpc.univie.ac.at/...oratr200227.ps.gz
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  vcpc.univie.ac.at/aurora/publi... (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: Top performance implementations of the fast Fourier transform (FFT) can only be realized through massive parallelism, in particular by utilizing the most advanced hardware architectures such as cellular computing . (Update)

Active bibliography (related documents):   More   All
0.7:   Parallel FFT Algorithms with Reduced Communication Overhead - Karner, Ueberhuber (1998)   (Correct)
0.7:   Optimum Complexity FFT Algorithms for RISC Processors - Karner, Auer, Ueberhuber (1998)   (Correct)
0.3:   Challenges of Computing the Fast Fourier Transform - Johnson, Johnson (1997)   (Correct)

Similar documents based on text:   More   All
0.6:   Latency Hiding Parallel FFTs - Franchetti, Lorenz, Ueberhuber (2002)   (Correct)
0.6:   Optimization Techniques for SIMD Vectorized Straight.. - Kral, Franchetti.. (2003)   (Correct)
0.4:   Practical Assessment of MAP's Vectorizer and Backend - Kral, Franchetti, Lorenz.. (2003)   (Correct)

BibTeX entry:   (Update)

@misc{ franchetti-low,
  author = "Franz Franchetti and Juergen Lorenz and Christoph W. Ueberhuber",
  title = "Low Communication FFTs",
  url = "citeseer.ist.psu.edu/franchetti02low.html" }
Citations (may not include all citations):
326   Topics in Matrix Analysis (context) - Horn, Johnson - 1991
98   Parallel Programming with MPI (context) - Pacheco - 1997
62   Computational Frameworks for the Fast Fourier Transform (context) - Van Loan - 1992
55   Multiprocessor FFTs (context) - Swarztrauber - 1987
22   SPIRAL: A Generator for Platform-Adapted Libraries of Signal.. (context) - uschel, Singer et al. - 2002
18   and Implementing Fourier Transform Algorithms on Various Arc.. (context) - Johnson, Johnson et al. - 1990
16   Parallelization and Performance Analysis of the Cooley-Tukey.. (context) - Norton, Silberger - 1987
14   The Emergence of Cellular Computing (context) - Sipper - 1999
12   A Framework for Generating Distributed-Memory Parallel Progr.. (context) - Gupta, Huang et al. - 1996
12   Fast Mixed-Radix Real Fourier Transforms (context) - Temperton - 1983
8   and Implementing FFT Algorithms on Various Architectures (context) - Johnson, Johnson et al. - 1990
7   Numerical Computation (context) - Ueberhuber - 1997
7   The Kronecker Product in Optimization and Fast Transform Gen.. (context) - Pitsianis - 1997
5   An Adaption of the Fast Fourier Transform for Parallel Proce.. (context) - Pease - 1968
4   Implementation of a Prime Factor FFT Algorithm on CRAY (context) - Temperton - 1988
1   Parallel FFT Algorithms with Reduced Communication Overhead - Karner, Ueberhuber - 1998
1   Two Challenges for Computing Today (context) - Rideau, Free et al. - 1999
1   Challenges of Computing the Fast Fourier Fransform (context) - Johnson, Johnson - 1997

Documents on the same site (http://www.vcpc.univie.ac.at/aurora/publications/):   More
Dynamic Load Balancing on Heterogeneous Workstation.. - Hlavacs, Ueberhuber (1998)   (Correct)
Dynamic Asset Allocation under Uncertainty for Pension Fund .. - Pflug, Swietanowski (1998)   (Correct)
Estimating Cache Performance for Sequential and Data Parallel.. - Fahringer (1997)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC