See this document in CiteSeerX!

Building a High-Performance Collective Communication Library  (Make Corrections)  (24 citations)
Mike Barnett, Satya Gupta, David G. Payne, Lance Shuler, et al.
Supercomputing



  Home/Search   Context   Related

 
View or download:
syr.edu/~jwatts/SC94.ps
utexas.edu/pub/rvdg/SC94paper.ps
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  syr.edu/html/publications (more)
From:  utexas.edu
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: In this paper, we report on a project to develop a unified approach for building a library of collective communication operations that performs well on a cross-section of problems encountered in real applications. The target architecture is a two-dimensional mesh with worm-hole routing, but the techniques are more general. The approach differs from traditional library implementations in that we address the need for implementations that perform well for various sized vectors and grid dimensions, ... (Update)

Context of citations to this paper:   More

...tree broadcast operation. This communication pattern is often used to implement other common operations such as reduce, gather and scatter [15, 2]. Since a binary tree is used for sending messages one would expect executing time to increase logarithmically with the number of...

...for scientific computing and the algorithms chosen for their implementation can greatly influence their performance. In papers [8, 13], the authors present a library of collective communication routines, called iCC, that uses different algorithms for small and large...

Cited by:   More
Comparing the Communication Performance and Scalability - Of Linux And   (Correct)
Evaluating the Performance of MPI-2 One-Sided Routines on a - Cray Sv Glenn   (Correct)
The Performance of the MPI Collective Communication - Routines For Large   (Correct)

Similar documents (at the sentence level):
22.7%:   Fast Collective Communication Libraries, Please - Mitra, Payne (1995)   (Correct)
19.1%:   Interprocessor Collective Communication Library (InterCom) - Barnett, Gupta, Payne, al. (1994)   (Correct)

Active bibliography (related documents):   More   All
0.6:   Efficient Collective Communication on Multidimensional Meshes with .. - Watts (1994)   (Correct)
0.5:   Broadcasting on Meshes with Worm-Hole Routing - Barnett, Payne (1996)   (Correct)
0.5:   A Pipelined Broadcast for Multidimensional Meshes - Watts, al.   (Correct)

Similar documents based on text:   More   All
0.2:   Global Combine on Mesh Architectures with Wormhole Routing - Barnett Littlefield Payne (1993)   (Correct)
0.2:   Global Combine Algorithms for 2-D Meshes With Wormhole Routing - Barnett Littlefield   (Correct)
0.2:   Short Vector Code Generation for the Discrete Fourier Transform - Franchetti, Püschel   (Correct)

Related documents from co-citation:   More   All
10:   Technical report (context) - Servers - 1997
9:   Cray TE Network Adapatative Routing High Perfromance D Toru (context) - Thorson, Network et al. - 1996
9:   Performance CRAY TE Multiprocessor (context) - Grassl, of et al. - 1997

BibTeX entry:   (Update)

M. Barnett; S. Gupta; D. Payne; L. Shuler; R. Vande Geijn, "Building a High Performance Collective Communication Library," available at http://www.cs.utexas.edu/users/rvdg/confe rence.html. http://citeseer.ist.psu.edu/140591.html   More

@inproceedings{ barnett94building,
    author = "Michael Barnett and Lance Shuler and Satya Gupta and David G. Payne and Robert A. van de Geijn and Jerrell Watts",
    title = "Building a high-performance collective communication library",
    booktitle = "Supercomputing",
    pages = "107-116",
    year = "1994",
    url = "citeseer.ist.psu.edu/140591.html" }
Citations (may not include all citations):
351   A Survey of Wormhole Routing Techniques in Direct Networks (context) - Ni, McKinley - 1993
71   Interprocessor Collective Communication Library - Barnett, Gupta et al. - 1994
53   The Design of a Standard Message Passing Interface for Distr.. - Walker - 1994
35   Data Communication in Parallel Architectures (context) - Saad, Schultz - 1989
34   Distributed Routing Algorithms for Broadcasting and Personal.. (context) - Ho, Johnsson - 1986
34   Optimal broadcasting in mesh-connected architectures - Barnett, Payne et al. - 1991
33   Global Combine on Mesh Architectures with Wormhole Routing - Barnett, Littlefield et al. - 1993
31   Broadcasting on Meshes with WormHole Routing - Barnett, Payne et al.
28   Characterizing and Tuning Communications Perfomance on the T.. (context) - Littlefield - 1992
17   IEEE Computer Society Press (context) - Lillevik, Gigaflop et al. - 1991
15   Efficient Global Combine Operations (context) - Geijn - 1991
15   Broadcasting in Wraparound Meshes with Parallel Monodirectio.. (context) - Bermond, Michallon et al. - 1992
12   A Pipelined Broadcast for Multidimensional Meshes - Geijn, Watts
12   Efficient Communication Primitives on Mesh Architectures wit.. (context) - Barnett, Littlefield et al. - 1993
4   Oak Ridge National Laboratory Technical Report ORNL/TM (context) - Seidel, Linear et al. - 1993
3   Comments on Broadcast Algorithms for Two-Dimensional Grids P.. (context) - Simmen - 1991



The graph only includes citing articles where the year of publication is known.


Documents on the same site (http://www.scp.syr.edu/html/publications.html):   More
Practical Dynamic Load Balancing for Irregular Problems - Watts, Rieffel, Taylor (1996)   (Correct)
A Practical Approach to Dynamic Load Balancing - Watts, Taylor (1997)   (Correct)
SUMMA: Scalable Universal Matrix Multiplication Algorithm - Geijn, Watts (1995)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC