See this document in CiteSeerX!

Design, Implementation, and Impact of Multicast in the ParPar Control Network  (Make Corrections)  
David Er-El, Avi Kavas, Dror G. Feitelson



  Home/Search   Context   Related

 
View or download:
cs.huji.ac.il/labs/paralle...rdgm.ps.gz
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  cs.huji.ac.il/~feit/pub (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: The ParPar system is a high-performance cluster environment supporting a multiuser parallel workload. Its design follows a master-nodes structure, where the master controls all aspects of system activity using a dedicated control network. As nearly all control messages are multicast to a set of nodes, we implemented a reliable multicast protocol for this network based on UDP. This did not have a large impact on performance most of the time, as sending messages is only a small part of the... (Update)

Similar documents (at the sentence level):
9.9%:   The ParPar System: A Software MPP - Dror Feitelson Anat (1999)   (Correct)

Active bibliography (related documents):   More   All
0.7:   Job Scheduling in Multiprogrammed Parallel Systems - Feitelson (1997)   (Correct)
0.4:   Topology and Routing in Clusters: From Theory to Practice - Etsion, Raizman, Feitelson   (Correct)
0.3:   Exception Propagation in the ParPar System - Dror Feitelson Institute   (Correct)

Similar documents based on text:   More   All
0.1:   Comparing Windows NT, Linux, and QNX as the Basis for Cluster .. - Kavas, Feitelson   (Correct)
0.1:   Learning Efficient Parsing - With application to Data Oriented.. - Sima'an (1996)   (Correct)
0.1:   User-Level Communication in a System with Gang Scheduling - Etsion, Feitelson (2001)   (Correct)

BibTeX entry:   (Update)

@misc{ er-el-design,
  author = "David Er-El and Avi Kavas and Dror G. Feitelson",
  title = "Design, Implementation, and Impact of Multicast in the ParPar Control Network",
  url = "citeseer.ist.psu.edu/252332.html" }
Citations (may not include all citations):
609   Myrinet: a gigabit-per-second local area network - Boden, Cohen et al. - 1995
267   Internetworking with TCP/IP (context) - Comer - 1995
205   Totem: a fault-tolerant multicast group communication system - Moser, Melliar-Smith et al. - 1996
182   The Transis approach to high availability cluster communicat.. - Dolev, Malki - 1996
43   The ISIS project: real experience with a fault tolerant prog.. (context) - Birman, Cooper - 1991
30   Scalable Parallel Computing (context) - Hwang, Xu - 1998
13   How to get good performance from the CM-5 data network (context) - Brewer, Kuszmaul - 1994
13   Scalability of the Cedar system - Turner, Veidenbaum - 1994
6   Modeling the communication performance of the IBM SP2 (context) - Abandah, Davidson - 1996
4   A TeraFLOP supercomputer in 1996: the ASCI TFLOP system (context) - Mattson, Scott et al. - 1996
3   ParPar design document version 0.2 (context) - Feitelson, Volovic - 1997
3   Increasing network bandwidth on meshes (context) - Stamatopoulos, Solworth - 1994

Documents on the same site (http://www.cs.huji.ac.il/~feit/pub.html):   More
Communicators: Object-Based Multiparty Interactions for Parallel .. - Feitelson (1991)   (Correct)
The BoW Project - Feitelson   (Correct)
Limitations on Optical Free-Space Crossbar-Like.. - Feitelson, Rudolph.. (1990)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC