This paper presents efficient algorithms to implement multicast communication in scalable, wormhole-routed multiprocessors. A multicast communication service may in turn be used to support various higher-level operations for data movement and global process control. The algorithms presented are applicable to many existing scalable architectures, in particular, those using wormhole routing and n-dimensional mesh network topologies. Performance evaluation of implementations on a 64-node nCUBE-2 and a
|
539
|
Deadlock-Free Message Routing in Multiprocessor Interconnection Networks
– Dally, Seitz
- 1987
|
|
428
|
A survey of wormhole routing techniques in direct networks
– Ni, McKinley
- 1993
|
|
371
|
Virtual-Channel Flow Control
– Dally
- 1992
|
|
229
|
Optimum broadcasting and personalized communication in hypercubes
– Johnsson, Ho
- 1989
|
|
224
|
The torus routing chip
– Dally, Seitz
- 1986
|
|
209
|
The turn model for adaptive routing
– Glass, Ni
- 1998
|
|
135
|
Deadlock-Free Adaptive Routing in Multicomputer Networks Using Virtual Channels
– Dally, Aoki
- 1993
|
|
129
|
iWARP: An integrated solution to high-speed parallel computing
– Borkar, Cohn, et al.
- 1988
|
|
128
|
An Adaptive and Fault Tolerant Wormhole Routing Strategy for k-ary n-cubes
– Linder, Harden
- 1991
|
|
126
|
Deadlock-Free Multicast Wormhole Routing in Multicomputer Networks
– Lin, Ni
- 1991
|
|
94
|
Fortran 90 Explained
– Metcalf, Reid
- 1990
|
|
89
|
On the Design of Deadlock-Free Adaptive Routing Algorithms for Multicomputers: Design Methodologies
– Duato
- 1991
|
|
72
|
The Message-Driven Processor: A Multicomputer Processing Node with Efficient Mechanisms
– Dally, Fiske, et al.
- 1992
|
|
53
|
Efficient Implementation of Barrier Synchronization in Wormhole-Routed Hypercubes Multicomputers
– Xu, McKinley, et al.
- 1992
|
|
46
|
Adaptive deadlock- and livelock-free routing with all minimal paths in torus networks
– Gravano, Pifarre, et al.
- 1994
|
|
45
|
Multicast in Hypercube Multiprocessors
– Lan, Esfhanian, et al.
- 1990
|
|
36
|
The architecture and programming of the ametek series 2010 multicomputer
– Seitz, Athas, et al.
- 1988
|
|
36
|
Distributed Routing Algorithms for Broadcasting and Personalized Communication in Hypercubes
– Ho, Johnsson
- 1986
|
|
33
|
Scalability of parallel algorithms for the all-pairs shortest path problem
– Kumar, Singh
- 1991
|
|
29
|
Performance Evaluation of Multicast Wormhole Routing
– Lin, McKinley, et al.
- 1991
|
|
28
|
A Hypercube Shared Virtual Memory
– Li, Schaefer
- 1989
|
|
27
|
An overview of the nCUBE 3 supercomputer
– Duzett, Buck
- 1992
|
|
26
|
High performance communications in processor networks
– Jesshope, Miller, et al.
- 1989
|
|
26
|
The message flow model for routing in wormhole-routed networks
– Lin, Mckinley, et al.
- 1993
|
|
25
|
MPP Fortran Programming Model
– Pase, McDonald, et al.
- 1994
|
|
20
|
ComPaSS: Efficient communication services for scalable architectures
– McKinley, Xu, et al.
- 1992
|
|
17
|
A Survey of Routing Techniques in Wormhole Networks
– Ni, McKinley
- 1991
|
|
17
|
The J-Machine: System Support for Actors
– Dally
- 1989
|
|
15
|
Performance indices for parallel marker-propagation
– DeMara, Moldovan
- 1991
|
|
10
|
Efficient collective data distribution in all-port wormhole-routed hypercubes
– Robinson, Judd, et al.
- 1993
|
|
6
|
A VLSI Router Design for Hypercube Multiprocessors
– Lan, Ni, et al.
- 1989
|
|
2
|
Fortran D language specification
– Kennedy, Koelbel, et al.
- 1990
|
|
2
|
Efficient implementation of distributed barrier synchronization in hypercube multicomputers." to appear
– Xu, McKinley, et al.
- 1992
|