(Enter summary)
Abstract: This paper describes the synchronization and communication primitives of the Cray T3E multiprocessor, a shared memory system scalable to 2048 processors. We discuss what we have learned from the T3D project (the predecessor to the T3E) and the rationale behind changes made for the T3E. We include performance measurements for various aspects of communication and synchronization. The T3E augments the memory interface of the DEC 21164 microprocessor with a large set of explicitly-managed, external ... (Update)
Cited by: More
Software Methods to Improve Data Locality and Cache Behavior - Beyls (2004)
(Correct)
Merl -- A Mitsubishi Electric Research Laboratory - Http Www Merl (1998)
(Correct)
Adaptive History-Based Memory Schedulers - Hur, Lin
(Correct)
Active bibliography (related documents): More All
0.3: Communication Performance of Wormhole Interconnection Networks - Petrini (1997)
(Correct)
0.3: Cache Coherence Protocol and its Implementation in a.. - Jaseemuddin, Vranesic
(Correct)
0.3: Two Virtual Memory Mapped Network Interface Designs - Blumrich, Dubnicki.. (1994)
(Correct)
Similar documents based on text: More All
0.0: Func_mkdb User's Manual - Hol (1988)
(Correct)
0.0: SLS: Switch-Level Simulator User's Manual - de Graaf, van Genderen (1988)
(Correct)
0.0: SpecC System-level Static Scheduling - Chang, Gajski (1999)
(Correct)
Related documents from co-citation: More All
14: Virtual memory mapped network interface for the SHRIMP multicomputer
- Blumrich - 1994
13: Parallel programming in Split-C
- Culler - 1993
12: Active Messages: a Mechanism for Integrated Communication and Computation
- von Eicken, Culler et al. - 1992
BibTeX entry: (Update)
S. L. Scott, Synchronization and communication in the T3E multiprocessor, in Architectural Support for Programming Languages and Operating Systems (ASPLOS-VII), Cambridge, Massachusetts, October 1996, pp. 26--36. Available from http://reality.sgi.com/ sls craypark/Papers/asplos96.html. http://citeseer.ist.psu.edu/scott96synchronization.html More
@inproceedings{ scott96synchronization,
author = "Steven L. Scott",
title = "Synchronization and Communication in the T3E Multiprocessor",
booktitle = "Architectural Support for Programming Languages and Operating Systems",
pages = "26-36",
year = "1996",
url = "citeseer.ist.psu.edu/scott96synchronization.html" }
Citations (may not include all citations):
912
MPI: A Message-Passing Interface Standard
- Interface - 1994
835
High Performance Fortran Language Specification Version
- Fortran - 1994
362
The Stanford FLASH Multiprocessor (context) - Kuskin, Ofelt et al. - 1994
357
The Directory-Based Cache Coherence Protocol for the DASH Mu.. (context) - Lenoski, Laudon et al. - 1990
268
Tempest and Typhoon: User Level Shared Memory
- Reinhardt, Larus et al. - 1994
239
Algorithms for Scalable Synchronization on Shared-Memory Mul.. (context) - Mellor-Crummey, Scott - 1991
212
The MIT Alewife Machine: Architecture and Performance
- Agarwal, Bianchini et al.
186
A Methodology for Implementing Highly Concurrent Data Object..
- Herlihy - 1993
173
Hot Spot Contention and Combining in Multistage Interconnect.. (context) - Pfister, Norton - 1985
159
The NYU Ultracomputer - Designing an MIMD Shared Memory Para.. (context) - Gottlieb, Grishman et al. - 1983
125
Wait-Free Synchronization
- Herlihy - 1991
114
CRAY TD System Architecture Overview (context) - Inc, Architecture - 1993
108
Paragon XP/S Product Overview (context) - Corporation - 1991
98
Evaluating Stream Buffers as a Secondary Cache Replacement (context) - Palacharla, Kessler - 1994
85
CM5 Technical Summary (context) - Corporation - 1992
61
A Tightly-Coupled ProcessorNetwork Interface
- Henry, Joerg - 1992
61
Vienna Fortran - A Fortran Language Extension for Distribute.. (context) - Chapman, Mehrotra et al. - 1991
50
KSR-1 Technical Summary (context) - Research - 1992
49
PVM: A Users' Guide and Tutorial for Networked Parallel Comp.. (context) - Geist, Beguelin et al. - 1994
44
The IBM Research Parallel Processor Prototype (RP3): Introdu.. (context) - Pfister, Brantley et al. - 1985
41
The PowerPC Architecture: A specification for a new family o.. (context) - May, Silha et al. - 1994
41
A Comparison of Architectural Support for Messaging in the T..
- Karamcheti, Chien - 1995
38
The J-Machine Multicomputer: An Architectural Evaluation
- Noakes, Wallach et al. - 1993
34
Alpha 21164 Microprocessor Hardware Reference Manual (context) - Corporation - 1995
31
Memory Bandwidth and Machine Balance in Current High Perform.. (context) - McCalpin - 1995
29
Language Specification (context) - Fox, Hiranandani et al. - 1991
28
Translation Lookaside Buffer Consistency: A Software Approac.. (context) - Black, Rashid et al. - 1989
27
T: A Multithreaded Massively Parallel Architecture
- Nikhil, Papadopoulos - 1992
26
The Cray T3E Network: Adaptive Routing in a High Performance.. (context) - Scott, Thorson - 1996
20
MIPS IV Instruction Set (context) - Price - 1995
16
DECchip 21064-AA Microprocessor Hardware Reference Manual (context) - Corporation - 1992
15
A Shared-Memory MPP from Cray Research (context) - Koeninger, Furtney et al. - 1994
14
The NCUBE family of high-performance parallel computer syste.. (context) - Palmer - 1988
9
First and Second Generation Hypercube Performance (context) - Bradley - 1988
7
Empirical Evaluation of the CRAY-T3D: A Compiler Perspective (context) - Arpaci, Culler et al. - 1995
7
The CRAFT Fortran Programming Model (context) - Pase, MacDonald et al. - 1994
6
Limits on Network Performance (context) - Agarwal - 1991
6
Meiko CS-2 interconnect Elan-Elite design (context) - Homewood, McLaren - 1993
6
Simple, Fast, and Practical Non-Blocking and Blocking Concur..
- Michael, Scott - 1996
5
Alpha AXP Architecture Handbook (context) - Corporation - 1994
4
Application Programmer's Library Reference Manual (context) - Research - 1994
4
NAS Parallel Benchmarks Results 395 (context) - Saini, Bailey - 1995
3
The GigaRing Channel (context) - Scott - 1996
1
STREAM `standard' results (context) - McCalpin - 1996
1
Specification and Interface (context) - Systems, Kernel - 1993
The graph only includes citing articles where the year of publication is known.
Documents on the same site (http://www.eecg.toronto.edu/~tcm/CourseECE1760.html): More
The Performance Impact of Flexibility in the Stanford FLASH.. - Heinrich (1994)
(Correct)
An Integrated Compile-Time/Run-Time Software.. - Dwarkadas, Cox.. (1996)
(Correct)
Compiler and Hardware Support for Cache Coherence in Large-Scale .. - Choi, Yew (1996)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC