(Enter summary)
Abstract: Symmetric multiprocessor (SMP) clusters are emerging as the cost-effective medium- to
large-scale parallel computers of choice, exploiting the superior cost-performance of SMP
desktops and servers. These machines implement communication among SMP nodes by
sending/receiving messages through an interconnection network. Many applications and
systems use a variety of software protocols to coordinate this communication. As such,
protocol performance can significantly impact communication time and... (Update)
Context of citations to this paper: More
.... to 4) of independent coherence events is quite small and well within the maximum depth that can be searched in a single cycle in hardware [2]. While dynamic partitioning can best balance the load, it requires the directory and the remote cache to allow simultaneous access from...
Cited by: More
Address Partitioning in DSM Clusters with Parallel.. - Pragaspathy, Falsafi (2000)
(Correct)
Active bibliography (related documents): More All
1.9: Fine-Grain Distributed Shared Memory on Clusters of Workstations - Schoinas (1997)
(Correct)
1.1: Design and Evaluation of Network Interfaces for System Area.. - Mukherjee (1998)
(Correct)
1.0: Sirocco: Cost-Effective Fine-Grain Distributed . . . - Schoinas, al. (1998)
(Correct)
Similar documents based on text: More All
0.2: JETTY: Filtering Snoops for Reduced Energy Consumption in SMP.. - Moshovos, al. (2001)
(Correct)
0.2: Dynamic Feature Selection for Hardware Prediction - Alan Fern Robert (2000)
(Correct)
0.1: Coherent Network Interfaces for Fine-Grain Communication - Mukherjee, Falsafi, al. (1996)
(Correct)
BibTeX entry: (Update)
B. Falsafi. Fine-Grain Protocol Execution Mechanisms & Scheduling Policies on SMP Clusters. PhD thesis, Computer Sciences Department, University of Wisconsin--Madison, 1998. http://citeseer.ist.psu.edu/falsafi98finegrain.html More
@techreport{ falsafi98finegrain,
author = "Babak Falsafi",
title = "Fine-Grain Protocol Execution Mechanisms and Scheduling Policies on {SMP} Clusters",
number = "CS-TR-1998-1374",
year = "1998",
url = "citeseer.ist.psu.edu/falsafi98finegrain.html" }
Citations (may not include all citations):
723
Memory coherence in shared virtual memory systems
- Li, Hudak - 1989
422
Implementation and performance of Munin
- Carter, Bennett et al. - 1991
406
Treadmarks: Distributed shared memory on standard workstatio..
- Keleher, Dwarkadas et al. - 1993
367
Computer Architecture: A Quantitative Approach (context) - Hennessy, Patterson - 1990
362
The stanford FLASH multiprocessor (context) - Kuskin - 1994
341
Parallel programming in Split-C
- Culler, Dusseau et al. - 1993
304
Scheduler activations: Effective kernel support for the user..
- Anderson, Bershad et al. - 1992
275
Virtual memory mapped network interface for the SHRIMP multi..
- Blumrich, Li et al. - 1994
268
Tempest and Typhoon: User-level shared memory
- Reinhardt, Larus et al. - 1994
257
force calculation algorithm (context) - Barnes, Hut et al. - 1986
240
for mimd distributed-memory machines (context) - Hiranandani, Kennedy et al. - 1992
230
Cilk: An efficient multithreaded runtime system
- Blumofe, Joerg et al. - 1995
226
Lightweight remote procedure call (context) - Bershad, Anderson et al. - 1990
222
The SGI Origin: A ccNUMA highly scalable server (context) - Laudon, Lenoski - 1997
200
Programming languages for distributed computing systems (context) - Bal, Steiner et al. - 1989
190
Process control and scheduling issues for multiprogrammed sh.. (context) - Tucker, Gupta - 1989
173
Protocol verification as a hardware design aid
- Dill, Drexler et al. - 1992
171
Eel: Machine-independent executable editing (context) - Larus, Schnarr - 1995
158
Implementing remote procedure calls (context) - Birrel, Nelson - 1984
155
Cache coherence protocols: Evaluation using a multiprocessor.. (context) - Archibald, Baer - 1986
149
Multicomputers: Message-passing concurrent computers (context) - Athas, Seitz - 1988
145
CRL: High-performance all-software distributed shared memory
- Johnson, Kaashoek et al. - 1995
131
Fine-grain access control for distributed shared memory
- Schoinas, Falsafi et al. - 1994
120
Experiences with a highspeed network adaptor: A software per..
- Druschel, Peterson et al. - 1994
111
Myrinet: A gigabit-per-second local area network (context) - Boden, Cohen et al. - 1995
109
Comparative evaluation of latency reducing and tolerating te..
- Gupta, Hennessy et al. - 1991
103
software-only approach for supporting fine-grain shared memo.. (context) - Scales, Gharachorloo et al. - 1996
101
IEEE Transactions on Parallel and Distributed Systems (context) - Agarwal, in - 1992
101
The SPLASH-2 programs: Characterization and methodological c..
- Woo, Ohara et al. - 1995
100
Using continuations to implement thread management and commu..
- Draves, Bershad et al. - 1991
99
Adaptive software cache management for distributed shared me..
- Bennett, Carter et al. - 1990
95
Application-specific protocols for user-level shared memory
- Falsafi, Lebeck et al. - 1994
87
Pathfinder: A pattern-based packet classifier
- Bailey, Gopal et al. - 1994
82
Kernel: An architecture for implementing network protocols (context) - Hutchinson, Peterson - 1991
81
Implementing precise interrupts in pipelined processors (context) - Smith, Plezkun - 1988
77
Net: A user-level network interface for parallel and distrib.. (context) - Basu, Werner et al. - 1995
74
The implications of cache affinity on processor scheduling f.. (context) - Vaswani, Zahorjan - 1991
70
Adaptive cache coherency for detecting migratory shared data
- Cox, Fowler - 1993
65
Object distribution in Orca using compiletime and run-time t..
- Bal, Kashoek - 1993
61
Where is time spent in message -passing and shared-memory pr..
- Chandra, Larus et al. - 1994
61
Department of Computer Science (context) - Mosberger, Peterson et al. - 1995
58
Using processor-cache affinity information in shared-memory .. (context) - Squillante, Lazowska - 1990
52
Decoupled hardware support for distributed shared memory (context) - Reinhardt, Pfile et al. - 1996
52
Adaptive cache coherence protocol optimized for migratory sh.. (context) - Stenstrom, Brorsson et al. - 1993
48
Optimistic active messages: A mechanism for scheduling commu..
- Wallach, Hsieh et al. - 1995
47
Cummings Publishing Company (context) - Almasi, Gottlieb et al. - 1989
46
Fine-grain software distributed shared memory on SMP cluster..
- Scales, Gharachorloo et al. - 1997
43
Implementing lightweight threads
- Stein, Shah - 1992
41
hpf compiler for distributed memory mimd computers: Design (context) - Bozkus, Choudhary et al. - 1993
37
and node granularity issues for large-scale multiprocessors (context) - Rothberg, Singh et al. - 1993
36
Reactive NUMA: A design for unifying SCOMA and CC-NUMA (context) - Falsafi, Wood - 1997
34
and bandwidth in distributed shared memory multiprocessors (context) - Holt, Heinrich et al. - 1995
34
Coherent network interfaces for fine-grain communication
- Mukherjee, Falsafi et al. - 1996
33
Wisconsin Wind Tunnel II: A fast and portable parallel archi..
- Mukherjee, Reinhardt et al. - 1997
31
Fast interrupt priority management in operating systems
- Stodolsky, Chen et al. - 1993
30
Cost-effective parallel computing
- Wood, Hill - 1995
27
Orca: A language for distributed programming
- Bal, Tanenbaum et al. - 1990
26
Kernel support for the Wisconsin Wind Tunnel
- Reinhardt, Falsafi et al. - 1993
24
Multis: A new class of multiprocessor computers (context) - Bell - 1985
21
Billion-transistor architectures
- Burger, Goodman - 1997
21
Scheduling communication on an SMP node parallel machine (context) - Falsafi, Wood - 1997
19
The CM-5 connection machine: A scalable supercomputer (context) - Hillis, Tucker - 1993
18
Hardware and software support for efficient exception handli..
- Thekkath, Levy - 1994
17
and iPSC/860 User's Guide (context) - Corporation - 1990
16
Coherence controller architectures for SMP-based CC-NUMA mul..
- Michael, Nanda et al. - 1997
16
Simple COMA node implementations
- Hagersten, Saulsbury et al. - 1994
16
The effectiveness of affinity -based scheduling in multiproc..
- Salehi, Kurose et al. - 1996
15
Intel Supercomputer Systems Division (context) - Corporation, summary - 1993
14
Dynamic pointer allocation for scalable cache coherence dire..
- Simoni, Horowitz - 1991
14
Typhoon-Zero implementation: The vortex module
- Pfile - 1995
12
Protected User-level DMA for the SHRIMP Network Interface
- Blumrich, Dubnicki et al. - 1996
11
Computing surface 2: Overview documentation set (context) - Inc - 1993
11
Tempest interface specification (context) - Reinhardt - 1995
9
and David Patterson (context) - Anderson, Culler - 1995
8
Gigaplane -XB: Extending the Ultra Enterprise family
- Charlesworth, Phelps et al. - 1997
8
Multiprocessor runtime support for fine-grained (context) - Chong, Sharma et al. - 1995
7
performance of a parallel computer simulator (context) - Falsafi, Wood - 1994
6
STiNG: A CC-NUMA compute system for the commercial marketpla.. (context) - Lovett, Clapp - 1996
6
protected communication on SMP clusters (context) - Lim, Heidelberg et al. - 1997
6
LU factorization and the LINPACK benchmark on the Intel Para..
- Womble, Greenberg - 1994
6
Active messages: a mechanism for integrating communication a.. (context) - von Eicken, Culler et al. - 1992
6
The Mercury interconnect architecture: A cost-effective infr.. (context) - Weber, Gold et al. - 1997
6
An SCI interconnect chipset and adapter (context) - Clark, Alnes - 1996
6
Sirocco: Cost-effective fine-grain distributed shared memory (context) - Schoinas, Falsafi et al. - 1997
6
Remote queues: Exposing message queues or optimization and a.. (context) - Brewer, Chong et al. - 1995
5
Mechanisms for Distributed Shared Memory
- Reinhardt - 1996
5
When does dedicated protocol processing make sense
- Falsafi, Wood - 1996
5
StartT-ng: Delivering seamless parallel computing (context) - Chiou, Ang et al. - 1995
5
Locking effects in multiprocessor implementations of protoco.. (context) - Bjoerkman, Gunningberg - 1993
4
SCI: Scalable Coherent Interface: Logical (context) - Gustavson, James - 1991
4
Fine-Grain Distributed Shared Memory on a Cluster of Worksta..
- Schoinas - 1997
2
The impact of message traffic on multicomputer memory hierar..
- Pakin, Chien - 1994
2
ACM Transactions on Modeling and Computer Simulation (context) - Falsafi, Wood et al. - 1997
1
ACM Transactions on Networking (context) - Kaiserswerth, protocol et al. - 1993
1
Software Techniques for Customizable Distributed Shared Memo.. (context) - Chandra - 1997
1
A Simulation Study of Network Interface Design Alternatives (context) - Mukherjee - 1998
Documents on the same site (http://www.cs.wisc.edu/~david/): More
Reflections on "Tempest and Typhoon: User-level Shared Memory" - Reinhardt, Larus, Wood (1994)
(Correct)
Paging Tradeoffs in Distributed-Shared-Memory.. - Burger, Hyder, Miller, Wood (1994)
(Correct)
Relaxed Consistency and Coherence Granularity in DSM Systems.. - Yuanyuan Zhou (1997)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC