See this document in CiteSeerX!

Fine-Grain Protocol Execution Mechanisms & Scheduling Policies on SMP Clusters (1998)  (Make Corrections)  (1 citation)
Babak Falsafi



  Home/Search   Context   Related

 
View or download:
wisc.edu/wwt/theses/falsafi.pdf
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  wisc.edu/~david/ (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: Symmetric multiprocessor (SMP) clusters are emerging as the cost-effective medium- to large-scale parallel computers of choice, exploiting the superior cost-performance of SMP desktops and servers. These machines implement communication among SMP nodes by sending/receiving messages through an interconnection network. Many applications and systems use a variety of software protocols to coordinate this communication. As such, protocol performance can significantly impact communication time and... (Update)

Context of citations to this paper:   More

.... to 4) of independent coherence events is quite small and well within the maximum depth that can be searched in a single cycle in hardware [2]. While dynamic partitioning can best balance the load, it requires the directory and the remote cache to allow simultaneous access from...

Cited by:   More
Address Partitioning in DSM Clusters with Parallel.. - Pragaspathy, Falsafi (2000)   (Correct)

Active bibliography (related documents):   More   All
1.9:   Fine-Grain Distributed Shared Memory on Clusters of Workstations - Schoinas (1997)   (Correct)
1.1:   Design and Evaluation of Network Interfaces for System Area.. - Mukherjee (1998)   (Correct)
1.0:   Sirocco: Cost-Effective Fine-Grain Distributed . . . - Schoinas, al. (1998)   (Correct)

Similar documents based on text:   More   All
0.2:   JETTY: Filtering Snoops for Reduced Energy Consumption in SMP.. - Moshovos, al. (2001)   (Correct)
0.2:   Dynamic Feature Selection for Hardware Prediction - Alan Fern Robert (2000)   (Correct)
0.1:   Coherent Network Interfaces for Fine-Grain Communication - Mukherjee, Falsafi, al. (1996)   (Correct)

BibTeX entry:   (Update)

B. Falsafi. Fine-Grain Protocol Execution Mechanisms & Scheduling Policies on SMP Clusters. PhD thesis, Computer Sciences Department, University of Wisconsin--Madison, 1998. http://citeseer.ist.psu.edu/falsafi98finegrain.html   More

@techreport{ falsafi98finegrain,
    author = "Babak Falsafi",
    title = "Fine-Grain Protocol Execution Mechanisms and Scheduling Policies on {SMP} Clusters",
    number = "CS-TR-1998-1374",
    year = "1998",
    url = "citeseer.ist.psu.edu/falsafi98finegrain.html" }
Citations (may not include all citations):
723   Memory coherence in shared virtual memory systems - Li, Hudak - 1989
422   Implementation and performance of Munin - Carter, Bennett et al. - 1991
406   Treadmarks: Distributed shared memory on standard workstatio.. - Keleher, Dwarkadas et al. - 1993
367   Computer Architecture: A Quantitative Approach (context) - Hennessy, Patterson - 1990
362   The stanford FLASH multiprocessor (context) - Kuskin - 1994
341   Parallel programming in Split-C - Culler, Dusseau et al. - 1993
304   Scheduler activations: Effective kernel support for the user.. - Anderson, Bershad et al. - 1992
275   Virtual memory mapped network interface for the SHRIMP multi.. - Blumrich, Li et al. - 1994
268   Tempest and Typhoon: User-level shared memory - Reinhardt, Larus et al. - 1994
257   force calculation algorithm (context) - Barnes, Hut et al. - 1986
240   for mimd distributed-memory machines (context) - Hiranandani, Kennedy et al. - 1992
230   Cilk: An efficient multithreaded runtime system - Blumofe, Joerg et al. - 1995
226   Lightweight remote procedure call (context) - Bershad, Anderson et al. - 1990
222   The SGI Origin: A ccNUMA highly scalable server (context) - Laudon, Lenoski - 1997
200   Programming languages for distributed computing systems (context) - Bal, Steiner et al. - 1989
190   Process control and scheduling issues for multiprogrammed sh.. (context) - Tucker, Gupta - 1989
173   Protocol verification as a hardware design aid - Dill, Drexler et al. - 1992
171   Eel: Machine-independent executable editing (context) - Larus, Schnarr - 1995
158   Implementing remote procedure calls (context) - Birrel, Nelson - 1984
155   Cache coherence protocols: Evaluation using a multiprocessor.. (context) - Archibald, Baer - 1986
149   Multicomputers: Message-passing concurrent computers (context) - Athas, Seitz - 1988
145   CRL: High-performance all-software distributed shared memory - Johnson, Kaashoek et al. - 1995
131   Fine-grain access control for distributed shared memory - Schoinas, Falsafi et al. - 1994
120   Experiences with a highspeed network adaptor: A software per.. - Druschel, Peterson et al. - 1994
111   Myrinet: A gigabit-per-second local area network (context) - Boden, Cohen et al. - 1995
109   Comparative evaluation of latency reducing and tolerating te.. - Gupta, Hennessy et al. - 1991
103   software-only approach for supporting fine-grain shared memo.. (context) - Scales, Gharachorloo et al. - 1996
101   IEEE Transactions on Parallel and Distributed Systems (context) - Agarwal, in - 1992
101   The SPLASH-2 programs: Characterization and methodological c.. - Woo, Ohara et al. - 1995
100   Using continuations to implement thread management and commu.. - Draves, Bershad et al. - 1991
99   Adaptive software cache management for distributed shared me.. - Bennett, Carter et al. - 1990
95   Application-specific protocols for user-level shared memory - Falsafi, Lebeck et al. - 1994
87   Pathfinder: A pattern-based packet classifier - Bailey, Gopal et al. - 1994
82   Kernel: An architecture for implementing network protocols (context) - Hutchinson, Peterson - 1991
81   Implementing precise interrupts in pipelined processors (context) - Smith, Plezkun - 1988
77   Net: A user-level network interface for parallel and distrib.. (context) - Basu, Werner et al. - 1995
74   The implications of cache affinity on processor scheduling f.. (context) - Vaswani, Zahorjan - 1991
70   Adaptive cache coherency for detecting migratory shared data - Cox, Fowler - 1993
65   Object distribution in Orca using compiletime and run-time t.. - Bal, Kashoek - 1993
61   Where is time spent in message -passing and shared-memory pr.. - Chandra, Larus et al. - 1994
61   Department of Computer Science (context) - Mosberger, Peterson et al. - 1995
58   Using processor-cache affinity information in shared-memory .. (context) - Squillante, Lazowska - 1990
52   Decoupled hardware support for distributed shared memory (context) - Reinhardt, Pfile et al. - 1996
52   Adaptive cache coherence protocol optimized for migratory sh.. (context) - Stenstrom, Brorsson et al. - 1993
48   Optimistic active messages: A mechanism for scheduling commu.. - Wallach, Hsieh et al. - 1995
47   Cummings Publishing Company (context) - Almasi, Gottlieb et al. - 1989
46   Fine-grain software distributed shared memory on SMP cluster.. - Scales, Gharachorloo et al. - 1997
43   Implementing lightweight threads - Stein, Shah - 1992
41   hpf compiler for distributed memory mimd computers: Design (context) - Bozkus, Choudhary et al. - 1993
37   and node granularity issues for large-scale multiprocessors (context) - Rothberg, Singh et al. - 1993
36   Reactive NUMA: A design for unifying SCOMA and CC-NUMA (context) - Falsafi, Wood - 1997
34   and bandwidth in distributed shared memory multiprocessors (context) - Holt, Heinrich et al. - 1995
34   Coherent network interfaces for fine-grain communication - Mukherjee, Falsafi et al. - 1996
33   Wisconsin Wind Tunnel II: A fast and portable parallel archi.. - Mukherjee, Reinhardt et al. - 1997
31   Fast interrupt priority management in operating systems - Stodolsky, Chen et al. - 1993
30   Cost-effective parallel computing - Wood, Hill - 1995
27   Orca: A language for distributed programming - Bal, Tanenbaum et al. - 1990
26   Kernel support for the Wisconsin Wind Tunnel - Reinhardt, Falsafi et al. - 1993
24   Multis: A new class of multiprocessor computers (context) - Bell - 1985
21   Billion-transistor architectures - Burger, Goodman - 1997
21   Scheduling communication on an SMP node parallel machine (context) - Falsafi, Wood - 1997
19   The CM-5 connection machine: A scalable supercomputer (context) - Hillis, Tucker - 1993
18   Hardware and software support for efficient exception handli.. - Thekkath, Levy - 1994
17   and iPSC/860 User's Guide (context) - Corporation - 1990
16   Coherence controller architectures for SMP-based CC-NUMA mul.. - Michael, Nanda et al. - 1997
16   Simple COMA node implementations - Hagersten, Saulsbury et al. - 1994
16   The effectiveness of affinity -based scheduling in multiproc.. - Salehi, Kurose et al. - 1996
15   Intel Supercomputer Systems Division (context) - Corporation, summary - 1993
14   Dynamic pointer allocation for scalable cache coherence dire.. - Simoni, Horowitz - 1991
14   Typhoon-Zero implementation: The vortex module - Pfile - 1995
12   Protected User-level DMA for the SHRIMP Network Interface - Blumrich, Dubnicki et al. - 1996
11   Computing surface 2: Overview documentation set (context) - Inc - 1993
11   Tempest interface specification (context) - Reinhardt - 1995
9   and David Patterson (context) - Anderson, Culler - 1995
8   Gigaplane -XB: Extending the Ultra Enterprise family - Charlesworth, Phelps et al. - 1997
8   Multiprocessor runtime support for fine-grained (context) - Chong, Sharma et al. - 1995
7   performance of a parallel computer simulator (context) - Falsafi, Wood - 1994
6   STiNG: A CC-NUMA compute system for the commercial marketpla.. (context) - Lovett, Clapp - 1996
6   protected communication on SMP clusters (context) - Lim, Heidelberg et al. - 1997
6   LU factorization and the LINPACK benchmark on the Intel Para.. - Womble, Greenberg - 1994
6   Active messages: a mechanism for integrating communication a.. (context) - von Eicken, Culler et al. - 1992
6   The Mercury interconnect architecture: A cost-effective infr.. (context) - Weber, Gold et al. - 1997
6   An SCI interconnect chipset and adapter (context) - Clark, Alnes - 1996
6   Sirocco: Cost-effective fine-grain distributed shared memory (context) - Schoinas, Falsafi et al. - 1997
6   Remote queues: Exposing message queues or optimization and a.. (context) - Brewer, Chong et al. - 1995
5   Mechanisms for Distributed Shared Memory - Reinhardt - 1996
5   When does dedicated protocol processing make sense - Falsafi, Wood - 1996
5   StartT-ng: Delivering seamless parallel computing (context) - Chiou, Ang et al. - 1995
5   Locking effects in multiprocessor implementations of protoco.. (context) - Bjoerkman, Gunningberg - 1993
4   SCI: Scalable Coherent Interface: Logical (context) - Gustavson, James - 1991
4   Fine-Grain Distributed Shared Memory on a Cluster of Worksta.. - Schoinas - 1997
2   The impact of message traffic on multicomputer memory hierar.. - Pakin, Chien - 1994
2   ACM Transactions on Modeling and Computer Simulation (context) - Falsafi, Wood et al. - 1997
1   ACM Transactions on Networking (context) - Kaiserswerth, protocol et al. - 1993
1   Software Techniques for Customizable Distributed Shared Memo.. (context) - Chandra - 1997
1   A Simulation Study of Network Interface Design Alternatives (context) - Mukherjee - 1998

Documents on the same site (http://www.cs.wisc.edu/~david/):   More
Reflections on "Tempest and Typhoon: User-level Shared Memory" - Reinhardt, Larus, Wood (1994)   (Correct)
Paging Tradeoffs in Distributed-Shared-Memory.. - Burger, Hyder, Miller, Wood (1994)   (Correct)
Relaxed Consistency and Coherence Granularity in DSM Systems.. - Yuanyuan Zhou (1997)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC