See this document in CiteSeerX!

Comparative Evaluation of Latency Tolerance Techniques for Software Distributed Shared Memory (1998)  (Make Corrections)  (14 citations)
Todd Mowry, Charles Q. C. Chan, Adley K. W. Lo
Proc. of the 4th IEEE Symp. on High-Performance Computer Architecture (HPCA-4)



  Home/Search   Context   Related

 
View or download:
cmu.edu/~tcm/tcm_p...m_latency_hpca4.ps
cmu.edu/~tcm/tcm_p...atency_hpca4.ps.gz
umd.edu/class/spring1...mowryhpca98.ps
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  cmu.edu/~tcm/Papers (more)
From:  cmu.edu/~tcm/Papers
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: A key challenge in achieving high performance on software DSMs is overcoming their relatively large communication latencies. In this paper, we consider two techniques which address this problem: prefetching and multithreading. While previous studies have examined each of these techniques in isolation, this paper is the first to evaluate both techniques using a consistent hardware platform and set of applications, thereby allowing direct comparisons. In addition, this is the first study to... (Update)

Context of citations to this paper:   More

...execution times to some extent but it is not clear how much the protocol overhead limited further gains. More recently, Mowry et al. [32] studied compiler controlled prefetching in SVM systems. While prefetching helped cut the memory stall times, the protocol overhead was...

...delay. The first technique is multi threading. Multi threading has been used in many systems to mask communication costs [17, 20, 28]. The difference here is that notification delay may be many times larger than the base communication costs, and therefore multi...

Cited by:   More
Efficient Categorization of Memory Sharing Patterns in.. - De Castro Computer (2001)   (Correct)
Comparing Latency-Tolerance Techniques - For Software Dsm (2003)   (Correct)
Active Correlation Tracking - Thitikamol, Keleher (1999)   (Correct)

Active bibliography (related documents):   More   All
0.1:   Hardware Support for Thread-Level Speculation - Thesis Summary Gregory   (Correct)
0.1:   Software-Controlled Multithreading Using Informing Memory.. - Mowry, Ramkissoon (1998)   (Correct)
0.1:   Architectural Requirements for In-Pipeline CCMs - Gregory Steffan Reconfigurable   (Correct)

Similar documents based on text:   More   All
0.3:   Comparative Evaluation of Latency Reducing and.. - Gupta, Hennessy.. (1991)   (Correct)
0.1:   Communication in Hardware/Software Embedded Systems - O'Nils (1997)   (Correct)
0.1:   Informing Memory Operations: Providing Memory Performance.. - Horowitz (1996)   (Correct)

Related documents from co-citation:   More   All
8:   Treadmarks: Distributed shared memory on standard workstations and operating sys.. - Keleher, Dwarkadas et al. - 1994
7:   The SPLASH-2 programs: Characterization and methodological considerations - Woo, Ohara et al. - 1995
6:   Implementation and Performance of MUNIN - Carter, Bennett et al. - 1991

BibTeX entry:   (Update)

T.C. Mowry, C.Q.C. Chan, and A.K.W. Lo. Comparative evaluation of latency tolerance techniques for software distributed shared memory. In Proceedings of the Fourth High Performance Computer Architecture Symposium, February 1998. http://citeseer.ist.psu.edu/mowry98comparative.html   More

@inproceedings{ mowry:1998:comparative,
    author = "Todd C. Mowry and Charles Q. C. Chan and Adley K. W. Lo",
    title = "Comparative Evaluation of Latency Tolerance Techniques for Software Distributed Shared Memory",
    booktitle = "Proc. of the 4th {IEEE} Symp. on High-Performance Computer Architecture ({HPCA}-4)",
    pages = "300--311",
    year = "1998",
    url = "citeseer.ist.psu.edu/mowry98comparative.html" }
Citations (may not include all citations):
723   Memory Coherence in Shared Virtual Memory Systems - Li, Hudak - 1989
406   TreadMarks: Distributed Shared Memory on Standard Workstatio.. - Keleher, Cox et al. - 1994
353   The SPLASH-2 Programs: Characterization and Methodological C.. - Woo, Ohara et al. - 1995
344   Design and Evaluation of a Compiler Algorithm for Prefetchin.. - Mowry, Lam et al. - 1992
300   Lazy Release Consistency for Software Distributed Shared Mem.. - Keleher, Cox et al. - 1992
249   Tolerating Latency Through Software-Controlled Prefetching i.. - Mowry, Gupta - 1991
249   Tolerating Latency Through SoftwareControlled Data Prefetchi.. - Mowry - 1994
222   The SGI Origin: A ccNUMA Highly Scalable Server (context) - Laudon, Lenoski - 1997
212   APRIL: A Processor Architecture for Multiprocessing - Agarwal, Lim et al. - 1990
149   Software Prefetching (context) - Callahan, Kennedy et al. - 1991
133   A Unified Formalization of Four Shared-Memory Models - Adve, Hill - 1991
109   Comparative Evaluation of Latency Reducing and Tolerating Te.. - Gupta, Hennessy et al. - 1991
104   Compiler-Based Prefetching for Recursive Data Structures - Luk, Mowry - 1996
74   Software Versus Hardware Shared-Memory Implementation: A Cas.. - Cox, Dwarkadas et al. - 1994
72   MASA: A Multithreaded Processor Architecture for Parallel Sy.. (context) - Halstead, Fujita - 1988
51   Understanding Application Performance on Shared Virtual Memo.. - Iftode, Singh et al. - 1996
51   An Integrated Compile-Time/Run-Time Software Distributed Sha.. - Dwarkadas, Cox et al. - 1996
49   Automatic Compiler-Inserted I/O Prefetching for Out-ofCore A.. - Mowry, Demke et al. - 1996
40   Interleaving: A Multithreading Technique Targeting Multiproc.. - Laudon, Gupta et al. - 1994
36   Hiding Communication Latency and Coherence Overhead in Softw.. - Bianchini, Kontothanassis et al. - 1996
33   Parallel MIMD Computation : The HEP Supercomputer and Its Ap.. (context) - Kowalik - 1985
19   Multi-threading and Remote Latency in Software DSMs - Thitikamol, Keleher - 1997
8   Languages and Compilers for Parallel Computing (context) - Tjiang, Wolf et al. - 1992
1   Tolerating Latency in Software Distributed Shared Memory Sys.. (context) - Lo
1   Tolerating Latency in Software Distributed Shared Memory Sys.. (context) - Chan



The graph only includes citing articles where the year of publication is known.


Documents on the same site (http://www.cs.cmu.edu/~tcm/Papers.html):   More
Predicting Data Cache Misses in Non-Numeric Applications.. - Mowry, Luk (1997)   (Correct)
Automatic Compiler-Inserted I/O Prefetching for.. - Mowry, Demke, Krieger (1996)   (Correct)
Informing Loads: Enabling Software To Observe And.. - Horowitz.. (1995)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC