See this document in CiteSeerX!

The Use of Prediction for Accelerating Upgrade Misses in cc-NUMA  (Make Corrections)  
Multiprocessors Manuel E. Acacio, Jose Gonzalez, Jose M. Garca and Jose...



  Home/Search   Context   Related

 
View or download:
ditec.um.es/~jmgar...pact02invpred.pdf
Cached:  PDF   PS.gz  PS  Image  Update  Help

From:  ditec.um.es/~jmgarcia/research (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: This work is focused on accelerating upgrade misses in cc-NUMA multiprocessors. These misses are caused by store instructions for which a read-only copy of the line is found in the L2 cache. Upgrade misses require a message sent from the missing node to the directory, a directory lookup in order to find the set of sharers, invalidation messages being sent to the sharers and responses to the invalidations being sent back. Therefore, the penalty paid by these misses is not negligible, mainly if... (Update)

Similar documents (at the sentence level):
16.9%:   Owner Prediction for Accelerating Cache-to-Cache.. - Acacio, Gonzalez, .. (2002)   (Correct)

Active bibliography (related documents):   More   All
0.8:   Using Destination-Set Prediction to Improve the.. - Martin, al. (2003)   (Correct)
0.5:   A New Scalable Directory Architecture for Large-Scale .. - Acacio, González..   (Correct)
0.5:   Local Relaxed Consistency Schemes on Shared-Memory Clusters - Schulz, Tao, McKee   (Correct)

Similar documents based on text:   More   All
0.7:   A Novel Approach to Reduce L2 Miss Latency in Shared-Memory - Multiprocessors Manuel..   (Correct)
0.4:   Reducing the Latency of L2 Misses in Shared-Memory.. - On-Chip Directory..   (Correct)
0.1:   The MPI-Delphi Interface: A Visual Programming.. - Acacio.. (1999)   (Correct)

BibTeX entry:   (Update)

@misc{ acacio-use,
  author = "Multiprocessors Manuel Acacio",
  title = "The Use of Prediction for Accelerating Upgrade Misses in cc-NUMA",
  url = "citeseer.ist.psu.edu/649512.html" }
Citations (may not include all citations):
496   SPLASH: Stanford Parallel Applications for Shared-Memory (context) - Singh, Weber et al. - 1992
222   The SGI Origin: A ccNUMA Highly Scalable Server (context) - Laudon, Lenoski - 1997
131   Parallel Computer Architecture: A Hardware/Software Approach (context) - Culler, Singh et al. - 1999
105   The SPLASH-2 Programs: Characterization and Methodological C.. (context) - Woo, Ohara et al. - 1995
64   Cache Invalidation Patterns in Shared-Memory Multiprocessors (context) - Gupta, Weber - 1992
51   Reducing Memory and Traffic Requirements for Scalable Direct.. - Gupta, Weber et al. - 1990
45   Dynamic Self-Invalidation: Reducing Coherence Overhead in Sh.. - Lebeck, Wood - 1995
36   Multiprocessors Should Support Simple Memory-Consistency Mod.. - Hill - 1998
26   Using Prediction to Accelerate Coherence Protocols - Mukherjee, Hill - 1998
25   Improving CC-NUMA Performance Using Instruction-Based Predic.. (context) - Kaxiras, Goodman - 1999
24   Reducing Cache Invalidation Overheads in Wormhole Routed DSM.. - Dai, Panda - 1996
23   RSIM: Simulating Shared-Memory Multiprocessors with ILP Proc.. (context) - Hughes, Pai et al. - 2002
17   An Efficient Implementation of Tree-based Multicast Routing .. - Malumbres, Duato et al. - 1996
16   Eliminating Cache Conflict Misses through XOR-Based Placemen.. (context) - Gonzalez, Valero et al. - 1997
16   Multicast Snooping: A New Coherence Method Using a Multicast.. - Bilir, Dickson et al. - 1999
13   The Impact of Exploiting Instruction-Level Parallelism on Sh.. - Pai, Ranganathan et al. - 1999
8   Coherence Communication Prediction in Shared-Memory Multipro.. - Kaxiras, Young - 2000
5   Alpha 21364 to Ease Memory Bottleneck (context) - Gwennap - 1998
4   High-Throughput Coherence Controllers - Nanda, Nguyen et al. - 2000
4   An Empirical Evaluation of Two Memory-Efficient Directory Me.. (context) - O'Krafka, Newton - 1990
3   Architecture and Design of AlphaServer GS320 (context) - Gharachorloo, Sharma et al. - 2000
3   Extending the SMP Envelope (context) - Charlesworth - 1998
3   Memory Sharing Predictor: The Key to a Speculative DSM (context) - Lai, Falsafi - 1999
2   Selective, Accurate, and Timely Self-Invalidation Using Last.. - Lai, Falsafi - 2000
2   Reducing Ownership Overhead for Load-Store Sequences in Cach.. - Nilsson, Dahlgren - 2000
2   A Novel Approach to Reduce L2 Miss Latency in SharedMemory M.. (context) - Acacio, Gonzalez et al. - 2002
1   A Novel Multicast Scheme to Reduce Cache Invalidation Overhe.. - Zhou, Shi et al. - 2000

Documents on the same site (http://ditec.um.es/~jmgarcia/research.html):   More
The MPI-Delphi Interface: A Visual Programming.. - Acacio.. (1999)   (Correct)
A Novel Approach to Improve the Performance of.. - Garcia, Flores   (Correct)
A New Language for Multicomputer Programming - Carrasco (1992)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC