Efficient Shared-Memory Support for Parallel Graph Reduction (1996)
Andrew J. Bennett, Paul H.J. Kelly



Abstract: This paper presents the results of a simulation study of cache coherency issues in parallel implementations of functional programming languages. Parallel graph reduction uses a heap shared between processors for all synchronisation and communication. We show that a high degree of spatial locality is often present and that the rate of synchronisation is much greater than for imperative programs. We propose a modified coherency protocol with static cache line ownership and show that this...

BibTeX entry:

@misc{bennett-efficient,
  author = "Andrew J. Bennett and Paul H.J. Kelly",
  title = "Efficient Shared-Memory Support for Parallel Graph Reduction",
  url = "citeseer.ist.psu.edu/bennett96efficient.html"
}
