• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

DMCA

Treadmarks: Shared memory computing on networks of workstations (1996)

Cached

  • Download as a PDF

Download Links

  • [www.cs.rochester.edu]
  • [www4.informatik.uni-erlangen.de]
  • [www.cs.rice.edu]
  • [www.cs.swarthmore.edu]
  • [www.cs.toronto.edu]
  • [www.cs.rice.edu]
  • [www-db.stanford.edu]
  • [www.eecg.toronto.edu]
  • [www.eecg.toronto.edu]
  • [www.cs.rochester.edu]
  • [www.cs.rice.edu]
  • [www.cs.rice.edu]
  • [www.cs.rice.edu]
  • [www.cs.swarthmore.edu]

  • Save to List
  • Add to Collection
  • Correct Errors
  • Monitor Changes
by Cristiana Amza , Alan L. Cox , Hya Dwarkadas , Pete Keleher , Honghui Lu , Ramakrishnan Rajamony , Weimin Yu , Willy Zwaenepoel
Venue:Computer
Citations:487 - 37 self
  • Summary
  • Citations
  • Active Bibliography
  • Co-citation
  • Clustered Documents
  • Version History

Citations

1231 How to make a multiprocessor computer that correctly executes multiprocess programs - Lamport - 1979 (Show Context)

Citation Context

... not well defined in a distributed system. A more precise notion is sequential consistency, whereby the memory appears to all processes as if they were executing on a single multiprogrammed processor =-=[7]. With seq-=-uential consistency, the notion of "the last value written" is precisely defined. The simplicity of this model may, however, exact a high price in terms of performance, and therefore much re...

1222 Condor: A hunter of idle workstations - Litzkow, Livny, et al. - 1988 (Show Context)

Citation Context

...sors dedicated for the purpose of providing computing cycles. Alternatively, it may consist of a dynamically varying set of machines on which idle cycles are used to perform long-running computations =-=[14]-=-. In the latter case, the (hardware) cost is essentially zero, since many organizations already have extensive workstation networks in place. In terms of performance, improvements in processor speed a...

957 Memory coherence in shared virtual memory systems,”ACMTransactions onComputer Systems, - Li, Hudak - 1989 (Show Context)

Citation Context

...ions using the TreadMarks distributed shared memory (DSM) system. DSM allows processes to assume a globally shared virtual memory even though they execute on nodes that do not physically share memory =-=[9]-=-. Figure 1 illustrates a DSM system consisting of N networked workstations, each with its own memory, connected by a network. The DSM software provides the abstraction of a globally shared memory, in ...

788 PVM: A Framework for Parallel Distributed Computing," - Sunderam - 1990 (Show Context)

Citation Context

...ming models. 9.1 Alternative Programming Models Message Passing (PVM). Currently, message passing is the prevailing programming paradigm for distributed memory systems. Parallel Virtual Machine (PVM) =-=[12]-=- is a popular software message passing package. It allows a heterogeneous network of computers to appear as a single concurrent computational engine. TreadMarks is currently restricted to a homogeneou...

730 Memory consistency and event ordering in scalable shared-memory multiprocessors,” ISCA, - Gharachorloo, Lenoski, et al. - 1990 (Show Context)

Citation Context

...ory consistency models, and we have designed protocols to combat the false sharing problem. These approaches are discussed next. 4 Lazy Release Consistency 4.1 Release Consistency Release consistency =-=[5]-=- is a relaxed memory consistency model. It does not guarantee consistency at all times; the propagation of updates to shared memory may be delayed for some amount of time. By doing so, a release consi...

587 Implementation and Performance of Munin. - Carter, Bennett, et al. - 1991 (Show Context)

Citation Context

...action does not result in a large amount of communication. Various techniques have been used in TreadMarks to meet this challenge, including lazy release consistency [6] and multiple writer protocols =-=[3]-=-. This paper first describes the application programming interface provided by TreadMarks (Section 2), Next, we discuss the implementation challenges. (Section 3) and the techniques used to meet these...

526 Treadmarks: Distributed shared memory on standard workstations and operating systems. - Keleher, Cox, et al. - 1994 (Show Context)

Citation Context

...between the two environments. Finally, DSM allows for seamless integration of shared-memory multiprocessor workstations in a network environment. The actual system described in this paper, TreadMarks =-=[7]-=-, runs at user-level on Unix workstations. No kernel modifications or special privileges are required, and standard Unix compilers and linkers are used. The challenge in implementing a DSM system is t...

467 Lazy Release Consistency for Software Distributed Shared Memory - KELEHER, COX, et al. - 1992 (Show Context)

Citation Context

... sure that the shared memory abstraction does not result in a large amount of communication. Various techniques have been used in TreadMarks to meet this challenge, including lazy release consistency =-=[6]-=- and multiple writer protocols [3]. This paper first describes the application programming interface provided by TreadMarks (Section 2), Next, we discuss the implementation challenges. (Section 3) and...

427 The High Performance Fortran Handbook. - Koelbel, Loveman, et al. - 1994 (Show Context)

Citation Context

... has to divide the computation among different threads and use either synchronization or message passing to control the interactions among the concurrent threads. With implicit parallelism, as in HPF =-=[6]-=-, the user writes a single-threaded program, which is then parallelized by the compiler. In particular, HPF contains data distribution primitives, which may be used by the compiler to drive the parall...

393 A.: The Midway Distributed Shared Memory System - Bershad, Zekaukas, et al. - 1993 (Show Context)

Citation Context

...te protocol works better than an update protocol, because of the large amount of data resulting from the update protocol. Entry Consistency (Midway). Entry consistency is another relaxed memory model =-=[3]-=-. As in release consistency, consistency actions are taken in conjunction with synchronization operations. Unlike release consistency, however, entry consistency requires that each shared data object ...

369 The Directory-Based Cache Coherence Protocol for the DASH Multiprocessor. - Lenoski, Laudon, et al. - 1990 (Show Context)

Citation Context

...ive approach to shared memory is to implement it in hardware, using a snooping bus protocol for a small number of processors or using a directory-based protocol for larger number of processors (e.g., =-=[8]-=-). We share with this approach the programming model, but our implementation avoids expensive cache controller hardware. On the other hand, a hardware implementation can efficiently support applicatio...

229 Experiences with the amoeba distributed operating system, - Tannenbaum, Renesse, et al. - 1990 (Show Context)

Citation Context

...ng solely on commodity hardware and software, networks of workstations offer parallel processing at a relatively low cost. A network-of-workstations multiprocessor may be realized as a processor bank =-=[17]-=-, a number of processors dedicated for the purpose of providing computing cycles. Alternatively, it may consist of a dynamically varying set of machines on which idle cycles are used to perform long-r...

127 A uni formalization of four shared-memory models - Adve, Hill - 1993 (Show Context)

Citation Context

...n operations can be mapped into acquires and releases as well. A partial order, happened-before-1 , denoted hb1 ! can be defined on releases, acquires, and shared memory accesses in the following way =-=[1]-=-: ffl If a 1 and a 2 are ordinary shared memory accesses, releases, or acquires on the same processor, and a 1 occurs before a 2 in program order, then a 1 hb1 ! a 2 . ffl If a 1 is a release on proce...

120 Algorithms implementing distributed shared memory - Stumm, Zhou - 1990 (Show Context)

Citation Context

...te data to provide the abstraction of shared memory. Most DSM systems choose to replicate data, because this approach gives the best performance for a wide range of application parameters of interest =-=[11]-=-. With replicated data, the provision of memory consistency is at the heart of a DSM system: the DSM software must control replication in a manner that provides the abstraction of a single shared memo...

102 Strategies for Multilocus Linkage Analysis - Lathrop, Lalouel, et al. - 1984 (Show Context)

Citation Context

...on the chromosome can be computed. ILINK [4] is a parallelized version of a widely used genetic linkage analysis program, which is part Mendel Recombination Figure 12 DNA Recombination of the LINKAGE =-=[10]-=- package. ILINK takes as input a family tree, called a pedigree, augmented with some genetic information about the members of the family. It computes a maximum-likelihood estimate of `, the recombinat...

65 Message passing versus distributed shared memory on networks of workstations, Supercomputing, - Lu, Dwarkadas, et al. - 1995 (Show Context)

Citation Context

...omplish the same goal. For a more detailed comparison in programmability and performance between TreadMarks and PVM we refer the reader to Lu's M.S. thesis, which includes nine different applications =-=[10]-=-. 22 901 4682 774 4085 9570 Processors 1 2 3 4 5 6 7 8 Speedup 1 2 3 4 5 6 7 Figure 14 Speedup Results for ILINK: Each line represents a different data set. The numbers at the bottom indicate the sequ...

62 Parallel Programming Using Shared Objects and Broadcasting - TANENBAUM, KAASHOEK, et al. - 1992 (Show Context)

Citation Context

...parallelism, such as TSP, ILINK, or MIP are not easily expressed in the HPF framework. Object-Oriented Parallel Computing (Orca). Rather than providing the programmer with a shared memory space, Orca =-=[16]-=- and other object-oriented parallel computing systems support a shared space of object, each of which is accessed by properly synchronized methods. Besides the advantages from a programming perspectiv...

41 Parallelization of General Linkage Analysis Problems - DWARKADAS, SCHÄFFER, et al. - 1994 (Show Context)

Citation Context

...at recombination has occurred between the gene we are looking for and genes with known locations. From these probabilities an approximate location of the gene on the chromosome can be computed. ILINK =-=[4]-=- is a parallelized version of a widely used genetic linkage analysis program, which is part Mendel Recombination Figure 12 DNA Recombination of the LINKAGE [10] package. ILINK takes as input a family ...

39 Techniques for reducing consistency-related information in distributed shared memory systems - Carter, Bennett, et al. - 1995 (Show Context)

Citation Context

... to implement release consistency. Roughly speaking, lazy release consistency enforces consistency at the time of an acquire, in contrast to the earlier implementation of release consistency in Munin =-=[3]-=-, sometimes referred to as eager release consistency, which enforced consistency at the time of a release. Figure 6 illustrates the intuitive argument behind lazy release consistency. Assume that x is...

29 Computational Experience with a Difficult Mixed-Integer Multicommodity Flow - Bienstock - 1995 (Show Context)

Citation Context

... after the pre-processing step, which is not (yet) parallelized. In addition to the problems from the MIPLIB library, the code was also used to solve a previously unsolved multicommodity flow problem =-=[2]-=-. The problem took roughly 30 CPU days on an 8-processor IBM SP-1 and also exhibited near-linear speedup. misc07 mod011 p0548 p6000 set1al stein45 1 2 3 4 5 6 7 8 0 2 4 6 8 10 12 14 Figure 11 Results ...

18 Distributed Shared Memory Using Lazy Release Consistency - Keleher - 1994 (Show Context)

Citation Context

...n hardware may lead to poor performance because of discrepancies between the page size of the machine and the granularity of sharing in the application. The system discussed in this paper, TreadMarks =-=[5]-=-, provides shared memory as a linear array of bytes. The memory model is a relaxed memory model, namely release consistency. The implementation uses the virtual memory hardware to detect accesses, but...

17 Integrating Parallelization Strategies for Linkage Analysis - Gupta, Schiffer, et al. - 1995 (Show Context)

Citation Context

...ilities an approximate location of the gene on the chromosome can be computed. ILINK is a parallelized version of a widely used genetic linkage analysis program, which is part of the FASTLINK package =-=[4]-=-. ILINK takes as input a family tree, called a pedigree, augmented with some genetic information about the members of the family. It computes a maximum-likelihood estimate of `, the recombination prob...

15 How to make amultiprocessor computer that correctly executes multiprocess programs - Lamport - 1979 (Show Context)

Citation Context

...s not well de ned in a distributed system. A more precise notion is sequential consistency, whereby the memory appears to all processes as if they were executing on a single multiprogrammed processor =-=[7]-=-. With sequential consistency, the notion of \the last value written" is precisely de ned. The simplicity of this model may, however, exact a high price in terms of performance, and therefore much res...

10 A uni ed formalization of four sharedmemory models - Adve, Hill - 1993 (Show Context)

Citation Context

...on operations can be mapped into acquires and releases as well. A partial order, happened-before-1 , denoted hb1 ! can be de ned on releases, acquires, and shared memory accesses in the following way =-=[1]-=-: If a1 and a2 are ordinary shared memory accesses, releases, or acquires on the same processor, hb1 and a1 occurs before a2 in program order, then a1 ! a2. If a1 is a release on processor p1, and a2 ...

4 Munin: Efficient Distributed Shared Memory Using Multi-Protocol Release Consistency - Carter - 1993 (Show Context)

Citation Context

... to implement release consistency. Roughly speaking, lazy release consistency enforces consistency at the time of an acquire, in contrast to the earlier implementation of release consistency in Munin =-=[4]-=-, sometimes referred to as eager release consistency, which enforced consistency at the time of a release. Figure 7 shows the intuitive argument behind lazy release consistency. Assume that x is repli...

3 Munin: E cient Distributed Shared Memory Using Multi-Protocol Release Consistency - Carter - 1993 (Show Context)

Citation Context

... to implement release consistency. Roughly speaking, lazy release consistency enforces consistency at the time of an acquire, in contrast to the earlier implementation of release consistency in Munin =-=[4]-=-, sometimes referred to as eager release consistency, which enforced consistency at the time of a release. Figure 7 shows the intuitive argument behind lazy release consistency. Assume that x is repli...

1 Munin: Ecient Distributed Shared Memory Using Multi-Protocol Release Consistency - Carter - 1993 (Show Context)

Citation Context

... to implement release consistency. Roughly speaking, lazy release consistency enforces consistency at the time of an acquire, in contrast to the earlier implementation of release consistency in Munin =-=[4]-=-, sometimes referred to as eager release consistency, which enforced consistency at the time of a release. Figure 7 shows the intuitive argument behind lazy release consistency. Assume that x is repli...

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University