Download:
|
by M. C. Little, S. K. Shrivastava
http://arjuna.ncl.ac.uk/group/papers/p075.ps
Add To MetaCart
Abstract:
A widely used computational model for constructing fault-tolerant distributed applications employs atomic transactions for controlling operations on persistent objects. There has been considerable work on data replication techniques for increasing the availability of persistent data that is manipulated under the control of transactions. Process groups with ordered group communications has also emerged as a model for building available distributed applications. High service availability can be achieved by replicating the service state on multiple processes managed by a group communication infrastructure. These two models are often seen as rivals. This paper explores the role of these models in building fault-tolerant distributed applications. The paper develops a general model of distributed persistent objects and investigates how such objects can be replicated for availability using a system that supports only: (i) transactions; and (ii) process groups. A comparative evaluation shows how the two models can be used together to provide a more flexible approach to supporting high availability applications.
Citations
|
1073
|
Impossibility of distributed consensus with one faulty process
– Fischer, Lynch, et al.
- 1985
|
|
572
|
Implementing fault-tolerant services using the state machine approach: A tutorial
– Schneider
- 1990
|
|
278
|
Exploiting virtual synchrony in distributed systems
– Birman, Joseph
- 1987
|
|
225
|
The Transis approach to high availability cluster communication
– Dolev, Malki
- 1996
|
|
150
|
Newtop: a fault-tolerant group communication protocol
– Ezhilchelvan, Macedo, et al.
- 1995
|
|
142
|
Understanding the limitations of causally and totally ordered communication
– Cheriton, Skeen
- 1993
|
|
122
|
Software-based replication for fault tolerance
– Guerraoui, Schiper
- 1997
|
|
79
|
Replicated distributed programs
– COOPER
- 1985
|
|
67
|
Efficient message ordering in dynamic networks
– Keidar, Dolev
- 1996
|
|
63
|
From Group Communication to Transactions in Distributed Systems
– Schiper, Raynal
- 1996
|
|
61
|
Synchronous and Asynchronous Group Communication
– Cristian
- 1996
|
|
41
|
Object Replication in a Distributed System
– Little
- 1991
|
|
25
|
Replicated K-Resilient Objects in Arjuna
– Little, Shrivastava
- 1990
|
|
21
|
Maintaining information about persistent replicated objects in a distributed system
– Little, McCue, et al.
- 1993
|
|
11
|
et al, "Concurrency Control and Recovery in Database Systems
– Bernstein
- 1987
|
|
11
|
End to end arguments in system design
– Saltzer, Reed, et al.
- 1984
|
|
7
|
Melliar-Smith et al, "Totem: a Fault-tolerant multicast group communication system
– Moser, M
- 1996
|
|
6
|
Enhancing Replica Management Services to Tolerate Group Failures
– Ezhilchelvan, Shrivastava
- 1999
|
|
3
|
A suite of replication protocols based on group communication primitives
– Kemme, Alonso
- 1998
|