by Matti A. Hiltunen, Richard D. Schlichting
DependableComputing---EDCC-1 (Proceedings of the 1st European Dependable Computing Conference), Lecture Notes in Computer Science
ftp://ftp.cs.arizona.edu/people/hiltunen/edcc1.ps.Z
Add To MetaCart
Abstract:
Abstract. An adaptive computing system is one that modifies its behavior based on changes in the environment. Since one common type of environment change in a distributed system is network or processor failure, fault-tolerant distributed systems can be viewed as an important subclass of adaptive systems. As such, use of adaptive methods for dealing with failures in this context has the same potential advantages of improved efficiency and structural simplicity as for adaptive systems in general. This paper describes a model for adaptive systems that can be applied in many failure scenarios arising in distributed systems. This model divides the adaptation process into three different phases---change detection, agreement, and action---that can be used as a common means for describing various fault-tolerance algorithms such as reliable transmission and membership protocols. This serves not only to clarify the logical structure and relationship of such algorithms, but also to provide a unifying implementation framework. Several adaptive fault-tolerant protocols are given as examples. A technique for implementing the model in a distributed system using an event-driven approach for composing protocols in parallel is also presented. 1
Citations
|
219
|
Preserving and using context information in interprocess communication
– Peterson, Buchholz, et al.
- 1989
|
|
209
|
Atomic Broadcast: From Simple Message Diffusion To Byzantine Agreement
– Cristian, Aghili, et al.
- 1985
|
|
168
|
Using process groups to implement failure detection in asynchronous environme
– Ricciardi, Birman
- 1991
|
|
148
|
An efficient reliable broadcast protocol. Operating Systems Review
– Hummel, Bal
- 1989
|
|
128
|
Reaching agreement on processor-group membership in synchronous distributed systems
– Cristian
- 1991
|
|
126
|
Consul: A communication substrate for fault-tolerant distributed programs
– Mishra, Peterson, et al.
- 1993
|
|
90
|
Failure mode assumptions and assumption coverage
– Powell
- 1992
|
|
80
|
Dynamic Adaptation of Real-Time Software
– Bihari, Schwan
- 1991
|
|
76
|
A membership protocol based on partial order
– Mishra, Peterson, et al.
- 1991
|
|
64
|
ADAPTIVE: A Dynamically Assembled Protocol Transformation, Integration, and eValuation Environment
– Schmidt, Box, et al.
- 1993
|
|
48
|
Fault-tolerant membership service in a synchronous distributed real-time system
– Kopetz, Grnsteidl, et al.
- 1991
|
|
30
|
An approach to constructing modular fault-tolerant protocols
– Hiltunen, Schlichting
- 1993
|
|
18
|
Inconsistency and contamination
– GOPAL, TOUEG
- 1991
|
|
12
|
Adaptive Fault Tolerance
– Goldberg, Greenberg, et al.
- 1993
|
|
9
|
Adaptive, reliable software for distributed and parallel real-time systems
– Schwan, Bihari, et al.
- 1987
|
|
1
|
Adaptability Experiments
– Bhargava, Friesen, et al.
|