MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  A model for adaptive fault-tolerant systems (1994) [5 citations — 3 self]

Download:
pdf | ps
by Matti A. Hiltunen, Richard D. Schlichting
DependableComputing---EDCC-1 (Proceedings of the 1st European Dependable Computing Conference), Lecture Notes in Computer Science
ftp://ftp.cs.arizona.edu/people/hiltunen/edcc1.ps.Z
Add To MetaCart

Abstract:

Abstract. An adaptive computing system is one that modifies its behavior based on changes in the environment. Since one common type of environment change in a distributed system is network or processor failure, fault-tolerant distributed systems can be viewed as an important subclass of adaptive systems. As such, use of adaptive methods for dealing with failures in this context has the same potential advantages of improved efficiency and structural simplicity as for adaptive systems in general. This paper describes a model for adaptive systems that can be applied in many failure scenarios arising in distributed systems. This model divides the adaptation process into three different phases---change detection, agreement, and action---that can be used as a common means for describing various fault-tolerance algorithms such as reliable transmission and membership protocols. This serves not only to clarify the logical structure and relationship of such algorithms, but also to provide a unifying implementation framework. Several adaptive fault-tolerant protocols are given as examples. A technique for implementing the model in a distributed system using an event-driven approach for composing protocols in parallel is also presented. 1

Citations

219 Preserving and using context information in interprocess communication – Peterson, Buchholz, et al. - 1989
209 Atomic Broadcast: From Simple Message Diffusion To Byzantine Agreement – Cristian, Aghili, et al. - 1985
168 Using process groups to implement failure detection in asynchronous environme – Ricciardi, Birman - 1991
148 An efficient reliable broadcast protocol. Operating Systems Review – Hummel, Bal - 1989
128 Reaching agreement on processor-group membership in synchronous distributed systems – Cristian - 1991
126 Consul: A communication substrate for fault-tolerant distributed programs – Mishra, Peterson, et al. - 1993
90 Failure mode assumptions and assumption coverage – Powell - 1992
80 Dynamic Adaptation of Real-Time Software – Bihari, Schwan - 1991
76 A membership protocol based on partial order – Mishra, Peterson, et al. - 1991
64 ADAPTIVE: A Dynamically Assembled Protocol Transformation, Integration, and eValuation Environment – Schmidt, Box, et al. - 1993
48 Fault-tolerant membership service in a synchronous distributed real-time system – Kopetz, Grnsteidl, et al. - 1991
30 An approach to constructing modular fault-tolerant protocols – Hiltunen, Schlichting - 1993
18 Inconsistency and contamination – GOPAL, TOUEG - 1991
12 Adaptive Fault Tolerance – Goldberg, Greenberg, et al. - 1993
9 Adaptive, reliable software for distributed and parallel real-time systems – Schwan, Bihari, et al. - 1987
1 Adaptability Experiments – Bhargava, Friesen, et al.