Download:
|
by Taesoon Park, Heon Y. Yeom
In Proc. the 20th International Conference on Distributed Computing Systems
http://arirang.snu.ac.kr/~yeom/paper/icdcs00.ps
Add To MetaCart
Abstract:
This paper presents an asynchronous recovery scheme to provide fault-tolerance for mobile computing systems. The proposed scheme is based on optimistic message logging, since the checkpointing-only schemes are not suitable for the mobile environment in which unreliable mobile hosts and fragile network connection may hinder any kind of coordination for checkpointing and recovery. Also, in order to reduce the overhead imposed on mobile hosts, mobile support stations take charge of logging and dependency tracking, and mobile hosts maintain only a small amount of information for mobility tracking. As a result, truly asynchronous recovery for mobile systems can be achieved with the little overhead.
Citations
|
829
|
Distributed snapshots: Determining global states of distributed systems
– Chandy, Lamport
- 1985
|
|
438
|
System Structure for Software Fault Tolerance
– Randell
- 1975
|
|
248
|
Toueg, Checkpointing and rollback-recovery for distributed systems
– Koo
- 1987
|
|
169
|
Manetho: Transparent rollback-recovery with low overhead, limited rollback and fast output commit
– Elnozahy, Zwaenepoel
- 1992
|
|
96
|
Efficient distributed recovery using message logging
– Sistla, Welch
- 1989
|
|
85
|
Message logging: Pessimistic, optimistic, causal and optimal
– Alvisi, Marzullo
- 1998
|
|
72
|
Checkpointing Distributed Applications on Mobile Computers
– Acharya, Badrinath
- 1994
|
|
72
|
Independent Checkpointing and Concurrent Rollback Recovery for Distributed Systems— An Optimistic Approach
– Bhargava, Lian
- 1988
|
|
64
|
Structuring distributed algorithms for mobile hosts
– Badrinath, Acharya, et al.
- 1994
|
|
63
|
A distributed domino-effect free recovery algorithm
– Briatico, Ciuffoletti, et al.
- 1984
|
|
63
|
Nonblocking and Orphan-Free Message Logging Protocols
– Alvisi, Hoppe, et al.
- 1993
|
|
56
|
P.C.: Reliability Issues in Computing System Design
– Randell, Lee, et al.
- 1978
|
|
55
|
Lazy Checkpoint Coordination for Bounding Rollback
– Wang, Fuchs
- 1993
|
|
47
|
M.: “Low-Cost Checkpointing and Failure Recovery in Mobile Computing Systems
– Prakash, Singhal
- 1996
|
|
45
|
Recovery in mobile environments: Design and trade-off analysis
– Pradhan, Krishna, et al.
- 1996
|
|
44
|
Error recovery in multicomputers using global checkpoints
– Tamir, Sequin
- 1984
|
|
43
|
Adaptive Recovery for Mobile Environments
– Neves, Fuchs
- 1997
|
|
39
|
Optimal checkpointing and local recording for domino-free rollback recovery. Information Processing Letters
– Venkatesh, Radhakrishnan, et al.
- 1987
|
|
38
|
How to Recover Efficiently and Asynchronously when Optimism Fails
– Damani, Garg
- 1996
|
|
32
|
Completely Asynchronous Optimistic Recovery with Minimal Rollbacks
– Smith, Johnson, et al.
- 1995
|
|
20
|
Checkpointing with mutable checkpoints
– Cao, Singhal
- 2003
|
|
20
|
Distributed Recovery with K-Optimistic Logging
– Wang, Damani, et al.
- 1997
|
|
13
|
An Efficient Algorithm for Checkpointing Recovery in Distributed Systems
– Kim, Park
- 1993
|
|
9
|
Failure Recovery Based on Quasi-synchronous Checkpointing in Mobile Computing Systems
– Manivannan, Singhal
- 1996
|
|
3
|
Domino-effect free checkpointing recovery in distributed systems
– Park, Kim
- 1994
|