See this document in CiteSeerX!

Recovery in Distributed Systems Using Optimistic Message Logging and Checkpointing (1988)  (Make Corrections)  (156 citations)
David B. Johnson, Willy Zwaenepoel
Proc.\ 7th Annual ACM Symp.\ on Principles of Distributed Computing



  Home/Search   Context   Related

 
View or download:
rice.edu/~willy/papers/podc88.ps.gz
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  rice.edu/~willy/publications (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: In a distributed system using message logging and checkpointing to provide fault tolerance, there is always a unique maximum recoverable system state, regardless of the message logging protocol used. The proof of this relies on the observation that the set of system states that have occurred during any single execution of a system forms a lattice, with the sets of consistent and recoverable system states as sublattices. The maximum recoverable system state never decreases, and if all messages... (Update)

Cited by:   More
Consistent Main-memory Database Federations under Deferred.. - Schmidt, Pedone (2005)   (Correct)
Libra: A Library for Reliable Distributed Applications - Jinsong Ouyang And   (Correct)
Dependable High Performance Computing on a Parallel.. - Blochinger, Bündgen, al. (2000)   (Correct)

Similar documents (at the sentence level):
7.8%:   Recovery in Distributed Systems Using Optimistic Message.. - Johnson, Zwaenepoel (1988)   (Correct)

Active bibliography (related documents):   More   All
0.0:   Distributed System Fault Tolerance Using Message Logging and.. - Johnson (1989)   (Correct)
0.0:   A Survey of Rollback-Recovery Protocols in Message-Passing.. - Elnozahy, Johnson, Wang (1996)   (Correct)
0.0:   Methods and Models for Management of Distributed and Persistent.. - Feeley (1995)   (Correct)

Similar documents based on text:   More   All
0.3:   A Simple Algorithm for Finding the Maximum Recoverable .. - Johnson, Keleher.. (1990)   (Correct)
0.3:   Transparent Optimistic Rollback Recovery - Johnson, Zwaenepoel (1991)   (Correct)
0.2:   Using Message Semantics to Reduce Rollback in Optimistic.. - Leong, Agrawal (1994)   (Correct)

Related documents from co-citation:   More   All
75:   Optimistic Recovery in Distributed Systems (context) - Strom, Yemini - 1985
52:   Distributed snapshots: Determining global states of distributed systems (context) - Chandy, Lamport - 1985
40:   Efficient Distributed Recovery Using Message Logging (context) - Sistla, Welch - 1989

BibTeX entry:   (Update)

David B. Johnson and Willy Zwaenepoel. Recovery in distributed systems using optimistic message logging and checkpointing. In Proceedings of the Seventh Annual ACM Symposium on Principles of Distributed Computing, pages 171-- 181. ACM, August 1988. May 1988. http://citeseer.ist.psu.edu/johnson88recovery.html   More

@inproceedings{ johnson88recovery,
    author = "David B. Johnson and Willy Zwa{}enepo{}el",
    title = "Recovery in Distributed Systems Using Optimistic Message Logging and Checkpointing",
    booktitle = "Proc.\ 7th Annual {ACM} Symp.\ on Principles of Distributed Computing",
    address = "Toronto ({Canada})",
    pages = "171--181",
    year = "1988",
    url = "citeseer.ist.psu.edu/johnson88recovery.html" }
Citations (may not include all citations):
917   and the ordering of events in a distributed system (context) - Lamport, clocks - 1978
572   Distributed snapshots: Determining global states of distribu.. (context) - Chandy, Lamport - 1985  DBLP
293   System structure for software fault tolerance (context) - Randell - 1975  ACM   DBLP
217   Optimistic recovery in distributed systems (context) - Strom, Yemini - 1985  ACM   DBLP
109   Sender-based message logging - Johnson, Zwaenepoel - 1987
98   A message system supporting fault tolerance (context) - Borg, Baumbach et al. - 1983  ACM   DBLP
74   Publishing: A reliable broadcast communication mechanism (context) - Powell, Presotto - 1983  ACM   DBLP
58   Crash recovery in a distributed data storage system - Lampson, Sturgis - 1979
45   State restoration in systems of communicating processes (context) - Russell - 1980  DBLP



The graph only includes citing articles where the year of publication is known.


Documents on the same site (http://www.cs.rice.edu/~willy/publications.html):   More
Causal Distributed Breakpoints - Fowler, Zwaenepoel (1990)   (Correct)
Transparent Adaptive Parallelism on NOWs using OpenMP - Scherer, Lu, Gross, Zwaenepoel (1999)   (Correct)
Locality-Aware Request Distribution in Cluster-based Network Servers - Pai (1998)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC