Alternate document:   Details   Efficient Multiprecision Floating Point Multiplication with Optimal Directional Rounding (93) Werner Krandick, Jeremy R. Johnson

See this document in CiteSeerX!

Efficient Transparent Optimistic Rollback Recovery for Distributed Application Programs (1993)  (Make Corrections)  (38 citations)
David Johnson
Symposium on Reliable Distributed Systems



  Home/Search   Context   Related

 
View or download:
arirang.snu.ac.kr/~woojeong...srds93.ps
cmu.edu/~dbj/ftp/srds93.ps
cmu.edu/usr/anon/199...CMUCS93127.ps
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  arirang.snu.ac.kr/~woojeong/ (more)
From:  cmu.edu/~dbj/ft
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: Existing rollback-recovery methods using consistent checkpointing may cause high overhead for applications that frequently send output to the "outside world," since a new consistent checkpoint must be written before the output can be committed, whereas existing methods using optimistic message logging may cause large delays in committing output, since processes may buffer received messages arbitrarily long before logging and may also delay propagating knowledge of their logging or checkpointing ... (Update)

Context of citations to this paper:   More

...messages to other processes. The process then attempts to receive another message, and blocks until one is available. nondeterminism [9, 14] by treating each nondeterministic influence as a message, logging it and replaying it during recovery. The message logging approach...

...to a state that satisfies the no orphan consistency condition. Checkpoints can be independent [SY85,KT87,SW89,JZ87,SBY88,JV87,JZ90, Joh93,WF92,EZ92,VJ94] or coordinated [EZ94] The storage abstraction implemented by the logging component is called a log. To emphasize the...

Cited by:   More
Understanding The Message Logging Paradigm For Masking Process.. - Alvisi (1996)   (Correct)
Completely Asynchronous Optimistic Recovery with Minimal.. - Smith, Johnson, Tygar (1995)   (Correct)
Minimizing Timestamp Size for Completely Asynchronous.. - Smith, Johnson (1996)   (Correct)

Active bibliography (related documents):   More   All
0.6:   A Survey of Rollback-Recovery Protocols in Message-Passing.. - Elnozahy, Johnson, Wang (1996)   (Correct)
0.6:   On the Use and Implementation of Message Logging - Elnozahy (1994)   (Correct)
0.5:   Replay and Distributed Breakpoints in an OSF DCE Environment - Yong (1995)   (Correct)

Similar documents based on text:   More   All
0.1:   Transparent Recovery in Distributed Systems - Bacon (1990)   (Correct)
0.0:   Performance Optimization of Throttled Time-Warp Simulation - Tay, Teo   (Correct)
0.0:   On the "No-Z-Cycle" Property in Distributed Executions - Quaglia, Baldoni, Ciciani   (Correct)

Related documents from co-citation:   More   All
25:   Recovery in distributed systems using optimistic message logging and checkpointi.. - Johnson, Zwaenepoel - 1988
25:   Optimistic Recovery in Distributed Systems (context) - Strom, Yemini - 1985
25:   Manetho: Transparent Rollback-Recovery with Low Overhead, Limited Rollback and F.. - Elnozahy, Zwaenepoel - 1992

BibTeX entry:   (Update)

D.B. Johnson. "Efficient Transparent Optimistic Rollback Recovery for Distributed Application Programs." 13th Symposium on Reliable Distributed Systems. IEEE, October 1993. http://citeseer.ist.psu.edu/johnson93efficient.html   More

@inproceedings{ johnson93efficient,
    author = "David B. Johnson",
    title = "Efficient Transparent Optimistic Rollback Recovery for Distributed Application Programs",
    booktitle = "Symposium on Reliable Distributed Systems",
    pages = "86-95",
    year = "1993",
    url = "citeseer.ist.psu.edu/johnson93efficient.html" }
Citations (may not include all citations):
572   Distributed snapshots: Determining global states of distribu.. (context) - Chandy, Lamport - 1985
361   Reliable communication in the presence of failures (context) - Birman, Joseph - 1987
217   Optimistic recovery in distributed systems (context) - Strom, Yemini - 1985
184   Checkpointing and rollback-recovery for distributed systems (context) - Koo, Toueg - 1987
177   Fail-stop processors: An approach to designing fault-toleran.. - Schlichting, Schneider - 1983
156   Recovery in distributed systems using optimistic message log.. - Johnson, Zwaenepoel
133   Manetho: Transparent rollback-recovery with low overhead - Elnozahy, Zwaenepoel - 1992
125   ACM Transactions on Computer Systems (context) - Chang, Maxemchuck et al. - 1984
120   The performance of consistent checkpointing - Elnozahy, Johnson et al. - 1992
109   Sender-based message logging - Johnson, Zwaenepoel - 1987
98   A message system supporting fault tolerance (context) - Borg, Baumbach et al. - 1983
95   An Ethernet address resolution protocol (context) - Plummer - 1982
83   Efficient distributed recovery using message logging (context) - Sistla, Welch - 1989
74   Publishing: A reliable broadcast communication mechanism (context) - Powell, Presotto - 1983
68   ACM Transactions on Computer Systems (context) - Borg, Blau et al. - 1989
60   Independent checkpointing and concurrent rollback for recove.. (context) - Bhargava, Lian - 1988
47   xAMp: a multi-primitive group communications service - Rodrigues, Verissimo - 1992
46   fault-tolerant distributed systems (context) - Strom, Bacon et al. - 1988
46   A distributed domino-effect free recovery algorithm (context) - Briatico, Ciuffoletti et al. - 1984
44   IEEE Transactions on Parallel and Distributed Systems (context) - Melliar-Smith, Moser et al.
44   A timestamp-based checkpointing protocol for long-lived dist.. (context) - Cristian, Jahanian - 1991
40   Distributed System Fault Tolerance Using Message Logging and.. - Johnson - 1989
39   Information Processing Letters (context) - Lai, Yang et al. - 1987
38   Crash recovery with little overhead (context) - Juang, Venkatesan - 1991
36   Checkpointing multicomputer applications (context) - Li, Naughton et al. - 1991
34   Efficient distributed snapshots (context) - Spezialetti, Kearns - 1986
32   Transparent fault-tolerance in parallel Orca programs - Kaashoek, Michiels et al. - 1992
32   Optimistic message logging for independent checkpointing in .. - Wang, Fuchs - 1992
31   Optimal checkpointing and local recording for domino-free ro.. (context) - Venkatesh, Radhakrishnan et al. - 1987
28   Concurrent robust checkpointing and recovery in distributed .. (context) - Leu, Bhargava - 1988
21   concurrent checkpoint for parallel programs (context) - Li, Naughton et al. - 1990
12   Checkpointing and rollback recovery in a distributed system .. (context) - Ramanathan, Shin - 1988
12   Optimistic failure recovery for very large networks (context) - Lowry, Russell et al. - 1991
8   A non-intrusive checkpointing protocol (context) - Israel, Morris - 1989
5   A low overhead checkpointing and rollback recovery scheme fo.. (context) - Tong, Kain et al. - 1989
2   Global checkpointing for distributed programs (context) - Moura, Silva et al. - 1992



The graph only includes citing articles where the year of publication is known.


Documents on the same site (http://arirang.snu.ac.kr/~woojeong/):   More
Recovery in Distributed Systems Using Optimistic Message.. - Johnson, Zwaenepoel (1988)   (Correct)
Lazy Checkpoint Coordination for Bounding Rollback Propagation - Wang, Fuchs (1993)   (Correct)
Tight Upper Bound on Useful Distributed System Checkpoints - Yi-Min Wang (1995)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC