See this document in CiteSeerX!

Lessons from FTM: an Experiment in the Design and Implementation of a Low Cost Fault Tolerant System (1995)  (Make Corrections)  (2 citations)
Gilles Muller, et al.
IEEE Transactions on Reliability



  Home/Search   Context   Related

 
View or download:
irisa.fr/techreports/199...PI913.ps.gz
inria.fr/INRIA/publicat...RR2517.ps.gz
cornell.edu/Courses/cs717/2...PI913.ps
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  fermivista.math.ju...ftp.irisa.fr (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: : This report describes an experiment in the design of a general purpose fault tolerant system, FTM. The main objective of the FTM design was to implement a "low-cost" fault tolerant system that could be used on standard workstations. At the operating system level, our goal was to provide a methodology for the design of modular reliable operating systems, while offering fault tolerance transparency to user applications. In other words, porting an application to FTM had only to require compiling ... (Update)

Cited by:   More
Recovering Device Drivers - Michael Swift Muthukaruppan (2004)   (Correct)
A Survey of Rollback-Recovery Protocols in.. - Elnozahy, Alvisi.. (1996)   (Correct)

Similar documents (at the sentence level):
5.8%:   Performance of Consistent Checkpointing in a Modular.. - Muller, Hue, Peyrouze (1994)   (Correct)

Active bibliography (related documents):   More   All
2.1:   Implementing Dynamic Atomic Actions Using Reliable Servers - Hue, Muller, Peyrouze.. (1993)   (Correct)
0.9:   Fault Tolerance using Stable Memory - Coghlan, (eds.) (1999)   (Correct)
0.8:   Experience with Building Distributed Systems on top of the Mach.. - Muller (1995)   (Correct)

Similar documents based on text:   More   All
0.2:   FT-NFS: an Efficient Fault Tolerant NFS Server Designed for.. - Peyrouze, Muller (1996)   (Correct)
0.2:   Projet CISé - Franchi-Zannettacci, Rueher   (Correct)
0.2:   Causal Multicasts in Overlapping Groups: Towards a Low Cost.. - Achour Mostefaoui (1993)   (Correct)

Related documents from co-citation:   More   All
2:   Fault tolerant matrix operations for networks of workstations using multiple che.. - Kim, Plank et al. - 1997
2:   Application Transparent Fault Management in Fault Tolerant Mach (context) - Russinovich, Segall et al. - 1992
2:   Efficient synchronous checkpointing in distributed systems (context) - Cao - 1992

BibTeX entry:   (Update)

G. Muller, M. Banâtre , N. Peyrouz and B. Rochat. "Lessons from FTM: an experiment in design and implementation of a low-cost fault-tolerant system." In IEEE Transactions on Reliability, 45(2):332---340, Jun. 1996. http://citeseer.ist.psu.edu/muller95lessons.html   More

@article{ muller96lessons,
    author = "Gilles Muller and Michel Banatre and Nadine Peyrouze and Bruno Rochat",
    title = "Lessons from {FTM}: An Experiment in Design and Implementation of a Low-Cost Fault-Tolerant System",
    journal = "IEEE Transactions on Reliability",
    volume = "45",
    number = "2",
    pages = "332-339",
    year = "1996",
    url = "citeseer.ist.psu.edu/muller95lessons.html" }
Citations (may not include all citations):
572   Distributed snapshots : Determining global states of distrib.. (context) - Chandy, Lamport - 1985
496   Splash : Stanford parallel applications for shared-memory (context) - Singh, Weber et al. - 1991
444   Mach: A new kernel foundation for Unix development (context) - Accetta, Baron et al. - 1986
217   Optimistic recovery in distributed systems (context) - Strom, Yemini - 1985
184   Checkpointing and rollback recovery for distributed systems (context) - Koo, Toueg - 1986
158   The Chorus distributed operating system (context) - Rozier, Abrossimov et al. - 1988
133   Manetho: Transparent rollback-recovery with low overhead - Elnozahy, Zwaenepoel - 1992
120   The performance of consistent checkpointing - Elnozahy, Johnson et al. - 1992
92   Designing an extensible distributed language with meta-level.. - Masuda, Chiba - 1993
91   Atomic transactions (context) - Lampson - 1981
76   Camelot and Avalon: A Distributed Transaction Facility (context) - Eppinger, Mummert et al. - 1991
68   ACM Transactions on Computer Systems (context) - Borg, Blau et al. - 1989
60   Independent checkpointing and concurrent rollback for recove.. (context) - Bhargava, Lian - 1988
44   A timestamp-based checkpointing protocol for long-lived dist.. (context) - Cristian, Jahanian - 1991
43   Recovery Management in QuickSilver (context) - Haskin, Malachi et al. - 1988
41   Implementation of Argus (context) - Liskov, Curtis et al. - 1987
38   Crash recovery with little overhead (context) - TY, Venkatesan - 1991
36   Checkpointing multicomputer applications (context) - Li, Naughton et al. - 1991
34   Global checkpointing for distributed programs (context) - Silva, Silva - 1992
32   Error recovery in multicomputers using global checkpoints (context) - Tamir, Sequin - 1984
29   Object Replication in a Distributed System (context) - Little - 1991
28   Concurrent robust checkpointing and recovery in distributed .. (context) - Leu, Bhargava - 1988
22   Transparent recovery of mach applications - Goldberg, Gopal et al. - 1990
20   Experience with transactions in quicksilver - Schmuck, Wyllie - 1991
16   Synchronization and Control of Distributed Systems and Progr.. (context) - Raynal, Helary - 1990
16   volume 60 of Lecture Notes in Computer Science (context) - Gray, Database - 1978
14   Recovery management in the RelaX distributed transaction lay.. (context) - Schumann, Kroger et al. - 1989
14   Dependable computing and faulttolerant systems (context) - Lee, Anderson - 1990
13   Checkpointing and rollback-recovery in distributed object ba.. (context) - Lin, Ahamad - 1990
12   Redundancy in data structures: Improving software fault tole.. (context) - Taylor, Morgan et al. - 1980
12   Department of Computer Science (context) - Nelson, Call - 1981
12   Data cache and storage control units (context) - Hardell, Hicks et al. - 1990
11   Using checkpoints to localize the effects of faults in distr.. (context) - Ahamad, Lin - 1989
8   Performance of consistent checkpointing in a modular operati.. - Muller, Hue et al. - 1994
8   State restoration in distributed systems - Merlin, Randell - 1978
8   Exploiting type inheritance facilities to implement recovera.. (context) - Dixon, Shrivastava - 1987
7   Design decisions for the FTM: A general purpose fault tolera.. (context) - Banatre, Muller et al. - 1991
6   IEEE Computer (context) - Mullender, Van Rossum et al. - 1990
5   How to design reliable servers using fault tolerant micro-ke.. (context) - Banatre, Heng et al. - 1991
4   la construction de services fiables dans les systèmes distri.. (context) - Rochat - 1992
4   a distributed electronic marketing system (context) - Banatre, Banatre et al. - 1986
4   Ensuring data security and integrity with a fast stable stor.. (context) - Banatre, Banatre et al. - 1988
4   An experience in the design of a reliable object based syste.. (context) - Banatre, Heng et al. - 1993
3   A model for concurrent checkpointing and recovery using tran.. (context) - Leu, Bhargava - 1989
2   A stable transactional memory for building robust object ori.. (context) - Muller, Rochat et al. - 1991
2   Fault tolerance: Why should I pay for it (context) - Gleeson - 1993
1   OSF MACH 3 Kernel Final Draft Kernel Interfaces (context) - Loepere - 1993
1   A decentralised recovery control protocol (context) - Wood - 1981
1   alisation d'un lien série haut débit (context) - Muller, Prunault et al. - 1993

Documents on the same site (http://fermivista.math.jussieu.fr/ftp/ftp.irisa.fr.html):   More
Table 51: Topology error tests - Test Path   (Correct)
Solving the Consensus Problem in a Mobile Environment - Badache, Hurfin, Macedo (1997)   (Correct)
A New Method For The Generation Of Strong Prime Numbers - Saouter (1995)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC