(Enter summary)
Abstract: : This report describes an experiment in the design of a general purpose
fault tolerant system, FTM. The main objective of the FTM design was to implement
a "low-cost" fault tolerant system that could be used on standard workstations.
At the operating system level, our goal was to provide a methodology for the design
of modular reliable operating systems, while offering fault tolerance transparency to
user applications. In other words, porting an application to FTM had only to require
compiling ... (Update)
Cited by: More
Recovering Device Drivers - Michael Swift Muthukaruppan (2004)
(Correct)
A Survey of Rollback-Recovery Protocols in.. - Elnozahy, Alvisi.. (1996)
(Correct)
Similar documents (at the sentence level):
5.8%: Performance of Consistent Checkpointing in a Modular.. - Muller, Hue, Peyrouze (1994)
(Correct)
Active bibliography (related documents): More All
2.1: Implementing Dynamic Atomic Actions Using Reliable Servers - Hue, Muller, Peyrouze.. (1993)
(Correct)
0.9: Fault Tolerance using Stable Memory - Coghlan, (eds.) (1999)
(Correct)
0.8: Experience with Building Distributed Systems on top of the Mach.. - Muller (1995)
(Correct)
Similar documents based on text: More All
0.2: FT-NFS: an Efficient Fault Tolerant NFS Server Designed for.. - Peyrouze, Muller (1996)
(Correct)
0.2: Projet CISé - Franchi-Zannettacci, Rueher
(Correct)
0.2: Causal Multicasts in Overlapping Groups: Towards a Low Cost.. - Achour Mostefaoui (1993)
(Correct)
Related documents from co-citation: More All
2: Fault tolerant matrix operations for networks of workstations using multiple che..
- Kim, Plank et al. - 1997
2: Application Transparent Fault Management in Fault Tolerant Mach (context) - Russinovich, Segall et al. - 1992
2: Efficient synchronous checkpointing in distributed systems (context) - Cao - 1992
BibTeX entry: (Update)
G. Muller, M. Banâtre , N. Peyrouz and B. Rochat. "Lessons from FTM: an experiment in design and implementation of a low-cost fault-tolerant system." In IEEE Transactions on Reliability, 45(2):332---340, Jun. 1996. http://citeseer.ist.psu.edu/muller95lessons.html More
@article{ muller96lessons,
author = "Gilles Muller and Michel Banatre and Nadine Peyrouze and Bruno Rochat",
title = "Lessons from {FTM}: An Experiment in Design and Implementation of a Low-Cost Fault-Tolerant System",
journal = "IEEE Transactions on Reliability",
volume = "45",
number = "2",
pages = "332-339",
year = "1996",
url = "citeseer.ist.psu.edu/muller95lessons.html" }
Citations (may not include all citations):
572
Distributed snapshots : Determining global states of distrib.. (context) - Chandy, Lamport - 1985
496
Splash : Stanford parallel applications for shared-memory (context) - Singh, Weber et al. - 1991
444
Mach: A new kernel foundation for Unix development (context) - Accetta, Baron et al. - 1986
217
Optimistic recovery in distributed systems (context) - Strom, Yemini - 1985
184
Checkpointing and rollback recovery for distributed systems (context) - Koo, Toueg - 1986
158
The Chorus distributed operating system (context) - Rozier, Abrossimov et al. - 1988
133
Manetho: Transparent rollback-recovery with low overhead
- Elnozahy, Zwaenepoel - 1992
120
The performance of consistent checkpointing
- Elnozahy, Johnson et al. - 1992
92
Designing an extensible distributed language with meta-level..
- Masuda, Chiba - 1993
91
Atomic transactions (context) - Lampson - 1981
76
Camelot and Avalon: A Distributed Transaction Facility (context) - Eppinger, Mummert et al. - 1991
68
ACM Transactions on Computer Systems (context) - Borg, Blau et al. - 1989
60
Independent checkpointing and concurrent rollback for recove.. (context) - Bhargava, Lian - 1988
44
A timestamp-based checkpointing protocol for long-lived dist.. (context) - Cristian, Jahanian - 1991
43
Recovery Management in QuickSilver (context) - Haskin, Malachi et al. - 1988
41
Implementation of Argus (context) - Liskov, Curtis et al. - 1987
38
Crash recovery with little overhead (context) - TY, Venkatesan - 1991
36
Checkpointing multicomputer applications (context) - Li, Naughton et al. - 1991
34
Global checkpointing for distributed programs (context) - Silva, Silva - 1992
32
Error recovery in multicomputers using global checkpoints (context) - Tamir, Sequin - 1984
29
Object Replication in a Distributed System (context) - Little - 1991
28
Concurrent robust checkpointing and recovery in distributed .. (context) - Leu, Bhargava - 1988
22
Transparent recovery of mach applications
- Goldberg, Gopal et al. - 1990
20
Experience with transactions in quicksilver
- Schmuck, Wyllie - 1991
16
Synchronization and Control of Distributed Systems and Progr.. (context) - Raynal, Helary - 1990
16
volume 60 of Lecture Notes in Computer Science (context) - Gray, Database - 1978
14
Recovery management in the RelaX distributed transaction lay.. (context) - Schumann, Kroger et al. - 1989
14
Dependable computing and faulttolerant systems (context) - Lee, Anderson - 1990
13
Checkpointing and rollback-recovery in distributed object ba.. (context) - Lin, Ahamad - 1990
12
Redundancy in data structures: Improving software fault tole.. (context) - Taylor, Morgan et al. - 1980
12
Department of Computer Science (context) - Nelson, Call - 1981
12
Data cache and storage control units (context) - Hardell, Hicks et al. - 1990
11
Using checkpoints to localize the effects of faults in distr.. (context) - Ahamad, Lin - 1989
8
Performance of consistent checkpointing in a modular operati..
- Muller, Hue et al. - 1994
8
State restoration in distributed systems
- Merlin, Randell - 1978
8
Exploiting type inheritance facilities to implement recovera.. (context) - Dixon, Shrivastava - 1987
7
Design decisions for the FTM: A general purpose fault tolera.. (context) - Banatre, Muller et al. - 1991
6
IEEE Computer (context) - Mullender, Van Rossum et al. - 1990
5
How to design reliable servers using fault tolerant micro-ke.. (context) - Banatre, Heng et al. - 1991
4
la construction de services fiables dans les systèmes distri.. (context) - Rochat - 1992
4
a distributed electronic marketing system (context) - Banatre, Banatre et al. - 1986
4
Ensuring data security and integrity with a fast stable stor.. (context) - Banatre, Banatre et al. - 1988
4
An experience in the design of a reliable object based syste.. (context) - Banatre, Heng et al. - 1993
3
A model for concurrent checkpointing and recovery using tran.. (context) - Leu, Bhargava - 1989
2
A stable transactional memory for building robust object ori.. (context) - Muller, Rochat et al. - 1991
2
Fault tolerance: Why should I pay for it (context) - Gleeson - 1993
1
OSF MACH 3 Kernel Final Draft Kernel Interfaces (context) - Loepere - 1993
1
A decentralised recovery control protocol (context) - Wood - 1981
1
alisation d'un lien série haut débit (context) - Muller, Prunault et al. - 1993
Documents on the same site (http://fermivista.math.jussieu.fr/ftp/ftp.irisa.fr.html): More
Table 51: Topology error tests - Test Path
(Correct)
Solving the Consensus Problem in a Mobile Environment - Badache, Hurfin, Macedo (1997)
(Correct)
A New Method For The Generation Of Strong Prime Numbers - Saouter (1995)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC