Nonintrusive Failure Detection and Recovery for Internet Services Using Backdoors
Florin Sultan, Aniruddha Bohra, Yufei Pan, Stephen Smaldone, Iulian Neamtiu, Pascal Gallard, Liviu Iftode



View or download:
rutgers.edu/pub/techn...dcstr524.ps.Z

From:  rutgers.edu/pub/technicalrepo...

Abstract: We describe an architecture for nonintrusive failure detection and recovery in a cluster of Internet servers in which nodes mutually monitor their liveness and recover client sessions from failed nodes. The system is based on Backdoors, a novel architectural approach for remote healing of computer systems. Backdoors enables monitoring and recovery/repair of state in a computer system by remote access to system resources (memory, I/O devices) without using its processors. Backdoors allows remote ...

Similar documents based on text:
1.1:   Nonintrusive Remote Healing Using Backdoors - Florin Sultan Aniruddha (2003)
0.6:   Using Remote Memory Communication for Self-Healing Systems - Sultan, Bohra, Neamtiu..
0.4:   Dynamic Streams for Efficient Communications between.. - Gallard, Morin

BibTeX entry:

@misc{ sultan-nonintrusive,
  author = "Florin Sultan and Aniruddha Bohra and Yufei Pan and Stephen Smaldone and
    Iulian Neamtiu and Pascal Gallard and Liviu Iftode",
  title = "Nonintrusive Failure Detection and Recovery for Internet Services Using
    Backdoors",
  url = "citeseer.ist.psu.edu/633879.html" }
Citations (may not include all citations):
414   Unreliable Failure Detectors for Reliable Distributed System.. - Chandra, Toueg - 1996
180   A Survey of Rollback-Recovery Protocols in Message-Passing S.. - Elnozahy, Alvisi et al. - 2002
175   Dealing with Disaster: Surviving Misbehaved Kernel Extension.. - Seltzer, Endo et al. - 1996
87   Flash: An Efficient and Portable Web Server - Pai, Druschel et al. - 1999
86   The Virtual Interface Architecture - Dunning - 1998
77   U-Net: A User-Level Network Interface for Parallel and Distrib.. - Basu, Buch et al. - 1995
68   A NonStop Kernel - Bartlett - 1981
44   Early experience with message-passing on the shrimp multicom.. - Felten, Alpert et al. - 1996
35   Fail-Awareness in Timed Asynchronous Systems - Fetzer, Cristian - 1996
28   Fine-Grained Failover Using Connection Migration - Snoeren, Andersen et al. - 2001
26   The Recovery Box: Using Fast Recovery to Provide High Availa.. - Baker, Sullivan - 1992
26   Wrapping Server-Side TCP to Mask Connection Failures - Alvisi, Bressoud et al. - 2001
25   Self-Monitoring and Self-Adapting Operating Systems - Seltzer, Small - 1997
18   Why Do Internet Services Fail - Oppenheimer, Ganapathi et al. - 2003
18   On Scalable and Efficient Distributed Failure Detectors - Gupta, Chandra et al. - 2001

[Article contains additional citations not shown here]

Documents on the same site (http://www.cs.rutgers.edu/pub/technical-reports/):
Constrained REDO: An Alternative to REPLAY - Liew, Steinberg (1993)
Jambalaya: Using Multicast for Blind Distributed Web Searching .. - Navas, Hirsh (1998)
Law-Governed Regularities in Software Systems - Minsky (1994)

CiteSeer.IST - Copyright Penn State and NEC