Extensible, Scalable Monitoring for Clusters of Computers (1997)
Eric Anderson, Dave Patterson




Abstract: Monitoring a large cluster of cooperating computers requires extensibility, fault tolerance, and scalability. We handle the evolution of software and hardware in our cluster by using relational tables to make CARD extensible. We detect and recover from node and network failures by using timestamps to resynchronize our system. We improve data scalability by using a hierarchy of databases and a hybrid push/pull protocol for efficiently delivering data from sources to sinks. Finally, ...

Context of citations to this paper:

.... [16] DOGMA [8] and PARMON [5] The Node Status Reporter (NSR) [14] and the Cluster Administration using Relational Databases (CARD) [2] both provide a standard mechanism for cluster status access, NSR interface and SQL, respectively. But the fixed communication interface is...

Cited by:
Informed Data Distribution Selection in a Self-Predicting Storage System - Thereska.. (2006)
ClusterProbe: An Open, Flexible and Scalable Cluster.. - Liang, Sun, Wang (1999)

Similar documents (at the sentence level):
44.0%:   Researching System Administration - Anderson

Active bibliography (related documents):
5.1:   System Administration: Monitoring, Diagnosing, and Repairing - Anderson
0.5:   A System Diagnostic Console for Networks of Computers - Anderson, Goto, Patterson (1996)
0.5:   Code Generation for VSTL - Guangyi (1996)

Similar documents based on text:
0.1:   Netreg: An Automated DHCP Registration System - Valian, Watson (1999)
0.1:   Monitoring Large Systems via Statistical Sampling - Mendes, Reed (2002)
0.0:   Highway Advisory Radio - Offices Of Research

Related documents from co-citation:
2:   The HP AutoRAID Hierarchical Storage System - Wilkes, Golding et al. - 1995
2:   Metadata efficiency in versioning file systems - Soules, Goodson et al. - 2003
2:   Erasure coding vs replication quantitative approach - Kubiatowicz et al. - 2002

BibTeX entry:

Anderson and Patterson, "Extensible, Scalable Monitoring for Clusters of Computers," Proceedings of the 1997 USENIX LISA Conference, http://citeseer.ist.psu.edu/anderson97extensible.html

@inproceedings{ anderson97extensible,
  author = "Eric Anderson and Dave Patterson",
  title = "Extensible, Scalable Monitoring for Clusters of Computers",
  booktitle = "Proceedings of the 1997 USENIX LISA Conference",
  year = "1997",
  url = "citeseer.ist.psu.edu/anderson97extensible.html" }

Documents on the same site (http://www.cs.berkeley.edu/~eanders/):
Researching System Administration - Anderson
