See this document in CiteSeerX!

Concepts for High Availability in Scientific High-End Computing +  (Make Corrections)  
C. Engelmann and S. L. Scott Computer Science and Mathematics Division Oak...
In Proceedings of the High Availability and Performance Workshop (HAPCW) 2005



  Home/Search   Context   Related

 
View or download:
ornl.gov/~engelman...mann05concepts.pdf
Cached:  PDF   PS.gz  PS  Image  Update  Help

From:  ornl.gov/~engelman/publication... (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
Redundancy, Concepts

Abstract: Scientific high-end computing (HEC) has become an important tool for scientists world-wide to understand problems, such as in nuclear fusion, human genomics and nanotechnology. Every year, new HEC systems emerge on the market with better performance and higher scale. With only very few exceptions, the overall availability of recently installed systems has been lower in comparison to the same deployment phase of their predecessors. In contrast to the experienced loss of availability, the demand... (Update)

Active bibliography (related documents):   More   All
1.2:   High Availability for Ultra-Scale High-End Scientific - Computing Christian Engelmann   (Correct)
0.7:   A Lightweight Kernel for the Harness Metacomputing Framework - Engelmann And Geist (2005)   (Correct)
0.5:   Asymmetric Active-Active High Availability for High-end - Computing Leangsuksun..   (Correct)

Similar documents based on text:
0.0:   Unknown -   (Correct)

BibTeX entry:   (Update)

@inproceedings{ engelmann05concepts,
	author = "C.~Engelmann and S.~L.~Scott",
	title = "{Concepts for High Availability in Scientific High-End Computing}",
	booktitle = "In Proceedings of the High Availability and Performance Workshop (HAPCW) 2005",
	year = "2005",
	month = oct,
	address = "Santa Fe, New Mexico/U.S.A.",
	url = "citeseer.ist.psu.edu/745548.html",
	url = "\url{http://citeseer.ist.psu.edu/745548.html}" }
Citations (may not include all citations):
182   The Transis approach to high availability cluster communicat.. - Dolev, Malki - 1996
150   Extended virtual synchrony - Moser, Amir et al. - 1994
6   Distributed peer-to-peer control in Harness (context) - Engelmann, Scott et al. - 2002
5   A diskless checkpointing algorithm for super-scale architect.. - Engelmann, Geist - 2003
5   IEEE Transactions on Parallel and Distributed Systems (context) - Plank, Li et al. - 1998
3   HARNESS: Adaptable virtual machine environment for heterogen.. (context) - Geist, Kohl et al. - 1999
2   High availability for ultrascale high-end scientific computi.. (context) - Engelmann, Scott - 2005
2   A modern taxonomy of high availability (context) - Resnick - 1996
1   Dissecting cyclops: a detailed analysis of a multithreaded a.. - Almasi, Cascaval et al. - 2003
1   Asymmetric active-active high availability for high-end comp.. (context) - Leangsuksun, Munganuru et al. - 2005
http://www.sgi.com/products/servers/altix
http://ftg.lbl.gov/checkpoint
http://top500.org
http://www.ibm.com/servers/eserver/linux/power/mare
http://www.cray.com/products/x1
http://www.llnl.gov/linux/slurm
http://www.kerrighed.org
http://www.cray.com/products/xt3
http://xcr.cenit.latech.edu/ha-oscar
http://www.lri.fr/#gk/MPICH-V
http://www.nlcf.gov
http://www.altair.com/pdf/PBSPro
http://www.nas.nasa.gov/About/Projects/Co
http://www.research.ibm.com/bluegene

Documents on the same site (http://www.csm.ornl.gov/~engelman/publications/):   More
A Lightweight Kernel for the Harness Metacomputing Framework - Engelmann And Geist (2005)   (Correct)
High Availability for Ultra-Scale High-End Scientific - Computing Christian Engelmann   (Correct)
A Highly Available Cluster Storage System Using Scavenging - Xubin Ben He   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC