An Overview of the BlueGene/L Supercomputer (2002)

by The BlueGene/L Team , T Domany , Mb Dombrowa , W Donath , M Eleftheriou , C Erway , J Esch , J Gagliano , A Gara , R Garg , R Germain , Me Giampapa , B Gopalsamy , J Gunnels , B Rubin , A Ruehli , S Rus , Rk Sahoo , A Sanomiya , E Schenfeld , M Sharma , S Singh , P Song , V Srinivasan , Bd Steinmacher-burow , K Strauss , C Surovic , Tjc Ward , J Marcella , A Muff , A Okomo , M Rouse , A Schram , M Tubbs , G Ulsh , C Wait , J Wittrup , M Bae (ibm Server Group , K Dockser (ibm Microelectronics , L Kissel
Citations:14 - 0 self

Documents Related by Co-Citation

5 The red storm computer architecture and its implementation – W J Camp, J L Tomkins - 2003
15 A System Software Architecture for High-End Computing – David S. Greenberg, Ron Brightwell, Lee Ann Fisk, Arthur Maccabe, Rolf Riesen - 1997
27 Adaptive incremental checkpointing for massively parallel systems – Saurabh Agarwal, Rahul Garg, Meeta S. Gupta - 2004
6 Modeling coordinated checkpointing for large-scale supercomputers – K Pattabiraman, C Vick, A Wood - 2005
4 Optimization of Checkpointing-related I/O for High- Performance Parallel and Distributed Computing – Rajagopal Subramaniyan, Eric Grobelny, Scott Studham
42 Impact of checkpoint latency on overhead ratio of a checkpointing scheme – Nitin H Vaidya - 1997
28 Checkpointing for peta-scale systems: A look into the future of practical rollback-recovery – Elmootazbellah N. Elnozahy, James S. Plank - 2004
60 A first order approximation to the optimum checkpoint interval – J W Young - 1974
24 A higher order estimate of the optimum checkpoint interval for restart dumps – J. T. Daly - 2006
181 The Performance of Consistent Checkpointing – Elmootazbellah Nabil Elnozahy, David B. Johnson, Willy Zwaenepoel - 1992
432 Flit-Reservation Flow Control – Li-shiuan Peh, William J. Dally - 2000
15 Filtering failure logs for a bluegene/l prototype – Yinglung Liang, Yanyong Zhang An, Sivasubramaniam Ramendra, K. Sahoo, Jose Moreira, Manish Gupta - 2005
42 Experimental Assessment of Workstation Failures and Their Impact on Checkpointing Systems – James S. Plank, Wael R. Elwasif - 1997
38 Performance analysis of checkpointing strategies – A N Tantawi, M Ruschitzka - 1984
9 Providing Persistent and Consistent Resources through Event Log Analysis and Predictions for Large-scale Computing Systems – Ramendra K. Sahoo, Myung Bae, Ricardo Vilalta, Jose Moreira, Sheng Ma, Manish Gupta - 2002
9 Sahoo R, Cooperative checkpointing theory – A Oliner, L Rudolph - 2006
36 Failure data analysis of a large-scale heterogeneous server environment – Ramendra K. Sahoo, Mark S. Squillante - 2004
22 Processor Allocation and Checkpoint Interval Selection in Cluster Computing Systems – James S. Plank, Michael G. Thomason - 2001
5 Modeling the impact of checkpoints on next-generation systems – Ron A. Oldfield, Patricia J. Teller, Maria Ruiz Varela, Philip C. Roth, Sarala Arunagiri, Seetharami Seelam, Rolf Riesen - 2007