Cooperative checkpointing: a robust approach to large-scale systems reliability (2006)

by Adam J. Oliner, Larry Rudolph, Ramendra K. Sahoo
Venue:In ICS ’06: Proceedings of the 20th annual international conference on Supercomputing