The Hector Parallel Run--Time Environment Russ, et al. 1 The Hector Parallel Run--Time Environment
Abstract:
This has been submitted to the IEEE Transactions on Parallel and Distributed Systems for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible. Abstract---- Harnessing the compuational capabilities of a network of workstations promises to off--load work from overloaded supercomputers onto largely idle resources overnight. Several capabilities are needed to do this, including support for an architecture--indpendent parallel programming environment, task migration, intelligent resource allocation, and fault tolerance. The Hector parallel run--time environment is designed to present these capabilities transparently to programmers. MPI programs can be run under this environment with no modifications to their source code needed. The design of Hector, its internal structure, and several benchmarks and tests are presented.
Citations
| 160 | Using MPI – Gropp, Lusk, et al. - 1995 |
| 143 | The Livermore Fortran Kernels: A Computer Test of the Numerical Performance Range – McMahon - 1986 |
| 56 | The Prospero resource manager: A scalable framework for processor allocation in distributed systems. Concurrency: Practice and Experience – Neumann, Rao - 1994 |
| 47 | Managing checkpoints for parallel programs – Livny, Pruyne - 1996 |
| 37 | MIST: PVM with transparent migration and checkpointing – Casas, Clark, et al. - 1995 |
| 36 | Providing resource management services to parallel applications – Pruyne, Livny - 1994 |
| 23 | Consistent Checkpoints of PVM Applications – Stellner - 1994 |
| 16 | A Task Migration Implementation for the Message--Passing Interface – Robinson, Russ, et al. - 1996 |
| 8 | Hector: Automated Task Allocation for MPI – Russ, Flachs, et al. - 1996 |
| 5 | MPVM: A Migration Transparent Version of PVM", Usenix Computing Systems Journal – Casas, Clark, et al. - 1995 |

