Download:
by Sathish S. Vadhiyar, Jack J. Dongarra
Concurrency & Computation: Practice & Experience
http://icl.cs.utk.edu/news_pub/submissions/sans-grid-edinburgh.pdf
Add To MetaCart
Abstract:
Abstract. Optimizing a given software to exploit the features of the underlying system has been an area of research for many years. Recently, a number of self adapting software have been designed and developed for various computing environments. In this paper, we discuss the design and implementation of a software that dynamically adjusts the parallelism of applications executing on computational Grids in accordance to the changing load characteristics of the underlying resources. The migration framework implemented by our software is oriented towards performance oriented Grid systems and implements tightly coupled policies for both suspension and migration of executing applications. The suspension and migration policies take into account both the load changes on systems as well the remaining execution times of the applications thereby taking into account both system load and application characteristics. The main goal of our migration framework is to improve the response times for individual applications. We also present some results that demonstrate the usefulness of our migration framework.
Citations
|
799
|
Condor - A Hunter of Idle Workstations
– Litzkow, Livny, et al.
- 1988
|
|
483
|
MPI: The Complete Reference
– Snir, Otto, et al.
- 1996
|
|
267
|
FFTW: An adaptive software architecture for the FFT
– Frigo, Johnson
- 1998
|
|
212
|
eds. The Grid: Blueprint for a New Computing Infrastructure
– Foster, Kesselman
- 2004
|
|
201
|
A directory service for configuring high-performance distributed computations
– Fitzgerald, Foster, et al.
|
|
129
|
Cocheck: Checkpointing and process migration for MPI
– Stellner
- 1996
|
|
107
|
Autopilot: Adaptive Control of Distributed Applications
– Ribler, Vetter, et al.
- 1998
|
|
93
|
The AppLeS Project: A Status Report
– Berman, Wolski
- 1997
|
|
91
|
A grid-enabled MPI: Message passing in heterogeneous distributed computing systems
– Foster, Karonis
- 1998
|
|
84
|
CUMULVS: Providing Fault Tolerance, Visualization and Steering
– Geist, Kohl, et al.
- 1997
|
|
78
|
Dome: Parallel Programming in a Heterogeneous Multi-User Environment. Supercomputing
– Arabe, Lowekamp, et al.
- 1995
|
|
61
|
MPVM: A Migration Transparent Version of PVM
– Casas, Clark, et al.
- 1995
|
|
45
|
Adaptive load sharing in heterogeneous distributed systems
– Mirchandaney, Towsley, et al.
- 1990
|
|
25
|
Automatically Tuned Collective Communications
– Vadhiyar, Fagg, et al.
- 2000
|
|
24
|
DynamicPVM: Dynamic Load Balancing on Parallel Systems
– Dikken, Linden, et al.
- 1994
|
|
23
|
MARS - A Framework for Minimizing the Job Execution Time in a Metacomputing Environment
– Gehring, Reinefeld
- 1996
|
|
16
|
Self Adapting Software for Numerical Linear Algebra and LAPACK for Clusters. Submitted to Parallel Computing
– Chen, Dongarra, et al.
- 2003
|
|
14
|
Gang Scheduling in Heterogeneous Distributed Systems
– Saqabi, Otto, et al.
- 1994
|
|
11
|
Dynamite - Blasting Obstacles to Parallel Cluster Computing
– Albada, Clinckemaillie, et al.
- 1995
|