MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  Process Tracking for Parallel Job Control

Download:
pdf | ps
by Hubertus Franke, Jose E. Moreira, Pratap Pattnaik
http://www.cs.huji.ac.il/~feit/parsched/p-99-8.ps.gz
Add To MetaCart

Abstract:

Abstract. Job management subsystems in parallel environments have to address two important issues: (i) how to associate processes present in the system to the tasks of parallel jobs, and (ii) how to control execution of these tasks. The standard UNIX mechanism for job control, process groups, is not appropriate for this purpose as processes can escape their original groups and start new ones. We introduce the concept of genealogy, in which a process is identified by the genetic footprint it inherits from its parent. With this concept, tasks are defined by sets of processes with a common ancestor. Process tracking is the mechanism by which we implement the genealogy concept in the IBM AIX operating system. No changes to the kernel are necessary and individual process control is achieved through standard UNIX signaling methods. Performance evaluation, on both uniprocessor and multiprocessor systems, demonstrate the efficacy of job control through process tracking. Process tracking has been incorporated in a research prototype gang-scheduling system for the IBM RS/6000 SP. 1

Citations

799 Condor - A Hunter of Idle Workstations – Litzkow, Livny, et al. - 1988
293 Network Time Protocol (Version 3): Specification, Implementation and Analysis", RFC 1305 – Mills - 1992
240 Scheduling Techniques for Concurrent Systems – Ousterhout - 1982
188 The NAS Parallel Benchmarks 2.0 – Bailey, Harris, et al. - 1995
136 A Survey of Scheduling in Multiprogrammed Parallel Systems – Feitelson - 1994
110 The ANL/IBM SP scheduling system – Lifka - 1995
107 UNIX Internals, The New Frontiers – Vahalia - 1996
101 Effective Distributed Scheduling of Parallel Workloads – Dusseau, Arpaci, et al. - 1996
90 Improved utilization and responsiveness with gang scheduling – Feitelson, Jette - 1997
70 Demand-based Coscheduling of Parallel Jobs on Multiprogrammed Multiprocessors – Sobalvarro - 1997
48 Dynamic Coscheduling on Workstation Clusters – Sobalvarro, Pakin, et al. - 1998
35 Adaptive algorithms for managing a distributed data processing workload – Aman, Eilert, et al. - 1997
32 Evaluation of design choices for gang scheduling using distributed hierarchical control – Feitelson, Rudolph - 1996
16 The EASY-Loadleveller API Project – Skovira, Chan, et al. - 1996
12 Improving First-Come-First-Serve Job Scheduling by Gang Scheduling – Schwiegelshohn, Yahyapour - 1998
11 Time sharing massively parallel machines – Gorda, Wolski - 1995
11 Extensible Resource Management for Cluster Computing – Islam, Prodromidis, et al. - 1996
8 An infrastructure for efficient parallel job execution in terascale computing environments – Moreira, Chan, et al. - 1998
6 Implementing the Combination of Time Sharing and Space Sharing on AP/Linux – Suzaki, Walsh - 1998
5 Expanding symmetric multiprocessor capability through gang scheduling – Jette - 1998
1 Gang Scheduling for Highly Efficient Multiprocessors – Franke, Pattnaik, et al. - 1996