MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  Attacking the Bottlenecks of Backfilling Schedulers (2000) [10 citations — 0 self]

Download:
Download as a PDF | Download as a PS
by Dmitry Zotkin, Peter J. Keleher, Dejan Perkovic
Cluster Computing
http://www.cs.umd.edu/~keleher/papers/cluster00-keleher.ps
Add To MetaCart

Abstract:

Backfilling is a simple and effective way of improving the utilization of space-sharing schedulers. Simple firstcome-first-served approaches are ineffective because large jobs can fragment the available resources. Backfilling schedulers address this problem by allowing jobs to move ahead in the queue, provided that they will not delay subsequent jobs. Previous research has shown that inaccurate estimates of execution times can lead to better backfilling schedules. In the first part of this study, we characterize this effect on several workloads, and show that average slowdowns can be effectively reduced by systematically lengthening estimated execution times. Further, we show that the average job slowdown metric can be addressed directly by sorting jobs by increasing execution time. Finally, we modify our sorting scheduler to ensure that incoming jobs can be given hard guarantees. The resulting scheduler guarantees to avoid starvation, and performs significantly better than previous backfilling schedulers. In the second part of this study, we show how queue randomization and even more a combination of queue randomization and sorting by job length can improve performance. We show that these improvements are better then with queue sorting by job length alone in the simulation with actual estimates of job running times. We investigate the real characteristics of these estimates, and show the wide range of overestimation. To exploit even more randomization and queue sorting, we eliminate guarantees from backfilling algorithm, and show significant improvements. Finally, we show a limited usefulness of these guarantees, and show that queue sorting criteria can be modified to prevent starvation in the modified backfilling algorithm. 1

Citations

110 The ANL/IBM SP scheduling system – Lifka - 1995
90 Improved utilization and responsiveness with gang scheduling – Feitelson, Jette - 1997
89 Theory and practice in parallel job scheduling, in job scheduling strategies for parallel processing – Feitelson, Rudolph, et al. - 1995
48 Utilization and predictability in scheduling the IBM SP2 with backfilling – Feitelson, Weil - 1998
7 Imprecise calendars: an approach to scheduling computational grids – Hollingsworth, Maneewongvatana - 1999