(Enter summary)
Abstract: An important issue in the parallel execution of loops
is how to partition and schedule the loops onto the
available processors. While most existing dynamic
scheduling algorithms manage load imbalances well,
they fail to take locality into account and therefore
perform poorly on parallel systems with non-uniform
memory access times. In this paper, we propose
a new loop scheduling algorithm, Locality-based Dynamic
Scheduling (LDS), that exploits locality, and
dynamically balances the load.
Key... (Update)
Similar documents based on text: More All
0.3: Automatic Computation and Data Partitioning . . . - Tandri
(Correct)
0.2: Experiences with Data Distribution on NUMA Shared Memory.. - Tandri, Abdelrahman
(Correct)
0.1: Disk Striping Scalability in the Exedra Media Server - Anastasiadis, Sevcik, Stumm (2001)
(Correct)
BibTeX entry: (Update)
H. Li, S. Tandri, M. Stumm and K.C. Sevcik, "Locality and Loop Scheduling on NUMA Multiprocessors," Int. Conf. on Parallel Processing, pp. II-140-II-147, Aug. 1993. http://citeseer.ist.psu.edu/li93locality.html More
@inproceedings{ li93locality,
author = "Hui Li and Sudarsan Tandri and Michael Stumm and Kenneth C. Sevcik",
title = "Locality and Loop Scheduling on {NUMA} Multiprocessors",
booktitle = "Proceedings of the 1993 International Conference on Parallel Processing",
volume = "II - Software",
publisher = "CRC Press",
address = "Boca Raton, FL",
pages = "II--140--II--147",
year = "1993",
url = "citeseer.ist.psu.edu/li93locality.html" }
Citations (may not include all citations):
478
The Stanford DASH multiprocessor (context) - Lenoski, Laudon et al. - 1992
121
Using processor affinity in loop scheduling on shared-memory..
- Markatos, LeBlanc - 1992
99
Allocating independent subtasks on parallel processors (context) - Kruskal, Weiss - 1985
77
Ultracomputers: A teraflop before its time (context) - Bell - 1992
67
Hector: A hierarchically structured shared memory multiproce..
- Vranesic, Stumm et al. - 1991
44
NUMA policies and their relation to memory architecture (context) - Bolosky, Scott et al. - 1991
36
Dynamic loopscheduling for shared memory multiprocessors (context) - Tzen, Ni - 1991
22
Factoring: A method for scheduling parallel loops (context) - Hummel, Schonberg et al. - 1992
16
Guided self scheduling: A practical scheduling scheme for pa.. (context) - Polychronopoulos, Kuck - 1987
15
Experiences with the Hector multiprocessor
- Stumm, Vranesic et al. - 1993
10
Parallel Computing: Theory and Comparisons (context) - Lipovski, Malek - 1987
10
Parallel Computing: Theory and Comparisons (context) - Lipovski, Malek - 1987
3
A comparison of basic CPU scheduling algorithms for multipro..
- Curran, Stumm - 1990
3
Parallel: A Survey of Available Parallel Computer Systems (context) - Trew, Wilson et al. - 1991
The graph only includes citing articles where the year of publication is known.
Documents on the same site (http://www.eecg.toronto.edu/~stumm/papers.html): More
Hector - A Hierarchically Structured Shared Memory Multiprocessor - Stumm (1991)
(Correct)
CDA Loop Transformations - Kulkarni, Stumm (1995)
(Correct)
Linear Loop Transformations in Optimizing Compilers for.. - Kulkarni, Stumm (1995)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC