
On the Implementation and Effectiveness of Autoscheduling for Shared-Memory Multiprocessors (1995)  (17 citations)
José Eduardo Moreira




 
View or download:
uiuc.edu/pub/misc/mor...phdthesis.ps.gz
uiuc.edu/pub/CSRD_Reports/...1404.ps.gz

From:  uiuc.edu/parafrase2/p2papers2
From:  uiuc.edu

Abstract: [Figure 3.4: HPF approach to data partition and distribution. Labels: processors, physical processors, alignment, distribution, implementation-dependent mapping.] ... states that iteration i is to be executed by the processor to which A(i) is assigned. Therefore processor p_1 executes iterations {1, 2, 3, 4}. The ON clause is a feature borrowed from the language Kali [25]. 3.1.3 HPF. The High Performance Fortran (HPF) [6, 26, 27] language was designed as a set of extensions and modifications to Fortran 90 ...

Context of citations to this paper:

...proposed scheduling policy. Most well known scheduling policies address simple loop parallelism, and although there has been evidence [More95] that functional parallelism can be present in substantial amounts in certain applications, little has been done to address the...

...Programmers have to learn new syntactic constructs and new semantics to use these language extensions. On the other hand, [Poly93][More95] propose to automatically parallelize applications written in standard languages (e.g. C and FORTRAN). They are based on the nano threads...

Cited by:
Simultaneous Exploitation of Task and Data Parallelism in.. - Ramaswamy (1996)
Analysis of Several Scheduling Algorithms under the.. - Martorell, Jesus (1997)
Scheduling User-Level Threads on Distributed.. - Eleftherios.. (1998)

Active bibliography (related documents):
0.8:   On the Implementation and Effectiveness of Autoscheduling - Moreira (1995)
0.6:   The Performance Impact of Granularity Control - And Functional Parallelism
0.5:   A Framework for Exploiting Data and Functional.. - Ramaswamy.. (1994)

Similar documents based on text:
0.5:   Autoscheduling in a Shared Memory Multiprocessor - Moreira, Polychronopoulos
0.4:   Autoscheduling in a Distributed Shared-Memory Environment - José Moreira (1994)
0.3:   Enhancing the Performance of Autoscheduling in.. - Nikolopoulos.. (1998)

Related documents from co-citation:
9:   nanoThreads: A User-Level Threads Architecture - Polychronopoulos, Bitar et al. - 1993
7:   A Library Implementation of the Nano-Threads Programming Model - Martorell, Labarta et al. - 1996
7:   Tools and techniques for building fast portable threads packages - Keppel - 1993

BibTeX entry:

J.E. Moreira, On the Implementation and Effectiveness of Autoscheduling for Shared-Memory Multiprocessors, Ph.D. thesis, Department of Electrical and Computer Engineering, Univ. of Illinois at Urbana-Champaign, 1995. http://citeseer.ist.psu.edu/moreira95implementation.html

@misc{ moreira95implementation,
  author = "J. Moreira",
  title = "On the Implementation and Effectiveness of Autoscheduling for Shared-Memory
    Multiprocessors",
  text = "J.E. Moreira, On the Implementation and Effectiveness of Autoscheduling
    for Shared-Memory Multiprocessors, PhD. thesis, Department of Electrical
    and Computer Engineering, Univ. of Illinois at Urbana-Champaign, 1995.",
  year = "1995",
  url = "citeseer.ist.psu.edu/moreira95implementation.html" }
Citations (may not include all citations):
2003   The Art of Computer Programming - Knuth - 1973
1575   Computer Architecture: A Quantitative Approach - Patterson, Hennessy - 1990
1399   Compilers: Principles, Techniques, and Tools - Aho, Sethi et al. - 1986
835   High Performance Fortran Language Specification - High Performance Fortran Forum - 1993
537   Graph Theory with Applications - Bondy, Murty - 1976
353   Software pipelining: An effective scheduling technique for V.. - Lam - 1988
247   Partitioning and Scheduling Parallel Programs for Multiproce.. - Sarkar - 1989
216   Performance of various computers using standard linear equat.. - Dongarra - 1994
202   Distributed shared memory: A survey of issues and algorithms - Nitzberg, Lo - 1991
146   Parallelizing programs with recursive data structures - Hendren, Nicolau - 1989
146   Demonstration of automatic data partitioning techniques for .. - Gupta, Banerjee - 1992
142   Guided self-scheduling: A practical scheduling scheme for pa.. - Polychronopoulos, Kuck - 1987
121   Monsoon: an explicit token-store architecture - Papadopoulos, Culler - 1990
120   The performance implications of thread management alternativ.. - Anderson, Lazowska et al. - 1989
94   Performance analysis of parallelizing compilers on the Perfe.. - Blume, Eigenmann - 1992
90   Compiler support for machine-independent parallel programming.. - Hiranandani, Kennedy et al. - 1991
89   Supporting shared data structures on distributed memory mach.. - Koelbel, Mehrotra et al. - 1990
87   Compile-time techniques for data distribution in distributed.. - Ramanujam, Sadayappan - 1991
78   High Performance Fortran - Loveman - 1993
77   Ultracomputers: A teraflop before its time - Bell - 1992
73   Exploiting task and data parallelism on a multicomputer - Subhlok, Stichnoth et al. - 1993
71   Fortran D language specification - Fox, Hiranandani et al. - 1990
68   A linear algebra framework for static HPF code distribution - Ancourt, Coelho et al. - 1993
64   Interprocedural compilation of Fortran D for MIMD distribute.. - Hall, Hiranandani et al. - 1992
63   Compiling for distributed-memory systems - Zima, Chapman - 1993
63   An architecture of a dataflow single chip processor - Sakai, Yamaguchi et al. - 1989
62   The transitive reduction of a directed graph - Aho, Garey et al. - 1972
59   Matrix Computations - Golub, Van Loan - 1989
57   Compiling Fortran D for MIMD distributed-memory machines - Hiranandani, Kennedy et al. - 1992
47   MPP Fortran programming model - Pase, MacDonald et al. - 1994
47   A software architecture for multidisciplinary applications: .. - Chapman, Mehrotra et al. - 1994
45   Multithreading: A revisionist view of dataflow architectures - Papadopoulos, Traub - 1991
43   Algorithms + Data Structures = Programs - Wirth - 1976
42   Symbolic Analysis for Parallelizing Compilers - Haghighat - 1994
37   Polaris: The next generation in parallelizing compilers - Blume, Eigenmann et al. - 1994
37   The Polaris internal representation - Faigin, Hoeflinger et al. - 1994
32   A convex programming approach for exploiting data and functi.. - Ramaswamy, Sapatnekar et al. - 1994
30   Communication and memory requirements as the basis for mappi.. - Subhlok, O'Hallaron et al. - 1994
28   Functional Parallelism: Theoretical Foundations and Implemen.. - Girkar - 1992
28   Low-overhead scheduling of nested parallelism - Hummel, Schonberg - 1990
26   A survey of parallel computer architectures - Duncan - 1990
26   Kendall Square Research Technical Summary - Kendall Square Research - 1992
25   Memory allocation costs in large C and C++ programs - Detlefs, Dosser et al. - 1993
24   Can dataflow subsume von Neumann computing - Nikhil - 1989
23   A fast static scheduling algorithm for DAGs on an unbounded .. - Yang, Gerasoulis - 1991
23   A Fortran-to-C converter - Feldman, Gay et al. - 1993
23   COOL: A language for parallel programming - Chandra, Gupta et al. - 1989
22   Clustering task graphs for message passing architectures - Gerasoulis, Venugopal et al. - 1990
22   Fortran M: A language for modular parallel programming - Foster, Chandy - 1992
20   Parallelism in object-oriented languages: A survey - Wyatt, Kavi et al. - 1992
19   Cray T3D System Architecture Overview Manual - Cray Research, Inc. - 1993
19   Automatic partitioning of a program dependence graph into pa.. - Sarkar - 1991
19   PYRROS: Static scheduling and code generation for message pa.. - Yang, Gerasoulis - 1992
18   PTRAN - the IBM parallel translation system - Sarkar - 1990
18   Toward a dataflow/von Neumann hybrid architecture - Iannucci - 1988
18   Hardware and Software for Functional and Fine Grain Parallel.. - Beckmann - 1993
18   Making effective use of shared-memory multiprocessors: The p.. - Gupta, Tucker et al. - 1991
18   Parallel program graphs and their classification - Sarkar, Simons - 1993
16   Microarchitecture support for dynamic scheduling of acyclic .. - Beckmann, Polychronopoulos - 1992
15   Processor allocation and scheduling of macro dataflow graphs.. - Ramaswamy, Banerjee - 1993
14   Scheduling for Locality in Shared-Memory Multiprocessors - Markatos - 1993
14   A hierarchical task queue organization for shared-memory mult.. - Dandamudi, Cheng - 1995
14   Distributed pC++: Basic ideas for an object parallel languag.. - Bodin, Beckmann et al. - 1994
14   CHARM++: A portable concurrent object oriented system based .. - Kale, Krishnan - 1993
12   Compositional C++: Compositional parallel programming - Chandy, Kesselman - 1992
12   Performance optimizations, implementation, and verification .. - Galles, Williams - 1994
12   Memory Latency Reduction via Data Prefetching and Data Forwa.. - Poulsen - 1994
11   Compiling for distributed memory architectures - Rogers, Pingali - 1994
10   A framework for exploiting data and functional parallelism o.. - Ramaswamy, Sapatnekar et al. - 1994
9   Automatic detection and generation of unstructured paralleli.. - Girkar, Polychronopoulos - 1992
9   A concurrent execution semantics for parallel program graphs.. - Sarkar - 1992
8   Large-scale computer simulation of fully developed channel f.. - Lyons, Hanratty et al. - 1991
8   Evaluation of a prototype data flow processor of the SIGMA-1.. - Shimada, Hiraki et al. - 1986
8   VLSI support for cactus stack oriented memory organization - Stenstrom - 1988
7   Multiprocessing versus multiprogramming - Polychronopoulos - 1989
7   Switch-stacks: A scheme for microtasking nested parallel loo.. - Chow, Harrison - 1990
6   Compile-time techniques for processor allocation in macro da.. - Prasanna, Agarwal - 1992
6   Parallel processing of near fine grain tasks using static sc.. - Kasahara, Honda et al. - 1990
6   Vienna Fortran 90 - Benkner, Chapman et al. - 1992
5   Concurrent aggregates (CA) - Chien, Dally - 1990
4   The Epsilon-2 hybrid dataflow architecture - Grafe, Hoch - 1990
4   The parallel C (pC) programming language - Canetti, Fertig et al. - 1991
4   Auto-scheduling: Control flow and data flow come together - Polychronopoulos - 1990
4   C**: A large-grain, object oriented, data-parallel programmi.. - Larus - 1992
4   Multiprogrammed multiprocessor scheduling issues - Leutenegger, Vernon
4   Computer experiments on classical fluids. thermodynamical pr.. - Verlet - 1968
4   Experiences implementing dataflow on a general-purpose paral.. - Spertus, Dally - 1991
4   High Performance Fortran: A perspective - Wylie, Norman et al. - 1992
3   A low overhead coherent solution for multiprocessors with pr.. - Papamarcos, Patel - 1984
3   New method for searching for neighbors in molecular dynamics.. - Quentrec, Brot - 1975
3   Fortran 77 Programmer's Guide - Graphics, Mountain et al.
3   User defined mappings in Vienna Fortran - Chapman, Mehrotra et al. - 1993
2   Product Overview - Microsystems, Mountain et al. - 2000
2   Department of Electrical Engineering - Goldschmidt, Multiprocessors et al. - 1993
2   Towards automatic derivation of data distributions from tens.. - Gupta, Kaushik et al. - 1992
2   IRIS Power C User's Guide - Graphics, Mountain et al.
2   Available from http://www - Research, Cray et al. - 1995
1   Department of Electrical Engineering and Computer Science - Schauser, into et al. - 1991
1   Concurrent aggregates (CA) language report - Chien, Karamcheti et al. - 1993
1   Parallelization of TRFD - Andrews - 1991
1   Storage management in IBM APL systems - Trimble - 1991
1   CONCENTRIX System Reference Manual - Systems, Acton - 1986
1   System Description - Systems, Littleton et al. - 1991
1   Efficient vector processing on a dataflow supercomputer SIGM.. - Hiraki, Sekiguchi et al. - 1988
1   Corporate business servers: An alternative to mainframes for .. - Alexander, Robertson et al. - 1994
1   Computer Architecture and Organization - Lorin - 1989
1   Cedar Fortran programmer's manual - Hoeflinger - 1993
1   Land avoidance and load balancing in ocean simulation - DeRose, Gallivan et al. - 1993
1   Exemplar SPP - Corporation, TX - 1994
1   LIDEX: A system for description, simulation and analysis of .. - Moreira - 1990
1   A hardware design of the SIGMA-1, a data flow computer for s.. - Hiraki, Shimada et al. - 1984
1   Hybrid dataflow architecture - Hiraki
1   Automatic mapping of task and data parallel programs for eff.. - Subhlok - 1993
1   Compiling Fortran 77D and 90D for MIMD distributed-memory ma.. - Choudhary, Hiranandani et al. - 1992





Documents on the same site (http://www.csrd.uiuc.edu/parafrase2/p2papers2.html):
Chief: A Simulation Environment for Studying Parallel Systems - Pavlos Konas (1994)
