(Enter summary)
Abstract: processors
Physical
processors
Alignment
Distribution
dependent mapping
Implementation
Figure 3.4 HPF approach to data partition and distribution.
states that iteration i is to be executed by the processor to which A(i) is assigned. Therefore
processor p 1 executes iterations f1; 2; 3; 4g. The ON clause is a feature borrowed from the
language Kali [25].
3.1.3 HPF
The High Performance Fortran (HPF) [6, 26, 27] language was designed as a set of extensions
and modifications to Fortran 90 ... (Update)
Context of citations to this paper: More
...proposed scheduling policy. Most well known scheduling policies address simple loop parallelism, and although there has been evidence [More95], that functional parallelism can be present in substantial amounts in certain applications, little has been done to address the...
...Programmers have to learn new syntactic constructs and new semantics to use these language extensions. On the other hand, Poly93][More95] propose to automatically parallelize applications written in standard languages (e.g. C and FORTRAN) They are based on the nano threads...
Cited by: More
Simultaneous Exploitation of Task and Data Parallelism in.. - Ramaswamy (1996)
(Correct)
Analysis of Several Scheduling Algorithms under the.. - Martorell, Jesus (1997)
(Correct)
Scheduling User-Level Threads on Distributed.. - Eleftherios.. (1998)
(Correct)
Active bibliography (related documents): More All
0.8: On the Implementation and Effectiveness of Autoscheduling - Moreira (1995)
(Correct)
0.6: The Performance Impact of Granularity Control - And Functional Parallelism
(Correct)
0.5: A Framework for Exploiting Data and Functional.. - Ramaswamy.. (1994)
(Correct)
Similar documents based on text: More All
0.5: Autoscheduling in a Shared Memory Multiprocessor - Moreira, Polychronopoulos
(Correct)
0.4: Autoscheduling in a Distributed Shared-Memory Environment - Jos'e Moreira (1994)
(Correct)
0.3: Enhancing the Performance of Autoscheduling in.. - Nikolopoulos.. (1998)
(Correct)
Related documents from co-citation: More All
9: nanoThreads: A User-Level Threads Architecture (context) - Polychronopoulos, Bitar et al. - 1993
7: A Library Implementation of the Nano-Threads Programming Model
- Martorell, Labarta et al. - 1996
7: Tools and techniques for building fast portable threads packages
- Keppel - 1993
BibTeX entry: (Update)
J.E. Moreira, On the Implementation and Effectiveness of Autoscheduling for Shared-Memory Multiprocessors, PhD. thesis, Department of Electrical and Computer Engineering, Univ. of Illinois at Urbana-Champaign, 1995. http://citeseer.ist.psu.edu/moreira95implementation.html More
@misc{ moreira95implementation,
author = "J. Moreira",
title = "the Implementation and Effectiveness of Autoscheduling for Shared-Memory
Multiprocessors",
text = "J.E. Moreira, On the Implementation and Effectiveness of Autoscheduling
for Shared-Memory Multiprocessors, PhD. thesis, Department of Electrical
and Computer Engineering, Univ. of Illinois at Urbana-Champaign, 1995.",
year = "1995",
url = "citeseer.ist.psu.edu/moreira95implementation.html" }
Citations (may not include all citations):
2003
The Art of Computer Programming (context) - Knuth - 1973
2003
The Art of Computer Programming (context) - Knuth - 1973
1575
Computer Architecture: A Quantitative Approach (context) - Patterson, Hennessy - 1990 ACM
1399
Compilers -- Principles (context) - Aho, Sethi et al. - 1986
835
High Performance Fortran Language Specification
- Fortran - 1993 ACM
537
Graph Theory with Applications (context) - Bondy, Murty - 1976
353
Software pipelining: An effective scheduling technique for V.. (context) - Lam - 1988 ACM DBLP
247
Partitioning and Scheduling Parallel Programs for Multiproce.. (context) - Sarkar - 1989
216
Performance of various computers using standard linear equat..
- Dongarra - 1994 ACM
202
Distributed shared memory: A survey of issues and algorithms (context) - Nitzberg, Lo - 1991 DBLP
146
Parallelizing programs with recursive data structures (context) - Hendren, Nicolau - 1989 ACM DBLP
146
Demonstration of automatic data partitioning techniques for ..
- Gupta, Banerjee - 1992
142
Guided self-scheduling: A practical scheduling scheme for pa.. (context) - Polychronopoulos, Kuck - 1987 ACM DBLP
121
Monsoon: an explicit token-store architecture (context) - Papadopoulos, Culler - 1990
120
The performance implications of thread management alternativ.. (context) - Anderson, Lazowska et al. - 1989 ACM DBLP
94
Performance analysis of parallelizing compilers on the Perfe.. (context) - Blume, Eigenmann - 1992 ACM DBLP
90
Compiler support for machineindependent parallel programming..
- Hiranandani, Kennedy et al. - 1991
89
Supporting shared data structures on distributed memory mach.. (context) - Koelbel, Mehrotra et al. - 1990
87
Compile-time techniques for data distribution in distributed..
- Ramanujam, Sadayappan - 1991 ACM DBLP
78
High Performance Fortran (context) - Loveman - 1993 ACM
77
Ultracomputers: A teraflop before its time (context) - Bell - 1992 ACM DBLP
73
Exploiting task and data parallelism on a multicomputer
- Subhlock, Stichnoth et al. - 1993 ACM
71
Fortran D language specification
- Fox, Hiranandani et al. - 1990
68
A linear algebra framework for static HPF code distribution
- Angourt, Coelho et al. - 1993
64
Interprocedural compilation of Fortran D for MIMD distribute..
- Hall, Hiranandani et al. - 1992
63
Compiling for distributed-memory systems (context) - Zima, Chapman - 1993
63
An architecture of a dataflow single chip processor (context) - Sakai, Yamaguchi et al. - 1989 ACM DBLP
62
The transitive reduction of a directed graph (context) - Aho, Garey et al. - 1972 DBLP
59
MD: The Johns Hopkins University Press (context) - Golub, Loan et al. - 1989
57
Compiling Fortran D for MIMD distributed-memory machines
- Hiranandani, Kennedy et al. - 1992 ACM
47
MPP Fortran programming model
- Pase, MacDonald et al. - 1994
47
A software architecture for multidisciplinary applications: ..
- Chapman, Mehrotra et al. - 1994 DBLP
45
Multithreading: A revisionist view of dataflow architectures
- Papadopoulos, Traub - 1991
43
Data Structures = Programs (context) - Wirth - 1976
42
Symbolic Analysis for Parallelizing Compilers
- Haghighat - 1994 ACM DBLP
37
Polaris: The next generation in parallelizing compilers
- Blume, Eigenmann et al. - 1994
37
The Polaris internal representation
- Faigin, Hoeflinger et al. - 1994 ACM
32
A convex programming approach for exploiting data and functi..
- Ramaswamy, Sapatnekar et al. - 1994 DBLP
30
Communication and memory requirements as the basis for mappi..
- Subhlok, O'Hallaron et al. - 1994 ACM DBLP
28
Functional Parallelism: Theoretical Foundations and Implemen.. (context) - Girkar - 1992
28
Low-overhead scheduling of nested parallelism (context) - Hummel, Schonberg - 1990 ACM
26
A survey of parallel computer architectures (context) - Duncan - 1990 ACM DBLP
26
Kendall Square Research Technical Summary (context) - Research - 1992
25
Memory allocation costs in large C and C++ programs
- Detlefs, Dosser et al. - 1993 ACM DBLP
24
Can dataflow subsume von Neumann computing (context) - Nikhil - 1989
23
A fast static scheduling algorithm for DAGs on an unbounded ..
- Yang, Gerasoulis - 1991 ACM DBLP
23
A Fortran-to-C converter
- Feldman, Gay et al. - 1993
23
COOL: A language for parallel programming (context) - Chandra, Gupta et al. - 1989 ACM
22
Clustering task graphs for message passing architectures
- Gerasoulis, Venugopal et al. - 1990 ACM DBLP
22
Fortran M: A language for modular parallel programming
- Foster, Chandy - 1992 DBLP
20
Parallelism in object-oriented languages: A survey (context) - Wyatt, Kavi et al. - 1992 DBLP
19
Cray TD System Architecture Overview Manual (context) - Inc, Architecture et al. - 1993
19
Automatic partitioning of a program dependence graph into pa.. (context) - Sarkar - 1991 ACM
19
PYRROS: Static scheduling and code generation for message pa.. (context) - Yang, Gerasoulis - 1992
18
PTRAN - the IBM parallel translation system (context) - Sarkar - 1990
18
Toward a dataflow/von Neumann hybrid architecture (context) - Iannucci - 1988 ACM DBLP
18
Hardware and Software for Functional and Fine Grain Parallel..
- Beckmann - 1993 ACM
18
Making effective use of shared-memory multiprocessors: The p.. (context) - Gupta, Tucker et al. - 1991
18
Parallel program graphs and their classification (context) - Sarkar, Simons - 1993
16
Microarchitecture support for dynamic scheduling of acyclic ..
- Beckmann, Polychronopoulos - 1992 ACM DBLP
15
Processor allocation and scheduling of macro dataflow graphs..
- Ramaswamy, Banerjee - 1993 DBLP
14
Scheduling for Locality in Shared-Memory Multiprocessors (context) - Markatos - 1993 ACM
14
A hierarchical task queue organization for sharedmemory mult.. (context) - Dandamudi, Cheng - 1995
14
Distributed pC++: Basic ideas for an object parallel languag..
- Bodin, Beckmann et al. - 1994
14
CHARM++: A portable concurrent object oriented system based ..
- Kale, Krishnan - 1993
12
Compositional C++: Compositional parallel programming (context) - Chandy, Kesselman - 1992 ACM DBLP
12
Performance optimizations, implementation, and verification .. (context) - Galles, Williams - 1994 DBLP
12
Memory Latency Reduction via Data Prefetching and Data Forwa.. (context) - Poulsen - 1994 ACM
11
Compiling for distributed memory architectures
- Rogers, Pingali - 1994 ACM DBLP
10
A framework for exploiting data and functional parallelism o..
- Ramaswamy, Sapatnekar et al. - 1994
9
Automatic detection and generation of unstructured paralleli.. (context) - Girkar, Polychronopoulos - 1992
9
A concurrent execution semantics for parallel program graphs.. (context) - Sarkar - 1992
8
Large-scale computer simulation of fully developed channel f.. (context) - Lyons, Hanratty et al. - 1991
8
Evaluation of a prototype data flow processor of the SIGMA-1.. (context) - Shimada, Hiraki et al. - 1986
8
VLSI support for cactus stack oriented memory organization (context) - Stenstrom - 1988
7
Multiprocessing versus multiprogramming (context) - Polychronopoulos - 1989 DBLP
7
Switch-stacks: A scheme for microtasking nested parallel loo.. (context) - Chow, Harrison - 1990 DBLP
6
Compile-time techniques for processor allocation in macro da.. (context) - Prasanna, Agarwal - 1992 DBLP
6
Parallel processing of near fine grain tasks using static sc.. (context) - Kasahara, Honda et al. - 1990
6
Vienna Fortran 90 (context) - Benkner, Chapman et al. - 1992
5
Concurrent aggregates (CA (context) - Chien, Dally - 1990
4
The Epsilon-2 hybrid dataflow architecture (context) - Grafe, Hoch - 1990
4
The parallel C (pC) programming language (context) - Canetti, Fertig et al. - 1991 ACM
4
Auto-scheduling: Control flow and data flow come together (context) - Polychronopoulos - 1990
4
C**: A large-grain, object oriented, data-parallel programmi..
- Larus - 1992
4
Multiprogrammed multiprocessor scheduling issues (context) - Leutenegger, Vernon
4
Computer experiments on classical fluids. thermodynamical pr.. (context) - Verlet - 1968
4
Experiences implementing dataflow on a general-purpose paral..
- Spertus, Dally - 1991 DBLP
4
High Performance Fortran: A perspective (context) - Wylie, Norman et al. - 1992
3
A low overhead coherent solution for multiprocessors with pr.. (context) - Papamarcos, Patel - 1984
3
New method for searching for neighbors in molecular dynamics.. (context) - Quentrec, Brot - 1975
3
Fortran 77 Programmer's Guide (context) - Graphics, Mountain et al.
3
User defined mappings in Vienna Fortran (context) - Chapman, Mehrotra et al. - 1993 ACM DBLP
2
Product Overview (context) - Microsystems, Mountain et al. - 2000
2
Department of Electrical Engineering (context) - Goldschmidt, Multiprocessors et al. - 1993
2
Towards automatic derivation of data distributions from tens.. (context) - Gupta, Kaushik et al. - 1992
2
IRIS Power C User's Guide (context) - Graphics, Mountain et al.
2
Available from http://www (context) - Research, Cray et al. - 1995
1
Department of Electrical Engineering and Computer Science (context) - Schauser, into et al. - 1991 ACM
1
Concurrent aggregates (CA) language report (context) - Chien, Karamcheti et al. - 1993
1
Parallelization of TRFD (context) - Andrews - 1991
1
Storage management in IBM APL systems (context) - Trimble - 1991 ACM DBLP
1
CONCENTRIX System Reference Manual (context) - Systems, Acton - 1986
1
System Description (context) - Systems, Littleton et al. - 1991 ACM
1
Efficient vector processing on a dataflow supercomputer SIGM.. (context) - Hiraki, Sekiguchi et al. - 1988
1
Corporate business servers: An alterntive to mainframes for .. (context) - Alexander, Robertson et al. - 1994
1
Computer Architecture and Organization (context) - Lorin - 1989 ACM
1
Cedar Fortran programmer's manual
- Hoeflinger - 1993
1
Land avoidance and load balancing in ocean simulation (context) - DeRose, Gallivan et al. - 1993
1
Exemplar SPP (context) - Corporation, TX - 1994
1
LIDEX: A system for description, simulation and analysis of .. (context) - Moreira - 1990
1
A hardware design of the SIGMA-1, a data flow computer for s.. (context) - Hiraki, Shimada et al. - 1984 ACM
1
Hybrid dataflow architecture (context) - Hiraki
1
Automatic mapping of task and data parallel programs for eff.. (context) - Subhlock - 1993 ACM
1
Compiling Fortran 77D and 90D for MIMD distributed-memory ma.. (context) - Choudry, Hiranandani et al. - 1992
The graph only includes citing articles where the year of publication is known.
Documents on the same site (http://www.csrd.uiuc.edu/parafrase2/p2papers2.html):
Chief: A Simulation Environment for Studying Parallel Systems - Pavlos Konas (1994)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC