(Enter summary)
Abstract: This thesis describes Cilk, a parallel multithreaded language for programming contemporary
shared memory multiprocessors (SMP's). Cilk is a simple extension of
C which provides constructs for parallel control and synchronization. Cilk imposes
very low overheads --- the typical cost of spawning a parallel thread is only between
2 and 6 times the cost of a C function call on a variety of contemporary machines.
Many Cilk programs run on one processor with virtually no degradation compared
to... (Update)
Context of citations to this paper: More
...needs to be implemented on distributed memory systems such as networks of workstations. An implementation was written by Keith Randall [127] for Unix systems, but this implementation is still preliminary. The main problem is the implementation of shared memory, using BACKER...
...programming of highly irregular, fine grained parallel applications. CHAPTER 3. THE CASE FOR PERFORMANCE AVAILABILITY 25 Cilk: Cilk [100] is a parallel programming environment designed for parallel machines. Parallelism is attained by spawning extremely lightweight threads,...
Cited by: More
Portable High-Performance Programs - Frigo (1992)
(Correct)
Performance Availability for Networks of Workstations - Arpaci-Dusseau (1999)
(Correct)
Similar documents (at the sentence level): More
17.8%: Detecting Data Races in Cilk Programs that Use Locks - Cheng, Feng, Leiserson.. (1998)
(Correct)
14.0%: The Implementation of the Cilk-5 Multithreaded Language - Frigo, Leiserson, Randall (1998)
(Correct)
13.5%: Debugging Multithreaded Programs that Incorporate User-Level Locking - Stark (1998)
(Correct)
Active bibliography (related documents): More All
0.8: The Cilk System for Parallel Multithreaded Computing - Joerg (1996)
(Correct)
0.6: Computation-Centric Memory Models - Frigo, Luchangco (1998)
(Correct)
0.5: Automatic Detection Of Nondeterminacy, And Scalar Optimizations In .. - Ghosh (1992)
(Correct)
Similar documents based on text: More All
0.5: Indolent Closure Creation - Strumpen
(Correct)
0.3: Quantitative Performance Modeling of Scientific Computations and.. - Toledo (1995)
(Correct)
0.1: A Type-checking Preprocessor for Cilk 2, a Multithreaded C Language - Miller (1995)
(Correct)
Related documents from co-citation: More All
4: Shared memory consistency models: A tutorial
- Adve, Gharachorloo - 1995
3: High Performance Fortran Language Specification
- Fortran - 1992
3: Cilk: An Efficient Multithreaded Runtime System
- Blumofe, Joerg et al. - 1995
BibTeX entry: (Update)
K. H. RANDALL, Cilk: Efficient Multithreaded Computing, PhD thesis, Massachusetts Institute of Technology, 1998. http://citeseer.ist.psu.edu/randall98cilk.html More
@techreport{ randall98cilk,
author = "K. Randall",
title = "Cilk: Efficient Multithreaded Computing",
number = "MIT/LCS/TR-749",
pages = "179",
year = "1998",
url = "citeseer.ist.psu.edu/randall98cilk.html" }
Citations (may not include all citations):
3972
Introduction to Algorithms (context) - Cormen, Leiserson et al. - 1990
981
Introduction to Parallel Algorithms and Architectures: Array.. (context) - Leighton - 1992
735
A bridging model for parallel computation (context) - Valiant - 1990
723
Memory coherence in shared virtual memory systems
- Li, Hudak - 1989
606
How to make a multiprocessor computer that correctly execute.. (context) - Lamport - 1979
422
Implementation and performance of Munin
- Carter, Bennett et al. - 1991
406
TreadMarks: Distributed shared memory on standard workstatio..
- Keleher, Cox et al. - 1994
367
Computer Architecture: a Quantitative Approach (context) - Hennessy, Patterson - 1990
367
Computer Architecture: a Quantitative Approach (context) - Hennessy, Patterson - 1996
341
Parallel programming in Split-C
- Culler, Dusseau et al. - 1993
301
The Midway distributed shared memory system (context) - Bershad, Zekauskas et al. - 1993
268
Parallel algorithms for sharedmemory machines (context) - Karp, Ramachandran - 1990
268
Tempest and Typhoon: User-level shared memory
- Reinhardt, Larus et al. - 1994
230
Cilk: An efficient multithreaded runtime system
- Blumofe, Joerg et al. - 1996
230
Cilk: An efficient multithreaded runtime system
- Blumofe, Joerg et al. - 1995
219
Bounds on multiprocessing timing anomalies (context) - Graham - 1969
213
Weak ordering --- a new definition
- Adve, Hill - 1990
210
The Amber system: Parallel programming on a network of multi..
- Chase, Amador et al. - 1989
174
The parallel evaluation of general arithmetic expressions (context) - Brent - 1974
173
Lazy task creation: A technique for increasing the granulari..
- Mohr, Kranz et al. - 1991
168
Gaussian elimination is not optimal (context) - Strassen - 1969
165
Memory access buffering in multiprocessors (context) - Dubois, Scheurich et al. - 1986
162
A new approach to the maximum flow problem (context) - Goldberg, Tarjan - 1986
157
Scheduling multithreaded computations by work stealing
- Blumofe, Leiserson - 1994
145
CRL: Highperformance all-software distributed shared memory
- Johnson, Kaashoek et al. - 1995
142
Solution of a problem in concurrent programming control (context) - Dijkstra - 1965
135
Operating Systems Theory (context) - Coffman, Peter et al. - 1973
130
Memory consistency and event ordering in scalable shared-mem.. (context) - Gharachorloo, Lenoski et al. - 1990
101
The splash-2 programs: Characterization and methodological c..
- Woo, Ohara et al. - 1995
90
Programming parallel algorithms
- Blelloch - 1996
87
Software coherent shared memory on a clustered remote-write .. (context) - Stets, Dwarkadas et al. - 1997
81
The implementation of the Cilk-5 multithreaded language
- Frigo, Leiserson et al. - 1998
79
Software write detection for a distributed shared memory
- Zekauskas, Sawdon et al. - 1994
79
The design and evaluation of a shared object system for dist..
- Scales, Lam - 1994
79
SoftFLASH: Analyzing the performance of clustered distribute.. (context) - Erlichson, Nuckolls et al. - 1996
72
Implementing and programming causal distributed shared memor.. (context) - Ahamad, Hutto et al. - 1991
62
An analysis of dag-consistent distributed shared-memory algo..
- Blumofe, Frigo et al. - 1996
62
Distributed Filaments: Efficient fine-grain parallelism on a..
- Freeh, Lowenthal et al. - 1994
61
Department of Electrical Engineering and Computer Science (context) - Blumofe, Programs et al. - 1995
61
Department of Electrical Engineering and Computer Science (context) - Miller, preprocessor et al. - 1995
58
An empirical comparison of monitoring algorithms for access .. (context) - Dinning, Schonberg - 1990
48
Software-extended coherent shared memory: Performance and co..
- Chaiken, Agarwal - 1994
48
A mechanism for efficient debugging of parallel programs (context) - Miller, Choi - 1988
45
Thread scheduling for multiprogrammed multiprocessors
- Arora, Blumofe et al. - 1998
44
the complexity of event ordering for shared-memory parallel ..
- Netzer, Miller - 1990
41
Detecting access anomalies in programs with critical section.. (context) - Dinning, Schonberg - 1991
39
Provably efficient scheduling for languages with fine-graine..
- Blelloch, Gibbons et al. - 1995
37
Commutativity analysis: A new analysis framework for paralle..
- Rinard, Diniz - 1996
37
Some deadlock properties of computer systems (context) - Holt - 1972
36
The matrix market: A web resource for test matrix collection..
- Boisvert, Pozo et al. - 1997
36
The Cilk System for Parallel Multithreaded Computing
- Joerg - 1996
34
Event synchronization analysis for debugging parallel progra..
- Emrath, Ghosh et al. - 1991
33
Jade: A high-level machine-independent language for parallel..
- Rinard, Scales et al. - 1993
33
Dag-consistent distributed shared memory
- Blumofe, Frigo et al. - 1996
31
fly detection of data races for programs with nested fork-jo.. (context) - Mellor-Crummey - 1991
26
Massachusetts Institute of Technology (context) - Frigo, reasonable et al. - 1997
25
Arctic routing chip
- Boughton - 1994
24
Analyzing traces with anonymous synchronization
- Helmbold, McDowell et al. - 1990
24
ACM Letters on Programming Languages and Systems (context) - Netzer, Miller et al. - 1992
24
Tools for the efficient development of efficient parallel pr.. (context) - Nudler, Rudolph - 1986
23
The BLAZE language: A parallel language for scientific progr.. (context) - Mehrotra, Van Rosendale - 1987
22
Online data-race detection via coherency guarantees
- Perkovi'c, Keleher - 1996
19
Efficient race condition detection for shared-memory program..
- Netzer, Ghosh - 1992
18
The function of FUNCTION in LISP or why the FUNARG problem s.. (context) - Moses - 1970
17
LCM: Memory system support for parallel language implementat..
- Larus, Richards et al. - 1994
16
Polling efficiently on stock hardware
- Feeley - 1993
16
The control mechanism for the Myrias parallel computer syste.. (context) - Beltrametti, Bobey et al. - 1988
15
Detecting data races in Cilk programs that use locks (context) - Cheng, Feng et al. - 1998
15
MIMD-style parallel programming with continuation-passing th.. (context) - Halbherr, Zhou et al. - 1994
15
Efficient detection of determinacy races in Cilk programs
- Feng, Leiserson - 1997
14
Lazy threads: Implementing a fast parallel call
- Goldstein, Schauser et al. - 1996
12
Applications of path compression on balanced trees (context) - Tarjan - 1979
11
Whole-program optimization for time and space efficient thre..
- Grunwald, Neves - 1996
11
Computation-centric memory models
- Frigo, Luchangco - 1998
11
Empirical and analytic study of stack versus heap cost for l.. (context) - Appel, Shao - 1996
10
Eraser: A dynamic race detector for multi-threaded programs (context) - Savage, Burrows et al. - 1997
10
Available on the Internet from http://theory (context) - Beta, Manual
9
An efficient cache-based access anomaly detection scheme (context) - Min, Choi - 1991
8
Programming a distributed system using shared objects
- Tanenbaum, Bal et al. - 1993
8
VLSI support for a cactus stack oriented memory organization (context) - Stenstrom - 1988
8
IEEE Scalable Coherent Interface (context) - Goodman, sequential et al. - 1989
8
Location consistency: Stepping beyond the memory coherence b.. (context) - Gao, Sarkar - 1995
7
Extended static checking home page (context) - Nelson, Rustan et al. - 1996
6
Representing matrices as quadtrees for parallel processors (context) - Wise - 1985
6
MIT Artificial Intelligence Laboratory (context) - Miller, Rozas et al. - 1994
6
A one-man-year exercise in network interface engineering (context) - Hoe - 1998
5
Precedence-based memory models (context) - Luchangco - 1997
5
stack mechanism (context) - Hauck, Dent - 1968
5
Heaps o' stacks: Time and space efficient threads without op.. (context) - Grunwald - 1994
5
Detecting parallel access anomalies (context) - Fenster - 1998
4
Notes on Rubik's Magic Cube (context) - Singmaster - 1980
4
Detecting Nondeterminism in Shared Memory Parallel Programs (context) - Dinning - 1990
4
Personal communication (context) - Singh
4
Available on the Internet from http://www (context) - Bekaert, de Laet et al. - 1997
4
Personal communication (context) - Lisiecki, Medina
3
Parallel Symbolic Computing in Cid (context) - Nikhil - 1995
3
Available on the Internet from ftp://hubble (context) - Barnes, hierarchical et al.
1
Solving Rubik's cube
- Randall - 1998
Documents on the same site (http://supertech.lcs.mit.edu/cilk/papers/): More
Portable Fault-Tolerant File I/O - Lyubashevskiy
(Correct)
Debugging Multithreaded Programs that Incorporate User-Level Locking - Stark (1998)
(Correct)
Scheduling Adaptively Parallel Jobs - Song (1998)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC