(Enter summary)
Abstract: This paper presents our experience mapping OpenMP parallel
programming model to the IBM Cyclops-64 (C64) architecture.
The C64 employs a many-core-on-a-chip design
that integrates processing logic (160 thread units), embedded
memory (5MB) and communication hardware on the
same die. Such a unique architecture presents new opportunities
for optimization. Specifically, we consider the following
three areas: (1) a memory aware runtime library that
places frequently used data structures in... (Update)
Cited by: More
Performance Characteristics of OpenMP Language Constructs on .. - Zhu, Cuvillo, Gao (2006)
(Correct)
Active bibliography (related documents): More All
0.6: Lock-Free and Practical Deques using Single-Word.. - Sundell, Tsigas (2004)
(Correct)
0.5: A Scalable Elimination-based Exchange Channel - III, Lea, Scott (2005)
(Correct)
0.4: Preemption Adaptivity in Time-Published Queue-Based Spin Locks - He, III, Scott (2005)
(Correct)
Similar documents based on text:
6.0: Unknown -
(Correct)
BibTeX entry: (Update)
del Cuvillo, J., Zhu, W., Gao, G.R.: Landing OpenMP on Cyclops-64: An efficient mapping of OpenMP to a many-core system-on-a-chip. In: Proceedings of the 3rd ACM International Conference on Computing Frontiers, Ischia, Italy (2006) http://citeseer.ist.psu.edu/delcuvillo06landing.html More
@misc{ cuvillo06landing,
author = "d Cuvillo and J. Zhu and W. Gao",
title = "Landing OpenMP on Cyclops-64: An efficient mapping of OpenMP to a many-core
system-on-a-chip",
text = "del Cuvillo, J., Zhu, W., Gao, G.R.: Landing OpenMP on Cyclops-64: An efficient
mapping of OpenMP to a many-core system-on-a-chip. In: Proceedings of the
3rd ACM International Conference on Computing Frontiers, Ischia, Italy (2006)",
year = "2006",
url = "citeseer.ist.psu.edu/delcuvillo06landing.html" }
Citations (may not include all citations):
197
The performance of spin lock alternatives for shared-memory .. (context) - Anderson - 1990
74
Transactional memory: Architectural support for lock-free da..
- Herlihy, Eliot et al. - 1993
70
Dynamic decentralized cache schemes for MIMD parallel proces.. (context) - Rudolph, Segall - 1984
44
OpenMP FORTRAN application program interface (context) - Review - 2000
35
Lock-free linked lists using compare-and-swap
- Valois - 1995
30
application program interface (context) - Review, OpenMP - 2002
25
and practical non-blocking and blocking concurrent queue alg.. (context) - Michael, Scott et al. - 1996
25
Algorithms for scalable synchronization on shared-memory mul..
- Mellor-Crummey, Scott - 1991
21
Synchronization algorithms for shared-memory multiprocessors (context) - Graunke, Thakkar - 1990
18
Concurrent set manipulation without locking (context) - Lanin, Shasha - 1988
16
Non-blocking synchronization and system design
- Greenwald - 1999
13
Evaluating synchronization on shared address space multiproc..
- Kumar, Jiang et al. - 1999
11
A pragmatic implementation of non-blocking linked-lists
- Harris - 2001
9
Demonstrating the scalability of a molecular dynamics applic..
- Almasi, Cascaval et al. - 2002
8
Measuring synchronization and scheduling overheads in OpenMP (context) - Bull - 1999
7
High performance dynamic lock-free hash tables and list-base.. (context) - Michael - 2002
6
Performance evaluation of the Omni OpenMP compiler
- Kusano, Satoh et al. - 1940
4
Hazard pointers: Safe memory reclamation for lock-free objec.. (context) - Michael - 2004
4
Performance characteristics for OpenMP constructs on di#eren..
- Berrendorf, Nieken - 2000
3
and application benchmarks on a large symmetric multiprocess.. (context) - Fredrickson, Afsahi et al. - 2003
3
Principle of operation (context) - system, architecture - 1983
2
Evaluation of OpenMP for the Cyclops multithreaded architect..
- Almasi, Ayguade et al. - 2003
2
FAST: A functionally accurate simulation toolset for the Cyc..
- Cuvillo, Zhu et al. - 2005
2
Performance comparisons of basic OpenMP constructs (context) - Prabhakar, Getov et al. - 2002
2
CAS-based lock-free algorithm for shared deques (context) - Michael - 2003
2
A scalable lock-free stack algorithm (context) - Hendler, Shavit et al. - 2004
1
Nonblocking memory management support for dynamic-sized data.. (context) - Herlihy, Luchangco et al. - 2005
1
Optimizing NANOS OpenMP for the IBM Cyclops multithreaded ar.. (context) - Rodenas, Martorell et al. - 2005
1
Toward a software infrastructure for the Cyclops-64 cellular.. (context) - Cuvillo, Zhu et al. - 2006
Documents on the same site (http://www.capsl.udel.edu/~weirong/vita/cv/resume.html): More
Performance Characteristics of OpenMP Language Constructs on .. - Zhu, Cuvillo, Gao (2006)
(Correct)
A Cluster-Based Solution for High Performance Hmmpfam Using .. - Execution Model Weirong (2003)
(Correct)
Performance Portability on EARTH: A Case Study across.. - Architectures Weirong.. (2005)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC