(Enter summary)
Abstract: CMOS technology scaling poses challenges in designing dynamically scheduled cores that can sustain both high instruction-level parallelism and aggressive clock frequencies. In this paper, we present a new architecture that maps compiler-scheduled blocks onto a two-dimensional grid of ALUs. For the mapped window of execution, instructions execute in a dataflow-like manner, with each ALU forwarding its result along short wires to the consumers of the result. We describe our studies of program... (Update)
Cited by: More
Spatial Computation - Mihai Budiu Girish (2004)
(Correct)
Dataflow: A Complement to Superscalar - Budiu, Artigas, Goldstein (2005)
(Correct)
A Common Machine Language for Grid-Based Architectures - William Thies Michal
(Correct)
Similar documents (at the sentence level):
64.4%: A Technology-Scalable Architecture for Fast Clocks.. - Sankaralingam.. (2001)
(Correct)
Active bibliography (related documents): More All
2.2: A Technology-Scalable Architecture For Fast Clocks.. - Sankaralingam.. (2001)
(Correct)
0.6: A Design Space Evaluation of Grid Processor Architectures - Nagarajan.. (2001)
(Correct)
0.2: The Need for Fast Communication in Hardware-Based.. - Krishnan, Torrellas (1999)
(Correct)
Similar documents based on text: More All
0.7: List of Publications - Sankaralingam (2004)
(Correct)
0.6: Exploiting ILP, TLP, and DLP with the Polymorphous .. - Sankaralingam.. (2003)
(Correct)
0.4: Design and Analysis of Routed Inter-ALU Networks.. - Singh.. (2003)
(Correct)
Related documents from co-citation: More All
8: Smart Memories: A Modular Recongurable Architecture (context) - Mai, Paaske et al. - 2000
8: Space-Time Scheduling of Instruction-Level Parallelism on a Raw Machine
- Lee, Barua et al. - 1998
8: Baring it all to Software: The Raw Machine
- Elliot, Taylor et al. - 1997
BibTeX entry: (Update)
K. Sankaralingam, R. Nagarajan, D.C. Burger, and S.W. Keckler. A TechnologyScalable Architecture for Fast Clocks and High ILP. In 5th Workshop on the Interaction of Compilers and Computer Architecture, January 2001. http://citeseer.ist.psu.edu/article/sankaralingam01technologyscalable.html More
@misc{ sankaralingam01technologyscalable,
author = "K. Sankaralingam and R. Nagarajan and D. Burger and S. Keckler",
title = "A TechnologyScalable Architecture for Fast Clocks and High ILP",
text = "K. Sankaralingam, R. Nagarajan, D.C. Burger, and S.W. Keckler. A TechnologyScalable
Architecture for Fast Clocks and High ILP. In 5th Workshop on the Interaction
of Compilers and Computer Architecture, January 2001.",
year = "2001",
url = "citeseer.ist.psu.edu/article/sankaralingam01technologyscalable.html" }
Citations (may not include all citations):
269
Multiscalar processors
- Sohi, Breach et al. - 1995
151
Baring it all to software: RAW machines
- Waingold, Taylor et al. - 1997
133
Fine-grain parallelism with minimal hardware support: A comp..
- Culler, Sah et al. - 1991
125
Trace processors
- Rotenberg, Jacobson et al. - 1997
117
Clock rate versus IPC: The end of the road for conventional ..
- Agarwal, Hrishikesh et al. - 2000
104
for semiconductors. Semiconductor Industry Association (context) - technology - 1999
59
Very long instruction word architectures and the ELI (context) - Fisher - 1983
32
Dynamically scheduled VLIW processors
- Rau - 1993
25
Increasing the instruction fetch rate via blockstructured in..
- Hao, Chang et al. - 1996
24
ective compiler support for predicated execution using the h.. (context) - Mahlke, Lin et al. - 1992
16
ective superscalar processors (context) - Palacharla, Jouppi et al. - 1997
12
Scaling of microarchitectural structures in future process t.. (context) - Agarwal, Keckler et al. - 2000
12
Technology independent area and delay estimates for micropro..
- Gupta, Keckler et al. - 2000
9
An empirical study of decentralized ILP execution models
- Ranganathan, Franklin - 1998
6
Executing a program on the MIT Tagged-Token Data ow Architec.. (context) - Arvind - 1990
6
Transport Triggered Architectures (context) - Corporaal - 1995
5
Register queue new hardwaresoftware approach to ecient softw.. (context) - Tyson, queues et al. - 2000
4
Register trac analysis for streamlining inter-operation comm.. (context) - Franklin, Sohi - 1992
2
A preliminary architecture for a basic data- ow processor (context) - Dennis, Misunas - 1975
2
Using sacks to organize register les in VLIW machines (context) - Llosa, Valero et al. - 1994
The graph only includes citing articles where the year of publication is known.
Documents on the same site (http://www.cs.utexas.edu/users/cart/pubs.html): More
Characterizing the SPHINX Speech Recognition System - Agaram, Keckler, Burger
(Correct)
SimpleScalar Simulation of the PowerPC Instruction.. - Sankaralingam.. (2000)
(Correct)
Technology Independent Area and Delay Estimates for.. - Gupta, Keckler, Burger (2000)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC