See this document in CiteSeerX!

A Technology-Scalable Architecture for Fast Clocks and High ILP (2001)  (Make Corrections)  (14 citations)
Karthikeyan Sankaralingam, Ramadass Nagarajan, Stephen W. Keckler, Doug Burger



  Home/Search   Context   Related

 
View or download:
utexas.edu/users/cart/pu...tr0102.ps.Z
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  utexas.edu/users/cart/pubs (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: CMOS technology scaling poses challenges in designing dynamically scheduled cores that can sustain both high instruction-level parallelism and aggressive clock frequencies. In this paper, we present a new architecture that maps compiler-scheduled blocks onto a two-dimensional grid of ALUs. For the mapped window of execution, instructions execute in a dataflow-like manner, with each ALU forwarding its result along short wires to the consumers of the result. We describe our studies of program... (Update)

Cited by:   More
Spatial Computation - Mihai Budiu Girish (2004)   (Correct)
Dataflow: A Complement to Superscalar - Budiu, Artigas, Goldstein (2005)   (Correct)
A Common Machine Language for Grid-Based Architectures - William Thies Michal   (Correct)

Similar documents (at the sentence level):
64.4%:   A Technology-Scalable Architecture for Fast Clocks.. - Sankaralingam.. (2001)   (Correct)

Active bibliography (related documents):   More   All
2.2:   A Technology-Scalable Architecture For Fast Clocks.. - Sankaralingam.. (2001)   (Correct)
0.6:   A Design Space Evaluation of Grid Processor Architectures - Nagarajan.. (2001)   (Correct)
0.2:   The Need for Fast Communication in Hardware-Based.. - Krishnan, Torrellas (1999)   (Correct)

Similar documents based on text:   More   All
0.7:   List of Publications - Sankaralingam (2004)   (Correct)
0.6:   Exploiting ILP, TLP, and DLP with the Polymorphous .. - Sankaralingam.. (2003)   (Correct)
0.4:   Design and Analysis of Routed Inter-ALU Networks.. - Singh.. (2003)   (Correct)

Related documents from co-citation:   More   All
8:   Smart Memories: A Modular Recongurable Architecture (context) - Mai, Paaske et al. - 2000
8:   Space-Time Scheduling of Instruction-Level Parallelism on a Raw Machine - Lee, Barua et al. - 1998
8:   Baring it all to Software: The Raw Machine - Elliot, Taylor et al. - 1997

BibTeX entry:   (Update)

K. Sankaralingam, R. Nagarajan, D.C. Burger, and S.W. Keckler. A TechnologyScalable Architecture for Fast Clocks and High ILP. In 5th Workshop on the Interaction of Compilers and Computer Architecture, January 2001. http://citeseer.ist.psu.edu/article/sankaralingam01technologyscalable.html   More

@misc{ sankaralingam01technologyscalable,
  author = "K. Sankaralingam and R. Nagarajan and D. Burger and S. Keckler",
  title = "A TechnologyScalable Architecture for Fast Clocks and High ILP",
  text = "K. Sankaralingam, R. Nagarajan, D.C. Burger, and S.W. Keckler. A TechnologyScalable
    Architecture for Fast Clocks and High ILP. In 5th Workshop on the Interaction
    of Compilers and Computer Architecture, January 2001.",
  year = "2001",
  url = "citeseer.ist.psu.edu/article/sankaralingam01technologyscalable.html" }
Citations (may not include all citations):
269   Multiscalar processors - Sohi, Breach et al. - 1995
151   Baring it all to software: RAW machines - Waingold, Taylor et al. - 1997
133   Fine-grain parallelism with minimal hardware support: A comp.. - Culler, Sah et al. - 1991
125   Trace processors - Rotenberg, Jacobson et al. - 1997
117   Clock rate versus IPC: The end of the road for conventional .. - Agarwal, Hrishikesh et al. - 2000
104   for semiconductors. Semiconductor Industry Association (context) - technology - 1999
59   Very long instruction word architectures and the ELI (context) - Fisher - 1983
32   Dynamically scheduled VLIW processors - Rau - 1993
25   Increasing the instruction fetch rate via blockstructured in.. - Hao, Chang et al. - 1996
24   ective compiler support for predicated execution using the h.. (context) - Mahlke, Lin et al. - 1992
16   ective superscalar processors (context) - Palacharla, Jouppi et al. - 1997
12   Scaling of microarchitectural structures in future process t.. (context) - Agarwal, Keckler et al. - 2000
12   Technology independent area and delay estimates for micropro.. - Gupta, Keckler et al. - 2000
9   An empirical study of decentralized ILP execution models - Ranganathan, Franklin - 1998
6   Executing a program on the MIT Tagged-Token Data ow Architec.. (context) - Arvind - 1990
6   Transport Triggered Architectures (context) - Corporaal - 1995
5   Register queue new hardwaresoftware approach to ecient softw.. (context) - Tyson, queues et al. - 2000
4   Register trac analysis for streamlining inter-operation comm.. (context) - Franklin, Sohi - 1992
2   A preliminary architecture for a basic data- ow processor (context) - Dennis, Misunas - 1975
2   Using sacks to organize register les in VLIW machines (context) - Llosa, Valero et al. - 1994



The graph only includes citing articles where the year of publication is known.


Documents on the same site (http://www.cs.utexas.edu/users/cart/pubs.html):   More
Characterizing the SPHINX Speech Recognition System - Agaram, Keckler, Burger   (Correct)
SimpleScalar Simulation of the PowerPC Instruction.. - Sankaralingam.. (2000)   (Correct)
Technology Independent Area and Delay Estimates for.. - Gupta, Keckler, Burger (2000)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC