See this document in CiteSeerX!

A Large, Fast Instruction Window for Tolerating Cache Misses  (Make Corrections)  
Alvin R. Lebeck, Jinson Koppanalil, Tong Li, Jaidev Patwardhan, Eric Rotenberg



  Home/Search   Context   Related

 
View or download:
duke.edu/~alvy/papers/wib.pdf
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  duke.edu/~alvy/pubs (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: Instruction window size is an important design parameter for many modern processors. Large instruction windows offer the potential advantage of exposing large amounts of instruction level parallelism. Unfortunately, naively scaling conventional window designs can significantly degrade clock cycle time, undermining the benefits of increased parallelism. This paper presents a new instruction window design targeted at achieving the latency tolerance of large windows with the clock cycle time of... (Update)

Active bibliography (related documents):   More   All
2.3:   A Large, Fast Instruction Window for Tolerating Cache Misses - Alvin Lebeck Jinson   (Correct)
0.4:   Exploring Wakeup-Free Instruction Scheduling - Jie Hu Vijaykrishnan   (Correct)
0.4:   Half-Price Architecture - Kim, Lipasti (2003)   (Correct)

Similar documents based on text:   More   All
0.3:   A Case for Dynamic Pipeline Scaling - Jinson Koppanalil Prakash   (Correct)
0.2:   Power Aware Page Allocation - Alvin Lebeck Dept (2000)   (Correct)
0.2:   Load Latency Tolerance In Dynamically Scheduled Processors - Srinivasan, Lebeck (1999)   (Correct)

BibTeX entry:   (Update)

@misc{ lebeck-large,
  author = "Alvin R. Lebeck and Jinson Koppanalil and Tong Li and Jaidev Patwardhan
    and Eric Rotenberg",
  title = "A Large, Fast Instruction Window for Tolerating Cache Misses",
  url = "citeseer.ist.psu.edu/615984.html" }
Citations (may not include all citations):
269   Multiscalar Processors - Sohi, Breach et al. - 1995
177   Evaluating Future Microprocessors-the SimpleScalar Tool Set - Burger, Austin et al. - 1996
175   ComplexityEffective Superscalar Processors - Palacharla, Jouppi et al. - 1997
150   An Efficient Algorithm for Exploiting Multiple Arithmetic Un.. (context) - Tomasulo - 1967
132   The Alpha 21264 Microprocessor (context) - Kessler - 1999
125   Trace Processors - Rotenberg, Jacobson et al. - 1997
117   Clock Rate Versus IPC: The End of the Road for Conventional .. - Agarwal, Hrishikesh et al. - 2000
74   Instruction Issue Logic for High-Performance (context) - Sohi - 1990
72   A Dynamic Multithreading Processor - Akkary, Driscoll - 1998
41   POWER4 System Microarchitecture (context) - Tendler, Dodson et al. - 2001
40   Power and Energy Reduction via Pipeline Balancing (context) - Bahar, Manne - 2001
34   Multiple-Banked Register File Architectures (context) - Cruz, Gonzalez et al. - 2000
33   Energy-Effective Issue Logic (context) - Folegnani, Gonzalez - 2001
32   SPEC CPU (context) - Henning
31   Understanding the Backward Slices of Performance Degrading I.. - Zilles, Sohi - 2000
30   Improving Prediction for Procedure Returns with ReturnAddres.. - Skadron, Ahuja et al. - 1998
30   Early Experiences with Olden (context) - Carlisle, Rogers et al. - 1993
30   Focusing Processor Policies via Critical-Path Prediction - Fields, Rubin et al. - 2001
27   and Cache Size: Performance Tradeoffs and Simulation Techniq.. (context) - Skadron, Ahuja et al. - 1999
26   Alpha 21264 Microprocessor Hardware Reference Manual (context) - Corporation - 1999
17   Circuits for Wide-Window Superscalar Processors - Henry, Kuszmaul et al. - 2000
17   Reducing the Complexity of the Register File in Dynamic Supe.. (context) - Balasubramonian, Dwarkadas et al. - 2001
17   A Low-Complexity Issue Logic (context) - Canal, Gonzalez - 2000
16   Dynamically Allocating Processor Resources Between Nearby an.. - Balasubramonian, Dwarkadas et al. - 2001
15   TwoLevel Hierarchical Register File Organization For VLIW Pr.. - Zalamea, Llosa et al. - 2000
15   Memory Behavior of the SPEC (context) - Sair, Charney
15   Select-Free Instruction Scheduling Logic - Brown, Stark et al. - 2001
12   Issue Logic for a 600-MHz Out-of-Order Execution Microproces.. (context) - Farrell, Fischer - 1998
12   Fast Instruction Window for Tolerating Cache Misses (context) - Li, Koppanalil et al. - 2002
10   Data-flow Prescheduling for Large Instruction Windows in Out.. - Michaud, Seznec - 2001
7   Scalable Register Renaming via the Quack Register File - Black, Shen - 2000
6   Reducing the Complexity of the Issue Logic (context) - Canal, Gonzalez - 2001
4   A Design Space Evaluation of Grid Processor Architectures (context) - Sankaralingam, Nagarajan et al. - 2001
3   Department of Computer Science (context) - Skadron, Removing et al. - 1999
3   Alpha Binaries (context) - Weaver

Documents on the same site (http://www.cs.duke.edu/~alvy/pubs.html):   More
Nonlinear Array Layouts for Hierarchical Memory Systems - Chatterjee, Jain.. (1999)   (Correct)
Dynamic Self-Invalidation: Reducing Coherence Overhead in.. - Lebeck, Wood (1995)   (Correct)
Architecture-efficient Strassen's Matrix.. - Pauca, Sun.. (1998)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC