(Enter summary)
Abstract: The trace cache is a recently proposed solution to achieving high instruction fetch bandwidth by buffering and reusing dynamic instruction traces. This work presents a new block-based trace cache implementation that can achieve higher IPC performance with more efficient storage of traces. Instead of explicitly storing instructions of a trace, pointers to blocks constituting a trace are stored in a much smaller trace table. The block-based trace cache renames fetch addresses at the basic block... (Update)
Cited by: More
Single Instruction Fetch Does Not Inhibit - Instruction-Level Parallelism..
(Correct)
Execution Cache-Based Microarchitecture for Power-Efficient .. - Talpes, Marculescu (2005)
(Correct)
Improving Processor Performance through Compiler-Assisted Block.. - Huang (2000)
(Correct)
Active bibliography (related documents): More All
0.3: Improving Multiple-block Prediction in the Block-based Trace Cache - Rakvic (1999)
(Correct)
0.2: Completion Time Multiple Branch Prediction for Enhancing.. - Rakvic, Black, Shen (2000)
(Correct)
0.2: Hardware Optimizations Enabled by a Decoupled Fetch Architecture - Reinman (2001)
(Correct)
Similar documents based on text: More All
0.8: Using Dynamic Branch Behavior for Power-Efficient Instruction Fetch - Hu, al. (2003)
(Correct)
0.7: A Comparative Study of Redundancy in Trace Caches - Vandierendonck..
(Correct)
0.2: Effective Ahead Pipelining of Instruction Block Address.. - Andre Seznec And (2003)
(Correct)
Related documents from co-citation: More All
18: Trace Cache: A Low Latency Approach to High Bandwidth Instruction Fetching
- Rotenberg, Bennett et al. - 1996
11: Putting the fill unit to work: Dynamic optimizations for trace cache microproces..
- Friendly, Patel et al. - 1998
8: Exploiting Instruction Level Parallelism (context) - Nair, Hopkins - 1996
BibTeX entry: (Update)
Bryan Black, Bohuslav Rychlik, and John Paul Shen. The block-based trace cache. Proceedings of the 26th Annual Intl. Symposium on Computer Architecture, May 1999. http://citeseer.ist.psu.edu/92664.html More
@inproceedings{ black99,
author = "Bryan Black and Bohuslav Rychlik and John Paul Shen",
institution = "Department of Electrical and Computer Engineering,
Carnegie Mellon University, Pittsburgh, USA",
title = "The Block-based Trace Cache",
year = 1999,
pages = "196--207",
booktitle = "Proceedings of the 26th Annual Intl. Symposium on Computer Architecture",
month = may,
publisher = "IEEE Computer Society Press",
url = "citeseer.ist.psu.edu/92664.html" }
Citations (may not include all citations):
214
Combining Branch Predictors
- McFarling - 1993
183
Trace Cache: A Low Latency Approach to High Bandwidth Instru..
- Rotenberg, Bennett et al. - 1996
125
Trace Processors
- Rotenberg, Jacobson et al. - 1997
110
Improving the Accuracy of Dynamic Branch Prediction Using Br.. (context) - Pan, So et al. - 1992
93
Optimization of Instruction Fetch Mechanisms for High Issue ..
- Conte, Menezes et al. - 1995
75
Increasing the Instruction Fetch Rate via Multiple Branch Pr..
- Yeh, Marr et al. - 1993
54
Dynamic Path-based Branch Correlation (context) - Nair - 1995
40
MultipleBlock Ahead Branch Predictors
- Seznec, Jourdan et al. - 1996
39
Path-based Next Trace Prediction
- Jacobson, Rotenberg et al. - 1997
36
PowerPC 604 RISC Microprocessor User's Manual (context) - Division - 1994
29
Next Cache Line and Set Prediction
- Calder, Grunwald - 1995
25
Increasing the Instruction Fetch Rate via Block-structured I..
- Hao, Chang et al. - 1997
24
The PowerPC 604 RISC Microprocessor (context) - Song, Denman et al. - 1994
23
Calibration of Microprocessor Performance Models
- Black, Shen - 1998
18
The Effects of Mispredicted-Path Execution on Branch Predict.. (context) - Jourdan, Hsing et al. - 1996
18
Enhancing Instruction Scheduling with a Block-Structured ISA (context) - Melvin, Patt - 1995
15
Alternative Fetch and Issue Policies for the Trace Cache Fet..
- Friendly, Patel et al. - 1997
12
The PowerPC User Instruction Set Architecture (context) - Diefendorf, Silha - 1994
10
Multiple Branch and Block Prediction
- Wallace, Bagherzadeh - 1997
The graph only includes citing articles where the year of publication is known.
Documents on the same site (http://www.ece.cmu.edu/~cmuart/publications.html): More
Thread Integration for Error Detection and Performance - Dean (1997)
(Correct)
Load Execution Latency Reduction - Black, Mueller, Postal, Rakvic.. (1998)
(Correct)
Reducing Branch Misprediction Penalties Via Dynamic Control.. - Chou, Fung, Shen (1999)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC