Download:
by Hai Li, Swarup Bhunia, Yiran Chen, T. N. Vijaykumar, Kaushik Roy
In Proc. of 9 th Int’l Symp. on High Performance Computer Architecture (HPCA
http://www.cs.arizona.edu/hpca9/HPCA-PDFs/11-li.pdf
Add To MetaCart
Abstract:
With the scaling of technology and the need for higher performance and more functionality, power dissipation is becoming a major bottleneck for microprocessor designs. Pipeline balancing (PLB), a previous technique, is essentially a methodology to clockgate unused components whenever a program’s instruction-level parallelism is predicted to be low. However, no non-predictive methodologies are available in the literature for efficient clock gating. This paper introduces deterministic clock gating (DCG) based on the key observation that for many of the stages in a modern pipeline, a circuit block’s usage in a specific cycle in the near future is deterministically known a few cycles ahead of time. Our experiments show an average of 19.9% reduction in processor power with virtually no performance loss for an 8-issue, out-of-order superscalar processor by applying DCG to execution units, pipeline latches, D-Cache wordline decoders, and result bus drivers. In contrast, PLB achieves 9.9 % average power savings at 2.9 % performance loss. 1.
Citations
|
577
|
Wattch: A Framework for Architectural-Level Power Analysis and Optimizations
– Brooks, Tiwari, et al.
- 2000
|
|
344
|
Complexityeffective superscalar processors
– Palacharla, Jouppi, et al.
- 1997
|
|
206
|
Pipeline Gating: Speculation Control for Energy Reduction
– Manne, Klauser, et al.
- 1998
|
|
176
|
Selective cache ways: On-demand cache resourceallocation
– Albonesi
- 1999
|
|
145
|
The Filter Cache: An Energy Efficient Memory Structure
– Kin, Gupta, et al.
- 1997
|
|
140
|
A low-complexity issue logic
– Canal, Gonzalez
- 2000
|
|
108
|
Cache Design Trade-offs for Power and Performance Optimization: A Case Study
– Su, Despain
- 1995
|
|
103
|
Dynamically Exploiting Narrow Width Operands to Improve Processor Power and Performance
– Brooks, Martonosi
- 1999
|
|
69
|
Power and energy reduction via pipeline balancing
– Bahar, Manne
|
|
49
|
Power considerations in the design of the Alpha 21264 microprocessor
– Gowan, Biro, et al.
- 1998
|
|
22
|
Value-Based Clock Gating and Operation Packing: Dynamic Strategies for Improving Processor Power and Performance
– Brooks, Martonosi
- 2000
|
|
13
|
Reducing set-associative cache energy via selective direct-mapping and way prediction
– Powell, Agrawal, et al.
- 2001
|
|
10
|
Using dynamic management techniques to reduce energy in highperformance processors
– Bellas, Hajj, et al.
- 1999
|
|
3
|
Power optimization using dynamic power management
– Monteiro
- 1999
|
|
2
|
Cacti 2.0: An enhanced access and cycle time model for on-chip caches”, http://research.compaq.com/wrl/people/jouppi/CACTI.html
– Rienman, Jouppi
- 2001
|
|
1
|
Pre-compiled little-endian Alpha
– Weaver
|