MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  28th HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES (Maui, HI) (winner: best paper) A Technique to Determine Power-Efficient, High-Performance Superscalar Processors

Download:
Download as a PDF | Download as a PS
by Thomas M. Conte, Kishore N. P. Menezes, Sumedh W. Sathaye
http://www4.ncsu.edu:8030/~swsathay/hicss95.ps
Add To MetaCart

Abstract:

Processor performance advances are increasingly inhibited by limitations in thermal power dissipation. Part of the problem is the lack of architectural power estimates before implementation. Although high-performance designs exist that dissipate low power, the method for finding these designs has been through trial-and-error. This paper presents systematic techniques to find low-power, high-performance superscalar processors tailored to specific user benchmarks. The model of power is novel because it separates power into architectural and technology components. The architectural component is found via trace-driven simulation, which also produces performance estimates. An example technology model is presented that estimates the technology component, along with critical delay time and real estate usage. This model is based on case studies of actual designs. It is used to solve an important problem: increasing the duplication in superscalar execution units without excessive power consumption. Results are presented from runs using simulated annealing to maximize processor performance subject to power and area constraints. The major contributions of this paper are the separation of architectural and technology components of dynamic power, the use of trace-driven simulation for architectural power measurement, and the use of a near-optimal search to tailor a processor design to a benchmark. 1

Citations

308 Low-power CMOS digital design – Chandrakasan, Sheng, et al. - 1992
274 Lockup-free instruction fetch/prefetch cache organisation – Kroft - 1981
171 Computer Arithmetic Algorithms – Koren - 1993
136 Two-Level Adaptive Training Branch Prediction – Yeh, Patt - 1991
42 The Alpha AXP architecture and 21064 processor – McLellan - 1993
39 The PowerPC 604 RISC microprocessor – Song, Denman, et al. - 1994
31 Branch Folding in the CRISP Microprocessor: Reducing Branch Delay to Zero – Ditzel, McLellan - 1987
31 Architecture of the Pentium Microprocessor – Alpert, Avnon - 1993
25 Systematic computer architecture prototyping – Conte - 1992
23 Performance features of the PA7100 microprocessor – Asprey, Averill, et al. - 1993
18 Computation of elementary functions on the IBM RISC System/6000 processor – Markstein - 1990
18 The IBM system/360 model 91 floating point execution unit – Anderson, Earle, et al. - 1967
17 Optimization by Simulated Annealing – Kirkpatric, Gelatt, et al.
12 Issues in trace collection through program instrumentation – Golden - 1991
10 Digital CMOS Circuit design – Annaratone - 1986
10 Computer Arithmetic Algorithms. Englewood Cli s – Koren - 1993
9 Determining cost-effective multiple issue processor designs – Conte, Mangione-Smith - 1993
3 Architectural resource requirements of contemporary benchmarks: A wish list – Conte - 1993
2 Determining cost-e ective multiple issue processor designs – Conte, Mangione-Smith - 1993