Decoupled Pipelines: Rationale, Analysis, and Evaluation
Abstract:
This paper analyzes the potential of asynchronous, decoupled pipelines from an architectural viewpoint. Previous work in this area has been implementation oriented and has concentrated on developing circuit models and design tools needed to actually build a simple clockless processor. This past work has shown that asynchronous processing is a viable approach, with several inherent advantages over synchronous processing. This paper breaks from this implementation focus and instead considers what is gained architecturally when the globally synchronous clock is removed. We first present the design of a decoupled, self-timed asynchronous architecture in which each pipeline component is able to determine its own operating speed. Our design includes micropipelines between stages to provide extra elasticity, allowing the pipeline to dynamically adjust to long latency operations. We then show how this organization enables a new style of prediction-based optimizations that is not possible in a synchronous processor. Finally, we provide preliminary results of a complete VHDL simulator which demonstrate that these optimizations can carry a significant performance advantage. 1
Citations
| 115 | Asynchronous Design Methodologies: An Overview – Hauck - 1995 |
| 47 | Reducing power in high-performance microprocessors – Tiwari, Singh, et al. - 1998 |
| 12 | A Low-Latency FIFO for MixedClock Systems – Chelcea, Nowick - 2000 |
| 7 | et al., “The Design of an Asynchronous – Martin - 1997 |
| 5 | Asynchronous Processor Survey – Werner, Akella - 1997 |
| 5 | Self-Timed Carry-Lookahead Adders – Cheng, Unger, et al. - 2000 |
| 2 | An Architecture for a Self-Timed Decoupled Computer – Fred - 1996 |
| 2 | Modeling and Simulation of Asynchronous Systems Using the LARD Hardware Description Language – Endecott, Furber - 1998 |
| 1 | A Comparative Power Analysis of an Asynchronous – Efthymiou, Temple - 2001 |
| 1 | et al. TITAC-2: An Asynchronous 32-bit Microprocessor based on Scalable -Delay-Insensitive Model. ICCD – Takamura, Kuwako - 1997 |
| 1 | Validating the AMULET Microprocessors – Furber - 2001 |
| 1 | Graphics Corporation. Renoir Version 99.1 (Build 26 – Mentor |

