| R. M. Tomasulo, "An efficient hardware algorithm for exploiting multiple arithmetic units," IBM Journal, vol. 11, pp. 25-33, 1967. |
....superscalar processors support code motion across basic block boundaries by looking ahead in the instruction stream, by buffering the internal state, and by executing instructions conditionally and out of order. These techniques were first utilized in the IBM Stretch [2] and IBM 360 91 [22]. These machines attempted to increase processor performance by hiding memory latency and decoupling instruction fetch from instruction execution. They provided internal buffering for issued but unexecuted instructions and extra state to keep track of the out of order results. Though these ....
....structures, but as explained below, its reorder buffer performs a similar function. Since MATCH performs all instruction scheduling in hardware, most of the complexity of the hardware is in the control logic and is not shown in the figure. In front of each functional unit is a reservation station [22]. The reservation stations are instruction buffers that disassociate the actual instruction fetch rate from the instruction execution rate. With this buffering, MATCH 4 Though Johnson did not name his models, we call one of his models MATCH because TORCH attempts to match its performance and ....
R.M. Tomasulo, "An Efficient Hardware Algorithm for Exploiting Multiple Arithmetic Units." IBM Journal
....Memory I Cache BTB Decoder Branch Register File ALU Shifter Data Memory D Cache Reorder Buffer Addr Data Unit Load Store Figure 3: Overall structure of Johnson s dynamically scheduled processor. consists of several independent function units. Each functional unit has a reservation station [25]. The reservation stations are instruction buffers that decouple instruction decoding from the instruction execution and allow for dynamic scheduling of instructions. Thus, the processor can execute instructions out of order though the instructions are fetched and decoded in program order. In ....
R. M. Tomasulo. An efficient hardware algorithm for exploiting multiple arithmetic units. IBM Journal, 11:25--33,
No context found.
R. M. Tomasulo, "An efficient hardware algorithm for exploiting multiple arithmetic units," IBM Journal, vol. 11, pp. 25-33, 1967.
No context found.
R. Tomasulo. An efficient hardware algorithm for exploiting multiple arithmetic units. IBM Journal, 44-5:25--33, Jan. 1967.
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC