7 citations found. Retrieving documents...
Palacharla Subbaro, Jouppi Norman P., and Smith James E. Complexity-Effective Supperscalar Processors. 28th Annual International Symposium on Computer Architecture, 1997, pp. 206-218.

 Home/Search   Document Not in Database   Summary   Related Articles   Check  

This paper is cited in the following contexts:
Transistor Count and Chip-Space Estimation of.. - Steinhaus, Kolla, .. (2001)   (1 citation)  (Correct)

....are published without hardware complexity accounts. Some papers include transistor count estimations without explanation (e.g. 15] some give rough transistor count estimations (e.g. 8] and a few papers include detailed complexity estimations, in particular Palacharla, Jouppi, and Smith [13] and [14] Burns and Gaudiot [4] perform a rst step in our direction by estimating the layout area of simultaneous multithreaded (SMT) processors. They identify, which layout blocks are a ected by SMT, determine the scaling of chip space requirements, and compare SMT versus single threaded ....

Palacharla Subbaro, Jouppi Norman P., and Smith James E. Complexity-E ective Supperscalar Processors. 28th Annual International Symposium on Computer Architecture,


A Dynamic Multithreading Processor - Akkary, Driscoll   (72 citations)  (Correct)

....where these disruptions occur often, the execution throughput is well below a wide superscalar s peak bandwidth. Ideally, we need an uninterrupted instruction fetch supply to increase performance. Even then, there are other complexities that have to be overcome to increase execution throughput [2]. Register renaming requires dependency checking among instructions of the same block, and multiple read ports into the rename table. This logic increases in complexity as the width of the rename stage increases. A large pool of instructions is also necessary to find enough independent ....

S. Palacharla, N. Jouppi, J. E. Smith. Complexity-Effective Superscalar Processors. The 24 th Annual International Symposium on Computer Architecture, pp. 206-218, June 1997.


Increasing Effective IPC by Exploiting Distant.. - Martel, Ortega.. (1999)   (2 citations)  (Correct)

.... possible because of the characteristics of the study itself, that totally relax certain architectural constraints, e.g. perfect branch prediction or unbounded resources) Even without assuming limit conditions, the proposals may not be worth implementing, such as very large instruction windows [11]. The main way of increasing IPC, and therefore speeding up applications, has always been the exploitation of the inherent parallelism in programs, either using software techniques or hardware mechanisms. Although the majority of previous research in ILP focused on the performance of a single ....

S. Palacharla, N. Jouppi, and J.E. Smith. Complexity-effective superscalar processors. 24th Annual International Symposium on Computer Architecture, June 1996.


Quantifying the Benefits of SPECint Distant.. - Ortega, Martel..   (Correct)

....instructions. Nevertheless, the parallelism found dynamically by these techniques has been shown to be bounded. Recent studies show that augmenting the instruction window size, thus increasing the dynamic scope of the processor may not be cost effective, not only because of technological reasons [12] but also because of the characteristics of the programs themselves, which show a saturation in the speed up obtainable with highly aggressive architectures [15] To overcome the limitations that a unique control thread may impose, the exploitation of multiple flows of control has been proposed ....

....configurations for SPECint programs sequential parallel sequential parallel 0.8 1.0 1.2 1.4 Speed Up 16K Cache L1 32K Cache L1 64K Cache L1 Perfect Memory (normalised) compress95 go Figure 7. Normalised execution times of basic memory configurations for compress95 and go this field [12, 7], that introducing light weight threads in an in order processor may be less costly in terms of cycle time than augmenting the instruction window and issue width. If this could be stated, we could affirm that an extra benefit could be expected for in order multithreaded architectures. We have ....

S. Palacharla, N. Jouppi, and J. Smith. Complexity-effective superscalar processors. 24th Annual International Symposium on Computer Architecture, June 1996.


Prédiction de l'adresse des lectures pour tolérer la.. - Hai, Rochange.. (1997)   (Correct)

....deux caches s epar es, un pour les instructions et un pour les donn ees tandis que le deuxi eme niveau comporte un seul cache. Le cache de donn ees de premier niveau est de 32 Kilo octets. Il est a correspondance directe, a r e ecriture et a ecriture allou ee avec chargement sur ecriture [13]. Le lecteur peu familier trouvera une excellente description de son fonctionnement dans [7] Il supporte quatre acc es simultan es depuis le processeur grace a un entrela cage sur quatre bancs et est non bloquant [9] On suppose qu il peut supporter un nombre infini de requetes en attente (nous ....

....includes a L1 32 KB data cache. This cache is composed of 4 interleaved banks that allow four simultaneous accesses by the processor (one access per L S functional unit) provided there is no bank conflict. It is a write back direct mapped cache with fetch on write and write allocate policies[13]. As most of caches today, it is also non blocking [9] We suppose that it can support an infinite number of pending loads since measures show that restricting the number fo pending loads to 2 gives the same performance. The hit access time is 1 cycle. The L1 instruction cache is supposed to be ....

N.P. Jouppi, Cache Write Policies and Performance, 20th Annual International Symposium on Computer Architecture, June 1993.


Increasing Effective IPC by Exploiting Distant.. - Martel, Ortega.. (1999)   (2 citations)  (Correct)

.... possible because of the characteristics of the study itself, that totally relax certain architectural constraints, e.g. perfect branch prediction or unbounded resources) Even without assuming limit conditions, the proposals may not be worth implementing, such as very large instruction windows [PJS96] The main way of increasing IPC, and therefore speeding up applications, has always been the exploitation of the inherent parallelism of programs, either using software techniques or hardware mechanisms. Although the majority of previous research in ILP focused on the performance of a single ....

S. Palacharla, N. Jouppi, and J.E. Smith. Complexity-effective superscalar processors. 24th Annual International Symposium on Computer Architecture, June 1996.


Transistor Count and Chip-Space Estimation of Simulated.. - Marc Steinhaus Reiner (2001)   (1 citation)  (Correct)

No context found.

Palacharla Subbaro, Jouppi Norman P., and Smith James E. Complexity-Effective Supperscalar Processors. 28th Annual International Symposium on Computer Architecture, 1997, pp. 206-218.

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC