• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations

Tools

Sorted by:
Try your query at:
Semantic Scholar Scholar Academic
Google Bing DBLP
Results 11 - 20 of 1,096
Next 10 →

Single Instruction Stream Parallelism Is Greater than Two

by Michael Butler, Tse-yu Yeh, Yale Patt, Mitch Alsup, Hunter Scales, Michael Shebanow , 1991
"... Recent studies have concluded that little parallelism (less than two operations per cycle) is available in single instruction streams. Since the amount of available parallelism should influence the design of the processor, it is important to verify how much parallelism really exists. In this study w ..."
Abstract - Add to MetaCart
Recent studies have concluded that little parallelism (less than two operations per cycle) is available in single instruction streams. Since the amount of available parallelism should influence the design of the processor, it is important to verify how much parallelism really exists. In this study

Dynamo: A Transparent Dynamic Optimization System

by Vasanth Bala, Evelyn Duesterwald , Sanjeev Banerjia - ACM SIGPLAN NOTICES , 2000
"... We describe the design and implementation of Dynamo, a software dynamic optimization system that is capable of transparently improving the performance of a native instruction stream as it executes on the processor. The input native instruction stream to Dynamo can be dynamically generated (by a JIT ..."
Abstract - Cited by 479 (2 self) - Add to MetaCart
We describe the design and implementation of Dynamo, a software dynamic optimization system that is capable of transparently improving the performance of a native instruction stream as it executes on the processor. The input native instruction stream to Dynamo can be dynamically generated (by a JIT

Multiple Instruction Stream Control for an Associative Model of Parallel Computation

by M. Scherger, J. Baker, J. Potter - in Proc. of the 16th International Parallel and Distributed Processing Symposium (Workshop in Massively Parallel Processing , 2003
"... This paper describes a system software design for multiple instruction stream control in a massively parallel associative computing environment. The purpose of providing multiple instruction stream control is to increase throughput and reduce the amount of parallel slackness inherent in single instr ..."
Abstract - Cited by 3 (2 self) - Add to MetaCart
This paper describes a system software design for multiple instruction stream control in a massively parallel associative computing environment. The purpose of providing multiple instruction stream control is to increase throughput and reduce the amount of parallel slackness inherent in single

Instruction Stream Mutation for Non-Deterministic Processors

by Irwin Page Smart, J. Irwin, N. P. Smart - In Preprint
"... Differential power analysis (DPA) has become a real-world threat to the security of cryptographic hardware devices such as smart-cards. By using cheap and readily available equipment, attacks can easily compromise algorithms running on these devices in a non-invasive manner. Adding non-determinism t ..."
Abstract - Cited by 21 (2 self) - Add to MetaCart
Differential power analysis (DPA) has become a real-world threat to the security of cryptographic hardware devices such as smart-cards. By using cheap and readily available equipment, attacks can easily compromise algorithms running on these devices in a non-invasive manner. Adding non-determinism to the execution of cryptographic algorithms has been proposed as a defence against these attacks. One way of achieving this non-determinism is to introduce random additional operations to the algorithm which produce noise in the power profile of the device. We describe the addition of a specialised processor pipeline stage which increases the level of potential non-determinism and hence guards against the revelation of secret information.

Peephole Optimisation of the KROC target instruction stream

by T.M. Sheen
"... This report details the implimentation of a peephole optimiser stage for KROC (Kent Retargetable Occam Compiler)(Wood and Welch, 1996). 1 Introduction The KROC compiler has been ported to a variety of different architectures, e.g. Sparc, Alpha, PowerPC and Sharc. Benchmarking has shown that the p ..."
Abstract - Add to MetaCart
This report details the implimentation of a peephole optimiser stage for KROC (Kent Retargetable Occam Compiler)(Wood and Welch, 1996). 1 Introduction The KROC compiler has been ported to a variety of different architectures, e.g. Sparc, Alpha, PowerPC and Sharc. Benchmarking has shown that the performance of KROC is within a factor of between 2 and 4 of optimised C or assembly code. This performance bottleneck has previously been tackled by writing critical sections of code in either C or assembly language, and linking them with main OCCAM program. This situation is not Ideal, as it requires code profiling and considerable effort to hand code the critical regions. There has been speculation that significant benefit would be gained from peephole optimisation of the target assembly code produced by KROC. The report details an investigation into the potential gains of this target code optimisation. The compilation process for a KROC program has several stages, occ21 compiles OCCAM in...

Design and Implementation of a Multiple-Instruction-Stream Multiple-Execution-Pipeline Architecture

by Yamin Li And, Yamin Li, Wanming Chu - In Seventh IASTED International Conference on Parallel and Distributed Computing and Systems , 1995
"... This paper describes a single chip MultipleInstruction -Stream Multiple-Execution-Pipeline (MISMEP) architecture capable of improving processor throughput. The MIS-MEP architecture uses multiple instruction dispatch/branch units (slots) to dispatch instructions from multiple instruction streams. Mul ..."
Abstract - Add to MetaCart
This paper describes a single chip MultipleInstruction -Stream Multiple-Execution-Pipeline (MISMEP) architecture capable of improving processor throughput. The MIS-MEP architecture uses multiple instruction dispatch/branch units (slots) to dispatch instructions from multiple instruction streams

Exploiting Multi-Grained Parallelism For Multiple-Instruction-Stream Architectures

by Christopher John Newburn , 1997
"... Exploiting parallelism is an essential part of maximizing the performance of an application on a parallel computer. Parallelism is traditionally exploited at two granularities: individual operations are executed in parallel within a processor to exploit instruction-level parallelism and loop iterati ..."
Abstract - Cited by 7 (0 self) - Add to MetaCart
iterations or processes are executed in parallel on different processors to exploit loop-level parallelism and process-level parallelism. A new generation of architectures that execute multiple instruction streams on a single chip has the potential of significantly reducing the gap between communication

Design Alternatives For Caching Long Regions Of The Dynamic Instruction Stream

by Matthew Martin Crum , 2001
"... Noncontiguous control flow challenges high-bandwidth execution in microprocessors by prematurely terminating a fetch to less than a full fetch width. To deal with this problem, methods have been devised ranging from branch prediction schemes to compiler techniques for reducing taken control flow to ..."
Abstract - Cited by 4 (1 self) - Add to MetaCart
to hardware mechanisms for caching dynamic traces from the instruction stream. Recently, a technique to form long instruction sequences called frames using branch promotion has been proposed. Frames are instruction entities that can grow to be very long and must be cached as atomic units.

Design and Implementation of a Multiple-Instruction-Stream Multiple-Execution-Pipeline Architecture

by Yamin Li, Wanming Chu - In Seventh IASTED International Conference on Parallel and Distributed Computing and Systems , 1995
"... This paper describes a single chip MultipleInstruction -Stream Multiple-Execution-Pipeline (MISMEP) architecture capable of improving processor throughput. The MIS-MEP architecture uses multiple instruction dispatch/branch units (slots) to dispatch instructions from multiple instruction streams. Mul ..."
Abstract - Cited by 1 (1 self) - Add to MetaCart
This paper describes a single chip MultipleInstruction -Stream Multiple-Execution-Pipeline (MISMEP) architecture capable of improving processor throughput. The MIS-MEP architecture uses multiple instruction dispatch/branch units (slots) to dispatch instructions from multiple instruction streams

The Basic Block Reassembling Instruction Stream Buffer with LWBTB for X86 ISA

by Jih-ching Chiu, Yu-liang Chou, Tseng-kuei Lin
"... The potential performance of superscalar processors can be exploited only when processor is fed with sufficient instruction bandwidth. The front-end units, the Instruction Stream Buffer (ISB) and the fetcher, are the key elements for achieving this goal. Current ISBs could not support instruction st ..."
Abstract - Add to MetaCart
The potential performance of superscalar processors can be exploited only when processor is fed with sufficient instruction bandwidth. The front-end units, the Instruction Stream Buffer (ISB) and the fetcher, are the key elements for achieving this goal. Current ISBs could not support instruction
Next 10 →
Results 11 - 20 of 1,096
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University