Results 11 - 20
of
68
Refining Instruction Set Architecture for High-Performance Multimedia Processing in Constrained Environments
- PROCEEDINGS OF IEEE INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS
, 2002
"... Multimedia processing in software has been significantly accelerated by the addition of subword-parallel instructions to the instruction set architectures (ISAs) of modern microprocessors. While some of these multimedia instructions are simple and effective, others are very complex, requiring large, ..."
Abstract
-
Cited by 5 (2 self)
- Add to MetaCart
the use of PLX's architectural features with four frequently used multimedia kernels: discrete cosine transform, pixel padding, clip test and median filter. Our performance results show that a 64-bit PLX implementation achieves significant speedups compared to a basic 64-bit RISC processor and to IA
Hardware/Software Organization of a High Performance ATM Host Interface
- IEEE Journal on Selected Areas in Communications
, 1993
"... Concurrent increases in network bandwidths and processor speeds have created a performance bottleneck at the workstation-to-network host interface . This is especially true for BISDN networks where the fixed length ATM cell is mismatched with application requirements for data transfer; a successful ..."
Abstract
-
Cited by 78 (17 self)
- Add to MetaCart
Concurrent increases in network bandwidths and processor speeds have created a performance bottleneck at the workstation-to-network host interface . This is especially true for BISDN networks where the fixed length ATM cell is mismatched with application requirements for data transfer; a successful
A high-performance distributed object-oriented system: The MUSHROOM Project
, 1991
"... Instruction Set. An assembler for this instruction set was built, and a number of programs written to evaluate the architecture and to test ideas. Ideas and results from this abstract machine were used in the design of the real hardware, and the implementation of the simulator assisted in the desig ..."
Abstract
- Add to MetaCart
in the design of the later simulator and other software tools. 2.5.7 Other Issues Both technological and architectural improvements to conventional RISC processors have been progressing at a phenomenal rate. The resulting performance is considerable and still improving rapidly, with techniques such as "
An Analysis of Loop Permutation on the HP PA-RISC
, 1995
"... this report, we present an experiment with compiler optimizations to improve data locality based on a simple cost model [14]. The model computes both temporal and spatial reuse of cache lines to find desirable loop permutations. The cost model drives the application of compound transformations consi ..."
Abstract
-
Cited by 1 (1 self)
- Add to MetaCart
, in addition, simulated cache hit rates. We collected statistics about the inherent data-locality characteristics of these programs and our ability to improve their locality. Performance improvements were difficult to achieve because benchmark programs typically have high hit rates; however, our optimizations
4th Generation 64-bit PowerPC-Compatible Commercial Processor Design Authors
"... IBM's NorthStar superscalar RISC microprocessor integrates high-bandwith and short pipe depth with low latency and zero cycle branch mispredict penalty into a fully scalable 64-bit PowerPC-compatible symmetric multiprocessor (SMP) implementation. Based on PowerPC architecture, the first in the ..."
Abstract
- Add to MetaCart
IBM's NorthStar superscalar RISC microprocessor integrates high-bandwith and short pipe depth with low latency and zero cycle branch mispredict penalty into a fully scalable 64-bit PowerPC-compatible symmetric multiprocessor (SMP) implementation. Based on PowerPC architecture, the first
MPFUN: A Portable High Performance Multiprecision Package
, 1990
"... The author has written a package of Fortran routines that perform a variety of arithmetic operations and transcendental functions on floating point numbers of arbitrarily high precision, including large integers. This package features (1) virtually universal portability, (2) high performance, especi ..."
Abstract
-
Cited by 50 (4 self)
- Add to MetaCart
The author has written a package of Fortran routines that perform a variety of arithmetic operations and transcendental functions on floating point numbers of arbitrarily high precision, including large integers. This package features (1) virtually universal portability, (2) high performance
A 3.8ms Latency Correlation Tracker for Active Mirror Control Based on a Reconfigurable Interface to a Standard Workstation
"... We describe the use of a reconfigurable interface board based on FPGAs and a UNIX workstation to implement a correlation tracker with 3.8ms latency. The correlation tracker is part of an active mirror system in use at the Swedish Vacuum Solar Telescope, La Palma, Canary Islands. The reconfigurable ..."
Abstract
-
Cited by 5 (4 self)
- Add to MetaCart
interface is used to leverage the workstation CPU, relieving it of tasks that it performs poorly such as rapid context switching and low-level bit manipulation. The reconfigurable interface handles control of external devices, high-performance input (16 MB/s) and data preformatting. The workstation CPU, a
Optimal Extension Fields for Fast Arithmetic in Public-Key Algorithms
, 1998
"... Abstract. This contribution introduces a class of Galois field used to achieve fast finite field arithmetic which we call an Optimal Extension Field (OEF). This approach is well suited for implementation of publickey cryptosystems based on elliptic and hyperelliptic curves. Whereas previous reported ..."
Abstract
-
Cited by 76 (14 self)
- Add to MetaCart
reported optimizations focus on finite fields of the form GF (p) and GF (2 m), an OEF is the class of fields GF (p m), for p a prime of special form and m a positive integer. Modern RISC workstation processors are optimized to perform integer arithmetic on integers of size up to the word size
High Performance Integer Optimization for Crew Scheduling
, 1999
"... Performance aspects of a Lagrangian relaxation based heuristic for solving large 0-1 integer linear programs are discussed. In particular, we look at its application to airline and railway crew scheduling problems. We present a scalable parallelization of the original algorithm used in productio ..."
Abstract
-
Cited by 4 (1 self)
- Add to MetaCart
Performance aspects of a Lagrangian relaxation based heuristic for solving large 0-1 integer linear programs are discussed. In particular, we look at its application to airline and railway crew scheduling problems. We present a scalable parallelization of the original algorithm used
unknown title
"... Abstract: In this paper, the Field Programmable Gate Array (FPGA) based 64-bit RISC processor with built-inself test (BIST) feature implemented using VHDL and was, in turn, verified on Xilinx ISE simulator. The VHDL code supports FPGA, System-On-Chip (SOC), and Spartan 3E kit. This paper also presen ..."
Abstract
- Add to MetaCart
presents the architecture, data path and instruction set (IS) of the RISC processor. The 64-bit processors, on the other hand, can address enormous amounts of memory up to 16 Exabyte‟s. The proposed design can find its applications in high configured robotic work-stations such as, portable pong gaming kits
Results 11 - 20
of
68