• Documents
  • Authors
  • Tables
  • Other Seers ▼
    RefSeer AckSeer CollabSeer SeerSeer
  • Log in
  • Sign up
  • MetaCart

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

Achieving High Levels of instruction-Level Parallelism with Reduced Hardware Complexity (1994)

by B Rau Schlansker, S Mahlke
Add To MetaCart

Tools

Sorted by:
Results 11 - 15 of 15

Register Saturation in Data Dependence Graphs

by Sid-ahmed-ali Touati, François Thomasset - RESEARCH REPORT RR-3978, INRIA , 2000
"... Register constraints in ILP scheduling can be taken into account during the scheduling phase of a code. The complexity of this problem is very high. In this work, we present a new approach consisting in manipulating data dependence graphs to reduce the number of "potential" values simultaneously ali ..."
Abstract - Cited by 1 (1 self) - Add to MetaCart
Register constraints in ILP scheduling can be taken into account during the scheduling phase of a code. The complexity of this problem is very high. In this work, we present a new approach consisting in manipulating data dependence graphs to reduce the number of "potential" values simultaneously alive without assuming any schedule. We study theoretically the exact upper-bound of the register need for all valid schedules of a code : we call this limit the register saturation. It is used to build a modified data dependence graph such that any schedule of this graph will verify the register constraints and avoid introducing spill code. We study the case of Direct Acyclic Graphs and then we extend it to loops intended to software pipelining schedule. Experimental study shows that many DAGs and loops do not need register constraints during scheduling.

The Simulation and Performance Monitoring Environment of Trimaran

by unknown authors
"... This chapter describes the structure and operation of the Trimaran HPL-PD simulation environment. It also describes the performance monitoring framework interface, and the development of analysis tools. ..."
Abstract - Add to MetaCart
This chapter describes the structure and operation of the Trimaran HPL-PD simulation environment. It also describes the performance monitoring framework interface, and the development of analysis tools.

A Co-simulation Study of Adaptive EPIC Computing

by Valentin Stefan Gheorghita, Valentin Stefan, Gheorghita Weng-fai Wong, Tulika Mitra, Surendranath Talla , 2002
"... Reconfigurable computing offers the embedded systems designers the flexibility of application specific optimizations on a generic platform. In this paper, we are concerned with a fine-grain, tightly coupled, dynamically reconfigurable architecture we call Adaptive EPIC. A generic EPIC architecture i ..."
Abstract - Add to MetaCart
Reconfigurable computing offers the embedded systems designers the flexibility of application specific optimizations on a generic platform. In this paper, we are concerned with a fine-grain, tightly coupled, dynamically reconfigurable architecture we call Adaptive EPIC. A generic EPIC architecture is augmented with a dynamically reconfigurable structure. In this paper, we describe an experimental setup to evaluate the performance of such a processor. Our results show that such architecture can offer significant performance improvements for low jequency, and hence low power, core processors.

Very Large Instruction Word Architectures

by Binu Mathew What, Binu K. Mathew
"... this article discusses the technology, history, uses and the future of such processors ..."
Abstract - Add to MetaCart
this article discusses the technology, history, uses and the future of such processors

Constraint Satisfaction for Relative Location Assignment and Scheduling

by Carlos Alba-pinto, Bart Mesman, Jochen Jess
"... Tight data- and timing constraints are imposed by communication and multimedia applications. The architecture for the embedded processor imply resource constraints. Instead of random-access registers, relative location storages or rotating register files are used to exploit the available parallelism ..."
Abstract - Add to MetaCart
Tight data- and timing constraints are imposed by communication and multimedia applications. The architecture for the embedded processor imply resource constraints. Instead of random-access registers, relative location storages or rotating register files are used to exploit the available parallelism of resources by means of reducing the initiation interval in pipelined schedules. Therefore, the compiler or synthesis tool must deal with the difficult tasks of scheduling of operations and location assignment of values while respecting all the constraints including the storage file capacity. This paper presents a method that handles constraints of relative location storages during scheduling together with timing and resource constraints. The characteristics of the coloring of conflict graphs, representing the relative overlap of value instances, are analyzed in order to identify the bottlenecks for location assignment with the aim of serializing their lifetimes. This is done with pairs of loop instances of values until it can be guaranteed that all constraints will be satisfied. Experiments show that high quality schedules for kernels and inner loops can be efficiently obtained. 1.
The National Science Foundation
  • About CiteSeerX
  • Submit Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2010 The Pennsylvania State University