DATA PRELOAD FOR SUPERSCALAR AND VLIW PROCESSORS (1993)
Cached
Download Links
| Citations: | 21 - 1 self |
BibTeX
@MISC{Chen93datapreload,
author = {William Yu-Wei Chen and Jr.},
title = { DATA PRELOAD FOR SUPERSCALAR AND VLIW PROCESSORS },
year = {1993}
}
Years of Citing Articles
OpenURL
Abstract
... decreased the average number of clock cycles per instruction. As a result, each execution cycle has become more significant to overall system performance. To maximize the effectiveness of each cycle, one must expose instruction-level parallelism and employ memory latency tolerant techniques. However, without special architecture support, a superscalar compiler cannot effectively accomplish these two tasks in the presence of control and memory access dependences. Preloading is a class of architectural support which allows memory reads to be performed early in spite of potential violation of control and memory access dependences. With preload support, a superscalar compiler can perform more aggressive code reordering to provide increased tolerance of cache and memory access latencies and increasing instruction-level parallelism. This thesis discusses the architectural features and compiler support required to effectively utilize preload instructions to increase the overall system performance. The first hardware support is preload register update, a data preload support for load scheduling to reduce first-level cache hit latency. Preload register update keeps the load destination







