MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  Cache Write Generate For High-Performance Processing

Download:
Download as a PDF | Download as a PS
by Craig M. Wittenbrink, Arun K. Somani, Arun K. Somani, Chung-ho Chen, Chung-ho Chen
ftp://ftp.cse.ucsc.edu/pub/reinas/craig/generate.ps.Z
Add To MetaCart

Abstract:

1 Abstract: Much attention has been paid to read caching and several schemes have been developed to make read caching very efficient. As a result, the performance of write caching has become a concern. This paper investigates write caching policies and how they affect the performance of memory systems. We show that write caching can greatly alter the hit/miss ratios, but only more subtly affects the performance. Many factors such as write buffers, contention, split transactions, and multilevel caches dilute the correlation of cache miss ratios to performance. A more accurate performance formula is presented taking these factors into account. This allows an incremental improvement in multiprocessor cache architectures. Cache write generate is a scheme where a cache line is validated on a write miss without fetching from memory. It avoids unnecessary reads from main memory, reduces the CPU stalling time, lowers the cache miss latency, reduces bus contention, and thus increases the available bandwidth of the memory. We compare the performance of cache write generate with the performance of write around and write allocate in single processor and shared bus multiprocessors. We use register level simulations validated by our functioning hardware prototype Proteus system. Various memory speeds and differing numbers of processors are evaluated using detailed simulation models for high performance measurement accuracy.

Citations

3148 Computer Architecture: A Quantitative Approach – Hennessy, Patterson - 1996
537 Cache Memories – Smith - 1982
169 Image analysis using mathematical morphology – Haralick, Sternberg, et al. - 1987
160 High Performance Computer Architecture – Stone - 1993
134 Cache write policies and performance – Jouppi - 1991
91 The Wisconsin multicube: A new largescale cache-coherent multiprocessor – Goodman, Woest - 1988
74 The BaJance Multiprocessor System – Thakkar - 1988
56 The 801 Minicomputer – Radin - 1982
26 RP3 processor-memory element – Brantley, McAuliffe, et al. - 1985
10 MC68030 enhanced 32-bit microprocessor user’s manual, Third Ed – Motorola - 1990
2 Proteus: a reconfigurable computational network for computer vision – Haralick, Somani, et al. - 1995
2 Proteus system architecture and organization – Somani, Wittenbrink, et al. - 1991
1 Effects of Cache Traffic on Shared Bus Multiprocessor Systems – Chen, Somani - 1992
1 A Practical Guide to Task-Oriented Sequences of Morphological Operations for Use in Image Analysis – Costa - 1990
1 64-Bit Microprocessor Hardware Reference – Intel - 1990
1 Directed Data Cache for High Performance Morphological Image Processing – Wittenbrink - 1990