by Y. Andreopoulos, P. Schelkens, J. Cornelis
in Proc. of the 2001 IEEE Signal Processing Systems
http://www.etro.vub.ac.be/Members/andreopoulos.yiannis/_private/SIPSAndreopoulos.pdf
Abstract: This paper compares various software implementations of the 2-D binary-tree wavelet decomposition by analyzing the data-related cache penalties on processor-based platforms. Such penalties appear to be the dominant factor determining performance in this type of application. The comparisons cover various image-scanning techniques, from the classical Row-Column approach to the Local Wavelet Transform and the Line-Based Wavelet Transform, which have been proposed in the framework of multimedia-coding standards. For a conflict-free cache model, a theoretical framework is constructed that allows prediction of the data-cache penalties expected to degrade system performance. The theoretical results are verified with measurements from simulations and from a real platform.
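The effect the abstract refers to can be illustrated with a small sketch. It is not the paper's theoretical framework: it assumes a fully associative LRU cache as a stand-in for a "conflict-free" cache, a row-major image, and hypothetical sizes (256x256 samples, 2 bytes per sample, 4 KB cache, 32-byte lines). All function names are made up for this illustration. It only counts misses for one pass along rows versus one pass along columns, the two passes of a Row-Column decomposition level.

```python
from collections import OrderedDict


def count_misses(addresses, cache_bytes, line_bytes):
    """Count misses in a fully associative LRU cache (no conflict misses)."""
    capacity = cache_bytes // line_bytes  # number of cache lines
    cache = OrderedDict()                 # keys: line indices, in LRU order
    misses = 0
    for addr in addresses:
        line = addr // line_bytes
        if line in cache:
            cache.move_to_end(line)       # mark as most recently used
        else:
            misses += 1
            cache[line] = True
            if len(cache) > capacity:
                cache.popitem(last=False)  # evict least recently used line
    return misses


def row_scan(width, height, elem_bytes):
    """Byte addresses visited when walking a row-major image row by row."""
    for r in range(height):
        for c in range(width):
            yield (r * width + c) * elem_bytes


def column_scan(width, height, elem_bytes):
    """Byte addresses visited when walking the same image column by column."""
    for c in range(width):
        for r in range(height):
            yield (r * width + c) * elem_bytes


# Hypothetical parameters, chosen only for illustration.
WIDTH = HEIGHT = 256
ELEM_BYTES = 2
CACHE_BYTES = 4 * 1024
LINE_BYTES = 32

row_misses = count_misses(row_scan(WIDTH, HEIGHT, ELEM_BYTES), CACHE_BYTES, LINE_BYTES)
col_misses = count_misses(column_scan(WIDTH, HEIGHT, ELEM_BYTES), CACHE_BYTES, LINE_BYTES)
print("row pass misses:   ", row_misses)
print("column pass misses:", col_misses)
```

Under these assumed parameters the row pass misses once per cache line of the image, while the column pass misses on every access, because one column touches more distinct lines than the cache can hold. Scanning orders such as the line-based and local transforms named in the abstract aim to reduce this kind of penalty; modelling them is beyond this sketch.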