  An Evaluation of Different DLP Alternatives for the Embedded Media Domain

by Esther Salamí, Jesus Corbal, Mateo Valero, Roger Espasa (Departament d'Arquitectura de Computadors; Shrewsbury, MA)
http://www.ac.upc.es/homes/jcorbal/w-micro32.ps

Abstract:

The importance of media processing has produced a revolution in the design of embedded processors. To meet the high computational and technological demands of near-future media applications, new embedded processors are including features that were previously restricted to the general-purpose and supercomputing domains. In this paper we evaluate the performance of various DLP (Data Level Parallelism) oriented embedded architectures and analyze quantitative data to determine the strengths and weaknesses of each approach. Additionally, we analyze the differences between the explicitly parallel versions of code (often based on the standard algorithms) and the highly tuned, non-vectorizable versions usually found in real multimedia programs. We show that sub-word SIMD architectures (like MMX) are a very cost-effective solution, and that, while long vector architectures provide few improvements at a very high cost, a smart combination of vector and SIMD-like architectures is the alternative that delivers the best performance at a reasonable cost. We also show that the memory latency tolerance typical of vector architectures is partially offset by the worse spatial locality found when executing vector code.
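
As a rough illustration of what "sub-word SIMD" means in the abstract above, the sketch below emulates in plain Python a 64-bit register treated as eight independent 8-bit lanes, and adds two such registers with a single word-wide operation. This is not code from the paper and not an MMX intrinsic; the names `pack`, `unpack`, `packed_add_u8`, and `scalar_add_u8` are hypothetical helpers introduced here, and the wraparound (modulo 256) behaviour is an assumption chosen for simplicity.

```python
# Emulation of a sub-word SIMD add: one 64-bit word holds eight 8-bit lanes.
# All names here are illustrative assumptions, not taken from the paper.

LANES = 8
LOW7 = 0x7F7F7F7F7F7F7F7F   # low 7 bits of every 8-bit lane
HIGH1 = 0x8080808080808080  # top bit of every 8-bit lane


def pack(values):
    """Pack eight integers (taken modulo 256) into one 64-bit word, lane 0 lowest."""
    assert len(values) == LANES
    word = 0
    for i, v in enumerate(values):
        word |= (v & 0xFF) << (8 * i)
    return word


def unpack(word):
    """Split a 64-bit word back into its eight 8-bit lanes."""
    return [(word >> (8 * i)) & 0xFF for i in range(LANES)]


def packed_add_u8(a, b):
    """Add all eight lanes at once, each lane wrapping modulo 256.

    The low 7 bits of each lane are added with an ordinary integer add;
    masking guarantees no carry crosses a lane boundary. The top bit of
    each lane is then fixed up with XOR, which is a carry-free add.
    """
    return ((a & LOW7) + (b & LOW7)) ^ ((a ^ b) & HIGH1)


def scalar_add_u8(xs, ys):
    """Reference version: one add per element, as a scalar loop would do."""
    return [(x + y) & 0xFF for x, y in zip(xs, ys)]


if __name__ == "__main__":
    xs = [1, 2, 3, 250, 200, 0, 255, 128]
    ys = [10, 20, 30, 10, 100, 0, 1, 128]
    print(unpack(packed_add_u8(pack(xs), pack(ys))))
    print(scalar_add_u8(xs, ys))
```

The packed version does the work of eight scalar additions with a fixed handful of word-wide operations, which is the kind of data-level parallelism on narrow media data types (such as 8-bit pixels) that sub-word SIMD extensions target.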
