Download:
|
by Alexis Vartanian, Jean-luc Bechennec, Nathalie Drach-temam
In Proceedings of the 6th International Symposium on High Performance Computer Architecture
http://www.lri.fr/~alex/publis/paper0799.ps.gz
Add To MetaCart
Abstract:
The quality of a real-time high end virtual reality system depends on its ability to draw millions of textured triangles in 1/60s. The idea of using commodity PC 3D accelerators to build a parallel machine instead of custom ASICs seems more and more attractive as such chips are getting faster. If image parallelism is used, designers have the choice between two distributions: line interleaving and square block interleaving. Having fixed block shape and size makes chip design easier. A PC 3D accelerator has a cost-effective external bus and an on-chip texture cache. The performance of such a cache depends on spatial locality. If the image is rendered in multiple engines, this locality is reduced. Locality and load balancing depend on the distribution scheme of the machine. This paper investigates the impact of the distribution scheme on the performance of such a machine. We use detailed cache and memory system simulations with virtual reality benchmarks running on different configurations. We show that: (i) Both distributions have the same maximum performance with less than 16 processors but square block has a better speedup with 64 processors. (ii) SLI best block size depends on the number of processors of the machine and is not suitable for a scalable chip with a fixed block size. (iii) Using a big triangle buffer in the texture mapping engine has a very important impact on the performance. 1
Citations
|
137
|
PixelFlow: High-speed rendering using image composition
– Molnar, Eyles, et al.
- 1992
|
|
133
|
A sorting classification of parallel rendering
– Molnar, Cox, et al.
- 1994
|
|
124
|
RealityEngine graphics
– Akeley
- 1993
|
|
102
|
Pixel-Planes 5: a heterogeneous multiprocessor graphics system using processor-enhanced memories
– Fuchs, Poulton, et al.
- 1989
|
|
102
|
Advanced Animation and Rendering Techniques
– WATT, WATT
- 1992
|
|
87
|
InfiniteReality: A real-time graphics system
– Montrym, Baum, et al.
- 1997
|
|
48
|
Pixelflow: the realization
– Eyles, Molnar, et al.
- 1997
|
|
47
|
The design and analysis of a cache architecture for texture mapping
– Hakura, Gupta
- 1997
|
|
21
|
Prefetching in a texture cache architecture
– Igehy, Eldridge, et al.
- 1998
|
|
21
|
Hardware Accelerated Rendering of Antialiasing Using a Modified A-Buffer Algorithm
– Winner, Kelley, et al.
- 1997
|
|
16
|
Multi-Level Texture Caching for 3D Graphics Hardware
– Cox, Bhandari, et al.
|
|
13
|
Architectural implications of hardware-accelerated bucket rendering on the PC
– Cox, Bhandari
- 1997
|
|
10
|
Unsolved Problems and Opportunities for High-quality, High-performance 3-D Graphics on a PC Platform
– Kirk
- 1998
|
|
8
|
Parallel texture caching
– IGEHY, ELDRIDGE, et al.
- 1999
|
|
7
|
Tracing Interactive 3D Graphics Programs
– Dunwoody, Linton
- 1990
|
|
4
|
Models of the impact of overlap in bucket rendering
– Chen, Stoll, et al.
- 1998
|
|
4
|
Neon: a (big) (fast) single-chip 3D workstation graphics accelerator
– McCormack, McNamara, et al.
- 1998
|
|
4
|
cker Chiueh. Implementation and Evaluation of the Parallel Mesa Library
– Mitra, T
- 1998
|
|
3
|
ASF: A teaching and research objectoriented simulation tool for computer architecture design and performance evaluation
– Bechennec
- 1998
|
|
2
|
3D vendors aim high
– Glaskowsky
- 1998
|
|
2
|
The design of a parallel graphics inferface
– Igehy, Stoll, et al.
- 1998
|
|
2
|
Evaluation of High Performance Multicache Parallel Texture Mapping
– Vartanian, Béchennec, et al.
- 1998
|
|
1
|
3DLabs flies with jetstream
– Glaskowsky
- 1998
|
|
1
|
The OpenGL Graphics System
– Segal, Akelay
- 1996
|
|
1
|
Two schemes to improve the performance of a sort-last 3D parallel rendering machine with texture caches
– Vartanian, Bechennec, et al.
- 1999
|