Download:
|
by Wei Huang, B. Eng, Sujoy Basu, Qiang Cao, Marcelo Cintra, Zhenzhou Ge, Yi Kang, Diana Keen, Venkata Krishnan, Vinh Vi Lam, Jose Martinez, Anthony-trung Nguyen, Yan Solihin, Pedro Trancoso, Liuxi Yang, Seung-moon Yoo, Ye Zhang Their
http://iacoma.cs.uiuc.edu/~weihuang/others/thesis.ps.gz
Add To MetaCart
Abstract:
The state-of-the-art microprocessor employs tens of millions of transistors on a single chip, most of them are used to tackle the problem of slow memory. While these processors demonstrate great benchmark ratings, they do not necessarily work well on a lot of real world applications. On the other hand, memory components oer huge numbers of transistors that could be utilized to do signicant work thanks to major advances in Merged Logic DRAM technology. In this thesis a novel architecture is proposed to bridge the memory/processor speed gap in a much more cost eective way than the traditional central processor. Also parallelism is exploited using simple processor arrays thus surmounting the limit of ILP. The combined eect has very promising results. Assuming conservative parameters for the proposed system while using aggressive parameters for a traditional reference system, we get an average speedup of 7.8 and best case of 36 for ve benchmarks only with a single memory chip. iii To My Parents, for their endless love and support. iv ACKNOWLEDGMENTS First and foremost I would like to thank my advisor Josep Torrellas for his guidance and support over the past few years, which made the completion of this thesis possible. Secondly, my gratitude goes to all the IACOMA group members, current and past,
Citations
|
3148
|
Computer Architecture: A Quantitative Approach
– Hennessy, Patterson
- 1996
|
|
2062
|
The Self-Organizing Map
– Kohonen
- 1990
|
|
1184
|
Basic local alignment search tool
– Altschul, Gish
- 1990
|
|
390
|
The Connection Machine
– Hillis
- 1985
|
|
75
|
Active Pages: A Computation Model for Intelligent Memory
– Oskin, Chong, et al.
|
|
38
|
Discovering Data Mining: From Concept to Implementation”, ISBN
– Cabena, Stadler, et al.
- 1997
|
|
33
|
An Execution-Driven Framework for Fast and Accurate Simulation of Superscalar Processors
– Krishnan, Torrellas
- 1998
|
|
23
|
A scalable parallel algorithm for self-organizing maps with applications to sparse data mining problems. Data Mining and Knowledge Discovery, 3(2):171–195
– Lawrence, Almasi, et al.
- 1999
|
|
21
|
The EXECUBE Approach to Massively Parallel Processing
– Kogge
- 1994
|
|
16
|
MINT: A Front End for Ecient Simulation of Shared-Memory Multiprocessors
– Veenstra, Fowler
- 1994
|
|
12
|
Evaluation of existing architectures in IRAM systems
– Bowman, Cardwell, et al.
- 1997
|
|
12
|
Distributed Vector Architecture: Beyond a Single Vector-IRAM
– Kaxiras, Sugumar, et al.
- 1997
|
|
11
|
A single chip multiprocessor integrated with high density DRAM
– Yamauchi, Hammond, et al.
- 1997
|
|
10
|
Hardware Barrier Synchronization: Dynamic Barrier MIMD
– O’Keefe, Dietz
- 1990
|
|
9
|
Dynamic Barrier Architecture For Multi-Mode Fine-Grain Parallelism Using
– Cohen, Dietz, et al.
- 1994
|
|
7
|
Parallel Supercomputing in SIMD Architectures
– Hord
- 1990
|
|
5
|
et al. A 32-bank 1Gb DRAM with 1 GB/s Bandwidth
– Yoo
- 1996
|
|
3
|
Computational RAM: The case for SIMD computing in memory
– Elliott, Stumm, et al.
- 1997
|
|
3
|
et al. Using MML to Simulate Multiple Dual-Ported SRAMs: Parallel Routing Lookups
– Brown
- 1997
|
|
2
|
DRAMs: Today and Toward System-Level Integration. Half-day seminar
– Embedded
- 1997
|
|
1
|
Database Mining: A Perfromance perspective
– Agrawal, Imielinski, et al.
- 1993
|
|
1
|
Multidimension Access Memory
– Batcher
- 1977
|
|
1
|
STARAN parallel processor system hardware
– E
- 1974
|
|
1
|
Granacki et al
– John
|
|
1
|
IRAM Design for Multimedia Applications
– Kim, Choi, et al.
- 1997
|
|
1
|
The Smart Access Memory: An Intelligent RAM for Nearest Neighbor Database Searching
– Lipman, Yang
- 1997
|
|
1
|
Hardware-Software Trade-Os in a Direct Rambus Implementation of the RAMpage Memory Hierarchy
– Machanick, Salvedra, et al.
- 1998
|
|
1
|
Hardware Barrier Synchronization: Dynamic Static MIMD SBM
– O'Keefe, Dietz
- 1990
|
|
1
|
A Floratos. Combinatorial Motif Discovery In Biological Sequences Using the TEIRESIAS Algorithm
– Rigoutsos
- 1997
|