  Exploiting Application Parallelism Using Advanced Intelligent Memory - The FlexRAM Approach (1999) [2 citations — 1 self]

by Wei Huang, B. Eng, Sujoy Basu, Qiang Cao, Marcelo Cintra, Zhenzhou Ge, Yi Kang, Diana Keen, Venkata Krishnan, Vinh Vi Lam, Jose Martinez, Anthony-trung Nguyen, Yan Solihin, Pedro Trancoso, Liuxi Yang, Seung-moon Yoo, Ye Zhang
http://iacoma.cs.uiuc.edu/~weihuang/others/thesis.ps.gz

Abstract:

State-of-the-art microprocessors employ tens of millions of transistors on a single chip, most of which are used to tackle the problem of slow memory. While these processors demonstrate great benchmark ratings, they do not necessarily work well on many real-world applications. On the other hand, memory components offer huge numbers of transistors that could be utilized to do significant work, thanks to major advances in Merged Logic DRAM technology. In this thesis, a novel architecture is proposed to bridge the memory/processor speed gap in a much more cost-effective way than the traditional central processor. Parallelism is also exploited using simple processor arrays, thus surmounting the limit of ILP. The combined effect has very promising results. Assuming conservative parameters for the proposed system while using aggressive parameters for a traditional reference system, we get an average speedup of 7.8 and a best case of 36 for five benchmarks with only a single memory chip.

To My Parents, for their endless love and support.

ACKNOWLEDGMENTS

First and foremost I would like to thank my advisor Josep Torrellas for his guidance and support over the past few years, which made the completion of this thesis possible. Secondly, my gratitude goes to all the IACOMA group members, current and past,

Citations

3148 Computer Architecture: A Quantitative Approach – Hennessy, Patterson - 1996
2062 The Self-Organizing Map – Kohonen - 1990
1184 Basic local alignment search tool – Altschul, Gish - 1990
390 The Connection Machine – Hillis - 1985
75 Active Pages: A Computation Model for Intelligent Memory – Oskin, Chong, et al.
38 Discovering Data Mining: From Concept to Implementation – Cabena, Stadler, et al. - 1997
33 An Execution-Driven Framework for Fast and Accurate Simulation of Superscalar Processors – Krishnan, Torrellas - 1998
23 A scalable parallel algorithm for self-organizing maps with applications to sparse data mining problems. Data Mining and Knowledge Discovery, 3(2):171–195 – Lawrence, Almasi, et al. - 1999
21 The EXECUBE Approach to Massively Parallel Processing – Kogge - 1994
16 MINT: A Front End for Efficient Simulation of Shared-Memory Multiprocessors – Veenstra, Fowler - 1994
12 Evaluation of existing architectures in IRAM systems – Bowman, Cardwell, et al. - 1997
12 Distributed Vector Architecture: Beyond a Single Vector-IRAM – Kaxiras, Sugumar, et al. - 1997
11 A single chip multiprocessor integrated with high density DRAM – Yamauchi, Hammond, et al. - 1997
10 Hardware Barrier Synchronization: Dynamic Barrier MIMD – O’Keefe, Dietz - 1990
9 Dynamic Barrier Architecture For Multi-Mode Fine-Grain Parallelism Using – Cohen, Dietz, et al. - 1994
7 Parallel Supercomputing in SIMD Architectures – Hord - 1990
5 A 32-bank 1Gb DRAM with 1 GB/s Bandwidth – Yoo, et al. - 1996
3 Computational RAM: The case for SIMD computing in memory – Elliott, Stumm, et al. - 1997
3 Using MML to Simulate Multiple Dual-Ported SRAMs: Parallel Routing Lookups – Brown, et al. - 1997
2 DRAMs: Today and Toward System-Level Integration. Half-day seminar – Embedded - 1997
1 Database Mining: A Performance Perspective – Agrawal, Imielinski, et al. - 1993
1 Multidimension Access Memory – Batcher - 1977
1 STARAN parallel processor system hardware – E - 1974
1 Granacki et al – John
1 IRAM Design for Multimedia Applications – Kim, Choi, et al. - 1997
1 The Smart Access Memory: An Intelligent RAM for Nearest Neighbor Database Searching – Lipman, Yang - 1997
1 Hardware-Software Trade-Offs in a Direct Rambus Implementation of the RAMpage Memory Hierarchy – Machanick, Salvedra, et al. - 1998
1 Hardware Barrier Synchronization: Dynamic Static MIMD SBM – O'Keefe, Dietz - 1990
1 Combinatorial Motif Discovery In Biological Sequences Using the TEIRESIAS Algorithm – Rigoutsos, Floratos - 1997