MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  A Multi-resolution Block Storage Model for Database Design (2003) [5 citations — 0 self]

Download:
Download as a PDF | Download as a PS
by Jingren Zhou, Kenneth A. Ross
Proceedings of the 2003 IDEAS Conference
http://www.cs.columbia.edu/~kar/pubsk/mbsmfinal.ps
Add To MetaCart

Abstract:

We propose a new storage model called MBSM (Multiresolution Block Storage Model) for laying out tables on disks. MBSM is intended to speed up operations such as scans that are typical of data warehouse workloads. Disk blocks are grouped into "super-blocks, " with a single record stored in a partitioned fashion among the blocks in a superblock. The intention is that a scan operation that needs to consult only a small number of attributes can access just those blocks of each super-block that contain the desired attributes. To achieve good performance given the physical characteristics of modern disks, we organize super-blocks on the disk into fixed-size "mega-blocks. " Within a megablock, blocks of the same type (from various super-blocks) are stored contiguously. We describe the changes needed in a conventional database system to manage tables using such a disk organization. We demonstrate experimentally that MBSM outperforms competing approaches such as NSM (N-ary Storage Model), DSM (Decomposition Storage Model) and PAX (Partition Attributes Across), for I/O bound decision-support workloads consisting of scans in which not all attributes are required. This improved performance comes at the expense of single-record insert and delete performance; we quantify the trade-offs involved. Unlike DSM, the cost of reconstructing a record from its partitions is small. MBSM stores attributes in a vertically partitioned manner similar to PAX, and thus shares PAX's good CPU cache behavior. We describe methods for mapping attributes to blocks within super-blocks in order to optimize overall performance, and show how to tune the super-block and mega-block sizes. 1

Citations

421 An Introduction to Disk Drive Mod-eling – Ruemmler, Wilkes - 1994
289 Bounds on Multiprocessing Timing Anomalies – Graham - 1969
179 Database Management Systems – Ramakrishnan, Gehrke - 2003
175 Optimization and approximation in deterministic sequencing and scheduling-A survey – Graham, Lawler, et al. - 1979
130 Dbmss on a modern processor: Where does time go – Ailamaki, DeWitt, et al. - 1999
116 Improved query performance with variant indexes – O’Neil, Quass - 1997
102 Bounds for certain multiprocessor anomalies. Bell System Technical Journal 45 – Graham - 2001
89 A decomposition storage model – Copeland, Koshafian - 1985
70 Database Architecture Optimized for the New Bottleneck: Memory Access – Boncz, Manegold, et al. - 1999
63 Cache conscious algorithms for relational query processing – Shatdal - 1994
49 Weaving relations for cache performance – Ailamaki, DeWitt, et al. - 2001
26 A case for fractured mirrors – Ramamurthy, DeWitt, et al. - 2002
18 What happens during a join? Dissecting cpu and memory optimization effects – Manegold, Boncz, et al.
10 An Effective Approach to Vertical Partitioning for Physical Design of Relational Databases – Cornell, Yu - 1990
1 The PC guide – Kozierok - 2002
1 Cheetah X15 product manual – Corp - 2002