MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  Optimizing Multi-Join Queries in Parallel Relational Databases (1993) [32 citations — 4 self]

Download:
Download as a PDF | Download as a PS
by Jaideep Srivastava, Gary Elsesser
In Proc. Int'l. Conf. on Parallel and Distr. Inf. Sys
ftp://ftp.cs.umn.edu/dept/users/padma/optimization/pdis93.ps.Z
Add To MetaCart

Abstract:

Query optimization for parallel machines needs to consider machine architecture, processor and memory resources available, and different types of parallelism, making the search space much larger than the sequential case. In this paper our aim is to determine a plan that makes the execution of an individual query very fast, making minimizing parallel execution time the right objective. This creates the following circular dependence: a plan tree is needed for effective resource assignment, which is needed to estimate the parallel execution time, and this is needed for the cost-based search for a good plan tree. In this paper we propose a new search heuristic that breaks the cycle by constructing the plan tree layer by layer in a bottom-up manner. To select nodes at the next level, the lower and upper bounds on the execution time for plans consistent with the decisions made so far are estimated and are used to guide the search. A query plan representation for intra- and inter-operator parallelism, pipelining, and processor and memory assignment is proposed. Also proposed is a new approach to estimating the parallel execution time of a plan that considers sum and max of operators working sequentially and in parallel, respectively. The results obtained from a prototype optimizer are presented.

Citations

146 Apers . Dataflow query execution in a parallel main-memory environment – Wilschut, G - 1991
120 A Performance Evaluation of Four Parallel Join Algorithms in a SharedNothing Multiprocessor Environment – Schneider, DeWitt - 1989
118 Encapsulation of Parallelism in the Volcano Query Processing System – Graefe - 1990
96 Multiprocessor hash-based join algorithms – DeWitt, Gerber - 1985
92 Query Optimization for Parallel Execution – Ganguly, Hasan, et al. - 1992
91 Tradeo s in processing complex join queries via hashing in multiprocessor database machines – Schneider, DeWitt - 1990
72 Optimization of parallel query execution plans in xprs – Hong, Stonebraker - 1991
46 et al. Access Path Selection in a Relational Database Management System – Selinger - 1979
44 Parallel Database Systems: The Future of Database Processing or a Passing Fad – DeWitt, Gray - 1990
29 et al. The Gamma Database Machine Project – DeWitt - 1990
28 Design and Evaluation of Parallel Pipelined Join Algorithms – Richardson, Lu, et al. - 1987
17 Hash-based and index-based join algorithms for cube and ring connected multicomputers – Omiecinski, Lin - 1989
15 Dataflow Query Processing using Multiprocessor Hash-Partitioned Algorithms – Gerber - 1986
5 Join on a cube: Analysis, simulation, and implementation – Baru, Frieder, et al. - 1987
5 et al. The design of xprs – Stonebraker - 1988
2 et al. Parallelism in relational database systems: Architectural issues and design approaches – Pirahesh - 1990
2 Challenges to parallel processing. talk given at – Gustafson - 1989
2 Parallel query optimization for shared memory multiprocessors – Lu, Shan, et al. - 1991