MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  Building Regression Cost Models for Multidatabase Systems (1996) [20 citations — 3 self]

Download:
Download as a PDF | Download as a PS
by Qiang Zhu, Per- Ake Larson
In Proceedings of PDIS
ftp://ftp.research.microsoft.com/users/palarson/pdis96.ps
Add To MetaCart

Abstract:

A major challenge for performing global query optimization in a multidatabase system (MDBS) is the lack of cost models for local database systems at the global level. In this paper we present a statistical procedure based on multiple regression analysis for building cost models for local database systems in an MDBS. Explanatory variables that can be included in a regression model are identified and a mixed forward and backward method for selecting significant explanatory variables is presented. Measures for developing useful regression cost models, such as removing outliers, eliminating multicollinearity, validating regression model assumptions, and checking significance of regression models, are discussed. Experimental results demonstrate that the presented statistical procedure can develop useful local cost models in an MDBS.

Citations

176 Query optimization in database systems – Jarke, Koch - 1984
137 Practical selectivity estimation through adaptive sampling – Lipton, Naughton, et al.
74 Query Optimization in a Heterogeneous DBMS – Du, Krishnamurthy, et al. - 1992
46 et al. Access Path Selection in a Relational Database Management System – Selinger - 1979
33 Simple random sampling from relational databases – Olken, Rotem - 1986
31 A query sampling method of estimating local cost parameters in a multidatabase system – Zhu, Larson - 1994
17 On global multidatabase query optimization – Lu, Ooi, et al. - 1992
12 Accurate estimation of the number of tuples satisfying a condition – Shapiro, Connel - 1984
11 Query optimization in multidatabase systems – Zhu - 1992
10 The Theory of Linear Models and Multivariate Analysis – Arnold - 1981
8 Establishing a fuzzy cost model for query optimization in a multidatabase system – Zhu, Larson - 1994
5 Regression Analysis by Example, 2nd Ed – Chatterjee, Price - 1991
4 Statistical Methods for Business and – Pfaffenberger, Patterson - 1987
4 Query optimization using fuzzy set theory for multidatabase systems – Zhu, Larson - 1993
2 Statistical Methods, 6th Ed. The Iowa State university – Snedecor, Cochran - 1967