MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  A query sampling method for estimating local cost parameters in a multidatabase system (1994) [31 citations — 7 self]

Download:
Download as a PDF | Download as a PS
by Qiang Zhu, Per- Ake Larson
In IEEE International Conference on Data Engineering
http://www.engin.umd.umich.edu/~qzhu/papers/icde94_pub.ps
Add To MetaCart

Abstract:

In a multidatabase system (MDBS), some query optimization information related to local database systems may not be available at the global level because of local autonomy. To perform global query optimization, a method is required to derive the necessary local information. This paper presents a new method that employs a query sampling technique to estimate the cost parameters of an autonomous local database system. We introduce a classification for grouping local queries and suggest a cost estimation formula for the queries in each class. We present a procedure to draw a sample of queries from each class and use the observed costs of sample queries to determine the cost parameters by multiple regression. Experimental results indicate that the method is quite promising for estimating the cost of local queries in an MDBS. 1

Citations

152 Equi-depth histograms for estimating selectivity factors for multi-dimensional queries – Muralikrishna, DeWitt - 1988
137 Practical selectivity estimation through adaptive sampling – Lipton, Naughton, et al.
74 Query Optimization in a Heterogeneous DBMS – Du, Krishnamurthy, et al. - 1992
33 Simple random sampling from relational databases – Olken, Rotem - 1986
27 Estimating Record Selectivities – Christodoulakis - 1983
12 Accurate estimation of the number of tuples satisfying a condition – Shapiro, Connel - 1984
11 Query optimization in multidatabase systems – Zhu - 1992
9 On global query optimization in multidatabase systems – Lu, Shan - 1992
9 An integrated method for estimating selectivities in a multidatabase system – Zhu - 1993
8 Establishing a fuzzy cost model for query optimization in a multidatabase system – Zhu, Larson - 1994
4 Statistical Methods for Business and – Pfaffenberger, Patterson - 1987
3 et al. Error-constrained COUNT query evaluation in relational databases – Hou - 1991