See this document in CiteSeerX!

CXHist: An On-line Classification-Based Histogram for XML String Selectivity Estimation  (Make Corrections)  
Lipyeow Lim, Min Wang, Jeffrey Scott Vitter



  Home/Search   Context   Related

 
View or download:
vldb2005.org/program/pa...p1187lim.pdf
Cached:  PDF   PS.gz  PS  Image  Update  Help

From:  vldb2005.org/program/paper/thu... (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: Query optimization in IBM's System RX, the first truly relational-XML hybrid data management system, requires accurate selectivity estimation of path-value pairs, i.e., the number of nodes in the XML tree reachable by a given path with the given text value. Previous techniques have been inadequate, because they have focused mainly on the tag-labeled paths (tree structure) of the XML data. For most real XML data, the number of distinct string values at the leaf nodes is orders of... (Update)

Active bibliography (related documents):   More   All
0.6:   The History of Histograms (abridged) - Ioannidis (2003)   (Correct)
0.5:   Statistical Learning Techniques for Costing XML Queries - Zhang, Haas, Josifovski, .. (2005)   (Correct)
0.5:   Selectivity Estimation for String Predicates: Overcoming .. - Chaudhuri, Ganti.. (2004)   (Correct)

Similar documents based on text:
0.0:   Unknown -   (Correct)

BibTeX entry:   (Update)

@misc{ lim-cxhist,
  author = "Lipyeow Lim and Min Wang and Jeffrey Scott Vitter",
  title = "CXHist: An On-line Classification-Based Histogram for XML String Selectivity
    Estimation",
  url = "citeseer.ist.psu.edu/738985.html" }
Citations (may not include all citations):
2133   Pattern Classification and Scene Analysis (context) - Duda, Hart - 1972
1491   Learning internal representations by error propagation (context) - Rumelhart, Hinton et al. - 1986
121   An analysis of bayesian classifiers - Langley, Iba et al. - 1992
117   Least squares quantization in PCM (context) - Lloyd - 1982
116   Beyond independence: Conditions for the optimality of the si.. - Domingos, Pazzani - 1996
108   Prediction and entropy of printed english (context) - Shannon - 1951
91   Selectivity estimation without the attribute value independe.. - Poosala, Ioannidis - 1997
87   Quantizing for minimum distortion (context) - Max - 1960
43   Storing and querying ordered XML using a relational database.. - Tatarinov, Viglas et al. - 2002
40   Estimating the selectivity of XML path expressions for inter.. - Aboulnaga, Alameldeen et al. - 2001
35   Counting twig matches in a tree - Chen, Jagadish et al. - 2001
28   XRel: a path-based approach to storage and retrieval of XML .. (context) - Yoshikawa, Amagasa et al. - 2001
27   Self-tuning histograms: Building histograms without looking .. - Aboulnaga, Chaudhuri - 1999
22   StatiX: Making XML count - Freire, Haritsa et al. - 2002
18   STHoles: a multidimensional workload-aware histogram - Bruno, Chaudhuri et al. - 2001
17   Estimating answer sizes for XML queries (context) - Wu, Patel et al. - 2002
16   Structure and value synopses for XML data graphs - Polyzotis, Garofalakis - 2002
15   Statistical synopses for graphstructured XML databases - Polyzotis, Garofalakis - 2002
15   Estimating alphanumeric selectivity in the presence of wildc.. - Krishnan, Vitter et al. - 1996
14   Substring selectivity estimation - Jagadish, Ng et al. - 1999
11   Multidimensional substring selectivity estimation - Jagadish, Kapitskaia et al. - 1999
10   Selectivity estimation for boolean queries - Chen, Korn et al. - 2000
8   Selectivity estimation in the presence of alphanumeric corre.. - Wang, Vitter et al. - 1997
8   XPathLearner: An on-line self-tuning markov histogram for XM.. - Lim, Wang et al. - 2002
7   Human behaviour and the principle of least effort: an introd.. (context) - Zipf - 1949
6   Onedimensional and multi-dimensional substring selectivity e.. - Jagadish, Kapitskaia et al. - 2000
4   VXMLR: A visual XML-relational database system (context) - Zhou, Lu et al. - 2001
2   System RX: One part relational (context) - Ozcan, Cochrane et al. - 2005
2   XParent: An efficient RDBMS-based XML database system - Jiang, Lu et al. - 2002
2   Bloom histogram: Path selectivity estimation for xml data wi.. - Wang, Jiang et al. - 2004
1   Selectivity estimation for string predicates: Overcoming the.. - Chaudhuri, Ganti et al. - 2004

Documents on the same site (http://www.vldb2005.org/program/paper/thu/):   More
Shuffling a Stacked Deck: The Case for Partially.. - Pandey, Roy, Olston, al. (2005)   (Correct)
REED: Robust, Efficient Filtering and Event Detection - In Sensor Networks (2005)   (Correct)
Parallel Execution of Test Runs for Database Application.. - Haftmann, Kossmann, Lo (2005)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC