Abstract: Modern relational database systems are beginning to
support ad hoc queries on mining models. In this paper,
we explore novel techniques for optimizing queries that apply
mining models to relational data. For such queries, we
use the internal structure of the mining model to automatically
derive traditional database predicates. We present
algorithms for deriving such predicates for some popular
discrete mining models: decision trees, Naive Bayes, and
clustering. Our experiments on Microsoft SQL...
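
To make the idea of deriving a traditional predicate from a model's internal structure concrete, here is a minimal sketch for the decision-tree case. It is an illustration under stated assumptions, not the paper's algorithm: the `Node` class, the `derive_predicate` function, the column names `age` and `income`, and the thresholds are all hypothetical. The sketch assumes a binary tree whose internal nodes test `column <= threshold`, and it builds the predicate for "predicted class = target" as the OR of the AND-ed conditions along each root-to-leaf path that ends in that class.

```python
from dataclasses import dataclass
from typing import Iterator, List, Optional, Tuple


@dataclass
class Node:
    """Hypothetical tree node: internal nodes test `column <= threshold`, leaves carry `label`."""
    column: Optional[str] = None
    threshold: Optional[float] = None
    left: Optional["Node"] = None    # branch taken when column <= threshold
    right: Optional["Node"] = None   # branch taken when column > threshold
    label: Optional[str] = None      # set only on leaves


def paths_to_label(node: Node, target: str,
                   conds: Tuple[str, ...] = ()) -> Iterator[List[str]]:
    """Yield the list of path conditions for every leaf that predicts `target`."""
    if node.label is not None:
        if node.label == target:
            yield list(conds)
        return
    yield from paths_to_label(node.left, target,
                              conds + (f"{node.column} <= {node.threshold}",))
    yield from paths_to_label(node.right, target,
                              conds + (f"{node.column} > {node.threshold}",))


def derive_predicate(root: Node, target: str) -> str:
    """Return a SQL-style predicate that holds exactly when the tree predicts `target`."""
    disjuncts = []
    for conds in paths_to_label(root, target):
        # A leaf at the root has no conditions: the class is always predicted.
        disjuncts.append("(" + " AND ".join(conds) + ")" if conds else "TRUE")
    # No leaf predicts the class: the predicate can never hold.
    return " OR ".join(disjuncts) if disjuncts else "FALSE"


# Hypothetical two-level tree over columns `age` and `income`.
tree = Node(
    column="age", threshold=30,
    left=Node(label="low"),
    right=Node(column="income", threshold=50000,
               left=Node(label="low"),
               right=Node(label="high")),
)

print(derive_predicate(tree, "high"))
print(derive_predicate(tree, "low"))
```

A query that filters on the model's predicted class could then carry the derived predicate as an ordinary WHERE-clause condition, which a relational optimizer can match against indexes on `age` and `income`. For a class that no leaf predicts, the sketch returns `FALSE`, so the query can be answered without applying the model at all.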
BibTeX entry:
Surajit Chaudhuri, Vivek Narasayya, and Sunita Sarawagi. Efficient Evaluation of Queries with Mining Predicates. In ICDE 2002, pages 529 -- 540, 2002. http://citeseer.ist.psu.edu/chaudhuri02efficient.html
@inproceedings{ chaudhuri02efficient,
author = "Surajit Chaudhuri and Vivek R. Narasayya and Sunita Sarawagi",
title = "Efficient Evaluation of Queries with Mining Predicates",
booktitle = "{ICDE}",
year = "2002",
url = "citeseer.ist.psu.edu/chaudhuri02efficient.html" }