See this document in CiteSeerX!

Efficient Evaluation of Queries with Mining Predicates (2002)  (Make Corrections)  (1 citation)
Surajit Chaudhuri, Vivek Narasayya, Sunita Sarawagi
ICDE



  Home/Search   Context   Related

 
View or download:
it.iitb.ac.in/~sunita/paper...icde02.ps
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  it.iitb.ac.in/~sunita/ (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: Modern relational database systems are beginning to support ad hoc queries on mining models. In this paper, we explore novel techniques for optimizing queries that apply mining models to relational data. For such queries, we use the internal structure of the mining model to automatically derive traditional database predicates. We present algorithms for deriving such predicates for some popular discrete mining models: decision trees, Naive Bayes, and clustering. Our experiments on Microsoft SQL... (Update)

Similar documents based on text:   More   All
0.4:   The Microsoft Database Research Group - Lomet, Barga, Chaudhuri, Larson..   (Correct)
0.4:   Serendipity - Life Is Full   (Correct)
0.4:   Bulletin of the Technical Committee on - December Vol No   (Correct)

BibTeX entry:   (Update)

Surjit Chaudhuri, Vivek Narasayya, and Sunita Sarawagi. Efficient Evaluation of Queries with Mining Predicates. In ICDE 2002, pages 529 -- 540, 2002. http://citeseer.ist.psu.edu/chaudhuri02efficient.html   More

@inproceedings{ chaudhuri02efficient,
    author = "Surajit Chaudhuri and Vivek R. Narasayya and Sunita Sarawagi",
    title = "Efficient Evaluation of Queries with Mining Predicates",
    booktitle = "{ICDE}",
    year = "2002",
    url = "citeseer.ist.psu.edu/chaudhuri02efficient.html" }
Citations (may not include all citations):
2177   Programs for Machine Learning (context) - Quinlan - 1993
976   Machine Learning (context) - Mitchell - 1997
805   Algorithms for Clustering Data (context) - Jain, Dubes - 1988
696   UCI repository of machine learning databases (context) - Blake, Merz - 1998
475   Automatic subspace clustering of high dimensional data for d.. - Agrawal, Gehrke et al. - 1998
282   Finding Groups in Data: An Introduction to Cluster Analysis (context) - Kaufman, Rousseeuw - 1990
248   Fast effective rule induction - Cohen - 1995
210   A densitybased algorithm for discovering clusters in large s.. - Ester, Kriegel et al. - 1996
205   Mixture models: Inference and applications to clustering (context) - McLachlan, Basford - 1988
171   Supervised and unsupervised discretization of continuous fea.. - Dougherty, Kohavi et al. - 1995
135   Theory and results (context) - Cheeseman, Stutz et al. - 1996
88   Predicate migration: Optimizing queries with expensive predi.. - Hellerstein, Stonebraker - 1993
37   Data mining using MLC++: A machine learning library in C (context) - Kohavi, Sommerfield et al. - 1996
36   An algorithm for point clustering and grid generation (context) - Berger, Regoutsos - 1991
32   Optimization of queries with user-defined predicates - Chaudhuri, Shim - 1996

[Article contains additional citations not shown here]

Documents on the same site (http://www.it.iitb.ac.in/~sunita/):
Interactive Deduplication using Active Learning - Sarawagi, Bhamidipaty (2002)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC