
BOAT Optimistic Decision Tree Construction (1999)  (37 citations)
Johannes Gehrke, Venkatesh Ganti, Raghu Ramakrishnan, Wei-Yin Loh



View or download:
wisc.edu/~johannes/p...boatsigmod99.ps
cornell.edu/johannes...boatsigmod99.ps
utexas.edu/course/...oatsigmod99.ps.gz

Abstract: Classification is an important data mining problem. Given a training database of records, each tagged with a class label, the goal of classification is to build a concise model that can be used to predict the class label of future, unlabeled records. Decision trees are a very popular class of classifiers. All current algorithms to construct decision trees, including all main-memory algorithms, make one scan over the training database per level of the tree. We introduce a new algorithm (BOAT)...

Cited by:
Computational and Visual Support for Geographical Knowledge.. - Gahegan, Brodaric (2002)
The Use of Emerging Patterns in the Analysis of Gene.. - Dong, Li, Wong (2003)
Mining Data Streams Using Option Trees - Holmes, Kirkby, Pfahringer (2004)

Active bibliography (related documents):
0.6:   A Framework for Measuring Differences in Data.. - Ganti, Ramakrishnan.. (1999)
0.2:   Communication and Memory Efficient Parallel Decision Tree.. - Jin, Agrawal (2003)
0.2:   A Robust Reputation System for P2P and Mobile Ad-hoc Networks - Buchegger, Le Boudec (2004)

Similar documents based on text:
0.5:   Clustering Large Datasets in Arbitrary Metric Spaces - Ganti, Ramakrishnan.. (1999)
0.3:   ICICLES: Self-tuning Samples for Approximate Query Answering - Ganti, Lee, Ramakrishnan (2000)
0.2:   A Framework for Measuring Changes in Data Characteristics - Ganti, Gehrke.. (1998)

Related documents from co-citation:
21:   SPRINT: A scalable parallel classifier for data mining - Shafer, Agrawal et al. - 1996
16:   Programs for machine learning (context) - Quinlan - 1993
12:   SLIQ: A fast scalable classifier for data mining - Mehta, Agrawal et al. - 1996

BibTeX entry:

J. Gehrke, V. Ganti, R. Ramakrishnan, and W.-Y. Loh. BOAT -- optimistic decision tree construction. In Proc. of the ACM SIGMOD Conference on Management of Data, June 1999. http://citeseer.ist.psu.edu/gehrke99boat.html

@inproceedings{gehrke99boat,
    author = "Johannes Gehrke and Venkatesh Ganti and Raghu Ramakrishnan and Wei-Yin Loh",
    title = "{BOAT} --- optimistic decision tree construction",
    booktitle = "Proc. of the ACM SIGMOD Conference on Management of Data",
    pages = "169--180",
    month = jun,
    year = "1999",
    url = "citeseer.ist.psu.edu/gehrke99boat.html" }

Citations (may not include all citations):
1262   Classification and Regression Trees - Breiman, Friedman et al. - 1984
546   An introduction to the bootstrap - Efron, Tibshirani - 1993
474   Advances in Knowledge Discovery and Data Mining - Fayyad, Piatetsky-Shapiro et al. - 1996
200   Neural and Statistical Classification - Michie, Spiegelhalter et al. - 1994
145   SPRINT: A scalable parallel classifier for data mining - Shafer, Agrawal et al. - 1996
117   IEEE Transactions on Knowledge and Data Engineering - Agrawal, Imielinski et al. - 1993
111   SLIQ: A fast scalable classifier for data mining - Mehta, Agrawal et al. - 1996
95   An interval classifier for database mining applications - Agrawal, Ghosh et al. - 1992
83   Incremental induction of decision trees - Utgoff - 1989
74   Data mining using two-dimensional optimized association rule.. - Fukuda, Morimoto et al. - 1996
61   Construction and Assessment of Classification Rules - Hand - 1997
60   Induction of decision trees - Quinlan - 1986
57   Decision tree induction based on efficient tree restructurin.. - Utgoff, Berkman et al. - 1997
41   Mining optimized association rules for numeric attributes - Fukuda, Morimoto et al. - 1996
38   Random Sampling from Databases - Olken - 1993
31   Rainforest - A framework for fast decision tree construction.. - Gehrke, Ramakrishnan et al. - 1996
25   Public: A decision tree classifier that integrates building .. - Rastogi, Shim - 1998
17   On growing better decision trees from data - Murthy - 1995
14   Algorithms for mining association rules for binary segmentat.. - Morimoto, Fukuda et al. - 1998
13   Constructing efficient decision trees by using optimized num.. - Fukuda, Morimoto et al. - 1996
10   Discovering predictive association rules - Megiddo, Srikant - 1998
5   Society for Industrial and Applied Mathematics - Mangasarian, Classics et al. - 1994
4   Cambridge Series in Statistical and Probabilistic Mathematic.. - Davison, Hinkley et al. - 1997
2   Split selection methods for classification trees - Loh, Shih - 1997
1   An empirical comparison of decision trees and other classifi.. - Lim, Loh et al. - 1997





Documents on the same site (http://128.105.7.11/~johannes/publications.html):
Clustering Large Datasets in Arbitrary Metric Spaces - Ganti, Ramakrishnan.. (1998)
