Abstract: Data mining algorithms, including machine learning, statistical analysis, and pattern recognition techniques, can greatly improve our understanding of data warehouses, which are now becoming more widespread. In this paper, we focus on classification algorithms and review the need for multiple classification algorithms. We describe a system called MLC++, which was designed to help choose the appropriate classification algorithm for a given dataset by making it easy to compare the utility of...
Context of citations to this paper:
...of tools for visualizing or interactively exploring the results of learning (e.g., the MineSet Tree Visualizer; Kohavi, Sommerfield, Dougherty, 1996). While these tools provide an excellent means of identifying and exploring what was learned, they do not provide...
...from demographic variables. When we evaluate models produced for this task, we would like to obtain rules such as: Classifier MC4 (Kohavi et al. 1997) is 21% less accurate than average on people who are between 45 and 55 years of age, are high school graduates, and are married....
Cited by:
Multivariate Discretization for Set Mining - Bay (2000)
Parcel: Feature Subset Selection in Variable Cost Domains - Scott, Niranjan, Prager (1998)
Characterizing Model Performance in the Feature Space - Bay, Pazzani
Similar documents (at the sentence level):
77.6%: Data Mining using MLC++ - A Machine Learning Library in C++ - Kohavi, Sommerfield.. (1997)
Active bibliography (related documents):
1.0: Wrappers For Performance Enhancement And Oblivious Decision Graphs - Kohavi (1995)
0.3: Wrappers for Feature Subset Selection - Kohavi, John (1996)
0.2: MLC++: A Machine Learning Library in C++ - Kohavi, John, Long, Manley.. (1994)
Similar documents based on text:
1.0: Unknown - Tutorial Machine Learning
1.0: Mlc++ - Kohavi, Sommerfield (1998)
0.6: Updated September 25, 1998 - Mineset Cli Brunk
Related documents from co-citation:
4: Programs for machine learning - Quinlan - 1993
4: KDD for science data analysis: Issues and examples - Fayyad, Haussler et al. - 1996
3: Lookahead and pathology in decision tree induction - Murthy, Salzberg - 1995
BibTeX entry:
R. Kohavi, D. Sommerfield, and J. Dougherty. Data mining using MLC++, a machine learning library in C++. International Journal of Artificial Intelligence Tools, 6(4):537--566, 1997. http://citeseer.ist.psu.edu/article/kohavi96data.html
@inproceedings{ kohavi96data,
author = "Ron Kohavi and Dan Sommerfield and James Dougherty",
title = "Data Mining Using {MLC}++: {A} Machine Learning Library in {C}++",
booktitle = "Tools with Artificial Intelligence",
publisher = "IEEE Computer Society Press",
pages = "234--245",
year = "1996",
url = "citeseer.ist.psu.edu/article/kohavi96data.html" }
Citations (may not include all citations):
2177: Programs for Machine Learning - Quinlan - 1993
2133: Pattern Classification and Scene Analysis - Duda, Hart - 1973
1359: Induction of decision trees - Quinlan - 1986
1056: Introduction to the Theory of Neural Computation - Hertz, Krogh et al. - 1991
667: UCI repository of machine learning databases - Murphy, Aha - 1996
657: Bagging predictors - Breiman - 1994
367: Stacked generalization - Wolpert - 1992
317: Learning quickly when irrelevant attributes abound: A new li.. - Littlestone - 1988
291: Computer Systems that Learn - Weiss, Kulikowski - 1991
291: Irrelevant features and the subset selection problem - John, Kohavi et al. - 1994
262: From data mining to knowledge discovery: An overview - Fayyad, Piatetsky-Shapiro et al. - 1996
216: Very simple classification rules perform well on most common.. - Holte - 1993
180: The CN2 induction algorithm - Clark, Niblett - 1989
171: A weighted nearest neighbor algorithm for learning with symb.. - Cost, Salzberg - 1993
171: Supervised and unsupervised discretization of continuous fea.. - Dougherty, Kohavi et al. - 1995
164: A study of cross-validation and bootstrap for accuracy estim.. - Kohavi
136: A system for the induction of oblique decision trees - Murthy, Kasif et al. - 1994
121: An analysis of bayesian classifiers - Langley, Iba et al. - 1992
116: Beyond independence: conditions for the optimality of the si.. - Domingos, Pazzani - 1996
89: Machine Learning - Taylor, Michie et al. - 1994
84: A conservation law for generalization performance - Schaffer - 1994
68: Improving regression estimation: averaging methods for varia.. - Perrone - 1993
68: Rule induction with CN2: Some recent improvements - Clark, Boswell - 1991
64: Nearest Neighbor - Dasarathy - 1990
62: A machine learning library in C++ - Kohavi, John et al. - 1994
58: The Estimation of Probabilities: An Essay on Modern Bayesian.. - Good - 1965
51: Wrappers for Performance Enhancement and Oblivious Decision .. - Kohavi
51: Drawing graphs with dot - Koutsofios, North - 1994
47: Theory and applications of agnostic PAC-learning with small .. - Auer, Holte et al. - 1995
45: A Study of Distance-Based Machine Learning Algorithms - Wettschereck - 1994
36: Hypothesis-driven constructive induction in AQ17-HCI: A met.. - Wnek, Michalski - 1994
36: Lazy decision trees - Friedman, Kohavi et al. - 1996
33: Comparing connectionist and symbolic learning methods - Quinlan - 1994
31: The power of decision tables - Kohavi
30: Inductive and bayesian learning in medical diagnosis - Kononenko - 1993
28: Automatic parameter selection by minimizing estimated error - Kohavi, John - 1995
25: Feature subset selection using the wrapper model: Overfittin.. - Kohavi, Sommerfield - 1995
22: Cross-validation and the bootstrap: Estimating the error rat.. - Efron, Tibshirani - 1995
19: The relationship between PAC - Wolpert - 1994
15: Tolerating noisy, irrelevant and novel attributes in instanc.. - Aha - 1992
10: LEDA: A Library of Efficient Data Types and Algorithms - Naeher - 1996
7: A planar geometric model for representing multidimensional d.. - Michalski - 1978
3: The Design and Evolution of C++ - Stroustrup - 1994
2: Scaling up the accuracy of naive-bayes classifiers: a decisi.. - ftp, stanford et al. - 1996
1: Learning Probabilistic Relational Concept Descriptions - Aha, to - 1996
Documents on the same site (http://wizpak.iaf.uiowa.edu/~sgiservices/Varsity/silicon_camp/Mineset/tech/):
Volume Rendering for Relational Data - Becker (1997)
Option Decision Trees with Majority Votes - Kohavi, Kunz (1997)
Feature Subset Selection Using the Wrapper Method.. - Kohavi, Sommerfield (1995)