(Enter summary)
Abstract: We address the problem of finding a subset of features that allows a supervised induction algorithm to induce small high-accuracy concepts. We examine notions of relevance and irrelevance, and show that the definitions used in the machine learning literature do not adequately partition the features into useful categories of relevance. We present definitions for irrelevance and for two degrees of relevance. These definitions improve our understanding of the behavior of previous subset selection... (Update)
Cited by: More
An Improved Boosting Algorithm and Its Application to Text.. - Sebastiani, al. (2000)
(Correct)
Edited Naive Bayes - Martnez-Otzeta Sierra Lazkano
(Correct)
Anytime Algorithm for Feature Selection - Mark Last Abraham
(Correct)
Similar documents (at the sentence level):
11.8%: Wrappers for Feature Subset Selection - Kohavi, John (1997)
(Correct)
6.5%: Wrappers For Performance Enhancement And Oblivious Decision Graphs - Kohavi (1995)
(Correct)
Active bibliography (related documents): More All
1.0: Feature Subset Selection as Search with Probabilistic Estimates - Kohavi (1994)
(Correct)
0.4: Useful Feature Subsets and Rough Set Reducts - Kohavi, Frasca (1994)
(Correct)
0.3: Exploiting Upper Approximation in the Rough Set Methodology - Deogun, Raghavan, Sever (1995)
(Correct)
Similar documents based on text: More All
0.2: On-Line Cumulative Learning of Hierarchical Sparse n-Grams - Pfleger (2004)
(Correct)
0.2: On-Line Learning of Predictive Compositional Hierarchies By.. - Pfleger
(Correct)
0.2: Cascade Correlation: Derivation of a More Numerically.. - George John Computer
(Correct)
Related documents from co-citation: More All
44: Programs for machine learning (context) - Quinlan - 1993
31: Greedy attribute selection
- Caruana, Freitag - 1994
27: UCI repository of machine learning databases (context) - Merz, Murphy et al. - 1997
BibTeX entry: (Update)
John, G.H., Kohavi, R., Pfleger, K., Irrelevant Features and the Subset Selection Problem, Proc. of the 11th International Conference on Machine Learning ICML94, pp. 121---129, 1994. http://citeseer.ist.psu.edu/john94irrelevant.html More
@inproceedings{ john94irrelevant,
author = "George H. John and Ron Kohavi and Karl Pfleger",
title = "Irrelevant Features and the Subset Selection Problem",
booktitle = "International Conference on Machine Learning",
pages = "121-129",
note = "Journal version in AIJ, available at http://citeseer.nj.nec.com/13663.html",
year = "1994",
url = "citeseer.ist.psu.edu/john94irrelevant.html" }
Citations (may not include all citations):
1359
Induction of decision trees (context) - Quinlan - 1986
1262
Classification and Regression Trees (context) - Breiman, Friedman et al. - 1984
667
UCI repository of machine learning databases (context) - Murphy, Aha - 1994
317
Learning quickly when irrelevant attributes abound: A new li.. (context) - Littlestone - 1988
291
Computer Systems that Learn (context) - Weiss, Kulikowski - 1991
239
Pattern Recognition: A Statistical Approach (context) - Devijver, Kittler - 1982
216
Very simple classification rules perform well on most common.. (context) - Holte - 1993
166
Applied Regression Analysis (context) - Draper, Smith - 1981
139
Stochastic complexity and modeling (context) - Rissanen - 1986
132
Estimation and inference by compact coding (context) - Wallace, Freeman - 1987
126
A practical approach to feature selection (context) - Kira, Rendell
125
Learning with many irrelevant features
- Almuallim, Dietterich - 1991
111
The feature selection problem: Traditional methods and a new.. (context) - Kira, Rendell
105
The monk's problems: A performance comparison of different l..
- Thrun - 1991
102
Training a 3-node neural network is NP-complete
- Blum, Rivest - 1992
96
Occam's razor (context) - Blumer, Ehrenfeucht et al. - 1987
87
Subset Selection in Regression (context) - Miller - 1990
87
Prototype and feature selection by sampling and random mutat..
- Skalak - 1994
74
A branch and bound algorithm for feature subset selection (context) - Narendra, Fukunaga - 1977
59
Efficient algorithms for minimizing cross validation error
- Moore, Lee - 1994
53
Using decision trees to improve case-based learning
- Cardie - 1993
47
On automatic feature selection (context) - Siedlecki, Sklansky - 1988
42
Efficiently inducing determinations: A complete and systemat..
- Schlimmer - 1993
32
Oblivious decision trees and abstract cases
- Langley, Sage - 1994
28
Efficient pruning methods for separate-and-conquer rule lear.. (context) - Cohen - 1993
19
Irrelevance Reasoning in Knowledge Based Systems
- Levy - 1993
18
the effectiveness of receptors in recognition systems (context) - Marill, Green - 1963
11
Optimal Subset Selection (context) - Boyce, Farhi et al. - 1974
10
Best first strategy for feature selection (context) - Xu, Yan et al. - 1989
4
Use of distance measures (context) - Ben-Bassat - 1982
3
IEEE Transactions on Computers C (context) - on, Wasserman - 1990
3
Feature selection using rough sets theory (context) - Hall - 1993
3
The Use of Knowledge in Analogy and Induction (context) - Russel - 1989
3
and Fisher (context) - Gennari, Langley - 1989
2
Genetic algorithms as a tool for feature selection in machin.. (context) - CMU-CS-, Carnegie et al. - 1992
1
Boolean feature discovery in empirical learning (context) - Statistical, Irwin et al. - 1990
1
Preliminary steps toward the automation of induction (context) - Statist, Russel - 1986
1
Readings in Machine Learning (context) - Learning, Reprinted et al. - 1992
1
the difficulty of finding small consistent decision trees (context) - incremental, formation et al. - 1989
1
Some comments on c p (context) - Learning, Mallows - 1973
The graph only includes citing articles where the year of publication is known.
Documents on the same site (http://www.stanford.edu/~kpfleger/copy/publications/all.html): More
Learning of Compositional Hierarchies for the Modeling of Context .. - Pfleger
(Correct)
Context Effects and Learning of Hierarchical Compositional.. - Pfleger
(Correct)
A Domain-Specific Software Architecture for.. - Hayes-Roth.. (1995)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC