Abstract:
Typically, decision tree construction algorithms apply a single "goodness of split " criterion to form each test node of the tree. It is a hypothesis of this research that better results can be obtained if during tree construction one applies a split criterion suited to the "location " of the test node in the tree. Specifically, given the objective of maximizing predictive accuracy, test nodes near the root of the tree should be chosen using a measure based on information theory, whereas test nodes closer to the leaves of the pruned tree should be chosen to maximize classification accuracy on the training set. The results of an empirical evaluation illustrate that adapting the split criterion to node location can improve classification performance.
Citations
|
2489
|
Induction of Decision Trees
– Quinlan
- 1986
|
|
2438
|
Classification and Regression Trees
– Breiman, Friedman, et al.
- 1984
|
|
335
|
Very simple classification rules perform well on most commonly used data sets
– Holte
- 1993
|
|
113
|
Overfitting avoidance as bias
– Schaffer
- 1993
|
|
111
|
On the handling of continuous-valued attributes in decision tree generation
– Fayyad, Irani
- 1992
|
|
103
|
Multivariate decision trees
– Brodley, Utgoff
- 1995
|
|
85
|
A further comparison of splitting rules for decision-tree induction
– Buntine, Niblett
- 1992
|
|
83
|
The Attribute Selection Problem in Decision Tree Generation
– Fayyad, Irani
- 1992
|
|
79
|
Reducing Misclassification Costs
– Pazzani, Merz, et al.
- 1994
|
|
56
|
Survey of Decision Tree Classifier Methodology
– Safavian, Landgrebe
- 1991
|
|
34
|
International application of a new probability algorithm for the diagnosis of coronary artery disease
– Detrano, Janosi, et al.
- 1989
|
|
5
|
Building robust learning systems by computing induction and optimization
– Tcheng, Lambert, et al.
- 1989
|
|
3
|
Constructing optimal binary decision trees is np-complete
– Hyafile, Rivest
- 1976
|
|
1
|
The importance of attribute selection in decision tree induction
– Liu
- 1994
|