MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  V.: Learning decision tree classifiers from attribute value taxonomies and partially specified data (2003) [8 citations — 4 self]

Download:
Download as a PDF
by Jun Zhang, Vasant Honavar
In the Twentieth International Conference on Machine Learning (ICML 2003
http://www.cs.iastate.edu/~honavar/Papers/icml-zhang03.pdf
Add To MetaCart

Abstract:

We consider the problem of learning to classify partially specified instances i.e., instances that are described in terms of attribute values at different levels of precision, using user-supplied attribute value taxonomies (AVT). We formalize the problem of learning from AVT and data and present an AVT-guided decision tree learning algorithm (AVT-DTL) to learn classification rules at multiple levels of abstraction. The proposed approach generalizes existing techniques for dealing with missing values to handle instances with partially missing values. We present experimental results that demonstrate that AVT-DTL is able to effectively learn robust high accuracy classifiers from partially specified examples. Our experiments also demonstrate that the use of AVT-DTL outperforms standard decision tree algorithm (C4.5 and its variants) when applied to data with missing attribute values; and produces substantially more compact decision trees than those obtained by standard approach. 1.

Citations

3215 C4.5: Programs for Machine Learning – Quinlan - 1993
402 Distributional clustering of English words – Pereira, Tishby, et al. - 1993
51 Exploration of the power of attribute-oriented induction in data mining – Han, Fu - 1996
48 Resolving database incompatibility: An approach to performing relational operations over mismatched domains – DeMichiel - 1989
21 Evaluating aggregate operations over imprecise data – Chen, Chiu, et al. - 1996
9 On Handling Tree-Structured Attributes – Almuallim, Akiba, et al. - 1995
7 Ontology-Driven Induction of Decision Trees at Multiple Levels of Abstraction – Zhang, Silvescu, et al. - 2002
6 Using feature hierarchies in bayesian network learning – desJardins, Getoor, et al. - 2000
6 Learning Hierarchies from Ambiguous Natural Language Data – Yamazaki, Pazzani, et al. - 1995
2 An Efficient Rule-based Attribute-Oriented Induction for Data Mining – Cheung, Hwang, et al. - 2000
1 Aggregation of Imprecise and Uncertain Information in Databases – Núñez - 1991