@MISC{Liang_decisiontrees, author = {Jia Liang}, title = {Decision Trees and Attribute-Oriented Knowledge Discovery in Databases}, year = {} }
Bookmark
OpenURL
Abstract
We compare two types of knowledge representations for data mining: decision trees and knowledge rules, and three mining methods. We show that the decision tree representation and the knowledge rules representation are semantically equivalent. The ID3 algorithm by Quinlan has limited generalization power because it uses only a very simple generalization function of partial attribute removal type. ID3 may output too many knowledge rules. Quinlan's production rule generators use attribute removal functions that are more powerful than ID3. The HCC-algorithm by Han, Cai, and Cercone uses both attribute removal and concept tree ascension as its generalization functions. HCC-algorithm normally has more generalization power than the other two. HCCalgorithm also uses thresholds to control the output complexity. On computational efficiency, HCCalgorithm is the most efficient, ID3 is the next, while Quinlan's production rule generators are less efficient. We also point out some disadvantages of H...