Results 1 -
4 of
4
Feature Selection using Improved Mutual Information for Text Classification
- of Lecture Notes in Computer Science
, 2004
"... Abstract. A major characteristic of text document classification problem is extremely high dimensionality of text data. In this paper we present two algorithms for feature (word) selection for the purpose of text classification. We used sequential forward selection methods based on improved mutual i ..."
Abstract
-
Cited by 8 (0 self)
- Add to MetaCart
Abstract. A major characteristic of text document classification problem is extremely high dimensionality of text data. In this paper we present two algorithms for feature (word) selection for the purpose of text classification. We used sequential forward selection methods based on improved mutual information introduced by Battiti [1] and Kwak and Choi [6] for non-textual data. These feature evaluation functions take into consideration how features work together. The performance of these evaluation functions compared to the information gain which evaluate features individually is discussed. We present experimental results using naive Bayes classifier based on multinomial model on the Reuters data set. Finally, we analyze the experimental results from various perspectives, including F1-measure, precision and recall. Preliminary experimental results indicate the effectiveness of the proposed feature selection algorithms in a text classification problem. 1
Information theoretic feature extraction for audio-visual speech recognition
- IEEE Transactions on Signal Processing
, 2009
"... Abstract—The problem of feature selection has been thoroughly analyzed in the context of pattern classification, with the purpose of avoiding the curse of dimensionality. However, in the context of multimodal signal processing, this problem has been studied less. Our approach to feature extraction i ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
Abstract—The problem of feature selection has been thoroughly analyzed in the context of pattern classification, with the purpose of avoiding the curse of dimensionality. However, in the context of multimodal signal processing, this problem has been studied less. Our approach to feature extraction is based on information theory, with an application on multimodal classification, in particular audio–visual speech recognition. Contrary to previous work in information theoretic feature selection applied to multimodal signals, our proposed methods penalize features for their redundancy, achieving more compact feature sets and better performance. We propose two greedy selection algorithms, one that penalizes a proportion of feature redundancy, while the other uses conditional mutual information as an evaluation measure, for the selection of visual features for audio–visual speech recognition. Our features perform better than linear discriminant analysis, the most usual transform for dimensionality reduction in the field, across a wide range of dimensionality values and combined with audio at different quality levels. Index Terms—Audio–visual speech recognition, feature selection, mutual information. I.
A significance test-based feature selection method for the detection of prostate cancer from proteomic patterns
, 2004
"... I hereby declare that I am the sole author of this thesis. This is a true copy of the thesis, including any required final revisions, as accepted by my examiners. I understand that my thesis may be made electronically available to the public. ii The work reported in the thesis consists of two parts. ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
I hereby declare that I am the sole author of this thesis. This is a true copy of the thesis, including any required final revisions, as accepted by my examiners. I understand that my thesis may be made electronically available to the public. ii The work reported in the thesis consists of two parts. One part is concerned with the development of a feature selection method based on statistical significance test, which can be generally used in any supervised pattern classification. The other part applies this proposed feature selection method to conduct proteomic pattern analysis for prostate cancer detection. For a given classification problem, we need to determine a set of relevant features to generate a classifier. In real-world problems, many features in initial feature set are usually irrelevant to the classification task and redundant with each other, which will increase the computational complexity and reduce the recognition rate. The task of feature selection is to choose a small feature subset in order to achieve better classification performance. As such,

