MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

 

Download:
pdf | ps
by J. J. Verbeek
http://www.illc.uva.nl/Publications/Lists/../ResearchReports/MoL-2000-03.text.ps.gz
Add To MetaCart

Abstract:

An information theoretic approach to nding word groups for text classication

Citations

4364 Elements of Information Theory – Cover, Thomas - 1991
1083 Introduction to Kolmogorov Complexity and Its Applications – Li, Vitanyi - 1993
718 Pattern recognition and neural networks – Ripley - 1996
512 A comparative study on feature selection in text categorization – Yang, Pedersen - 1997
369 Stochastic Complexity – Rissanen - 1989
261 Concrete mathematics : a foundation for computer science – Graham, Knuth, et al. - 1989
225 Naive (bayes) at forty: The independence assumption in information retrieval – Lewis - 1998
178 Divergence Measures Based on the Shannon Entropy – Lin - 1991
152 Distributional Clustering of Words for Text Classification – Baker, McCallum - 1998
111 A.S.Weigend. A neural network approach to topic spotting – Wiener - 1995
110 A toolkit for statistical language modeling, text retrieval, classification and clustering. http://www.cs.cmu.edu/ mccallum/bow – Bow - 1996
96 Reuters-21578 text categorization test collection distribution 1.0. http://www.research.att.com/∼lewis – Lewis - 1999
89 Document Clustering Using Word Clusters via the Information Bottleneck Method – Slonim, Tishby - 2000
85 M.E.: Computer evaluation of indexing and text processing – Salton, Lesk - 1968
80 Agglomerative Information Bottleneck – Slonim, Tishby - 1999
77 Baeza-Yates and Berthier A. Ribeiro-Neto. Modern Information Retrieval – Ricardo - 1999
73 Natural Language Processing for Information Retrieval – Lewis, Jones - 1996
72 Feature selection and feature extraction for text categorization – Lewis - 1992
64 Using latent semantic analysis to improve access to textual information – DUMAIS, FURNAS, et al. - 1988
52 The minimum description length principle and reasoning under uncertainty – Grunwald - 1998
44 Principal Component Analysis – Jollie - 1986
37 Text categorization: a symbolic approach – Moulinier, Ra˘skinis, et al. - 1996
25 Present position and potential developments: some personal views, statistical theory, the prequential approach – Dawid - 1984
22 A minimum description length approach to grammar inference – Grunwald - 1996
20 Text Representation for Intelligent Text Retrieval: A Classification-Oriented View – Lewis - 1992
13 A.Eikvil, Text Categorization: A survey – Aas - 1999
10 A minimal encoding approach to feature discovery – Derthick - 1991
7 Cross-validation methods – Browne - 2000
7 Arti Intelligence - A modern Approach – Russel, Peter - 1995
4 Symbolic, Connectionist, and Statistical Approaches to Learning for Natural Language Processing – Wermter, Riloff - 1996
2 J.Sheinvald. Feature selection for classi using the MDL principle – Dom, Niblack - 1989
2 On bias, viariance 0/1 loss, and the curse-of-dimensionality. Data Mining and Knowledge Discovery – Friedman - 1997
2 On optimal number of features in classi – Rissanen - 1988
2 Over using the MDL Principle – Verbeek - 1998
1 On the optimality of the simple baysian classi under zero-one loss – Domingos, Pazzani - 1997
1 20 newsgroups data set. fetched May 24 – Lang
1 Human Behavor and the Principle of Least Eort, an Introduction to Human Ecology – Zipf - 1949