|
4364
|
Elements of Information Theory
– Cover, Thomas
- 1991
|
|
1083
|
Introduction to Kolmogorov Complexity and Its Applications
– Li, Vitanyi
- 1993
|
|
718
|
Pattern recognition and neural networks
– Ripley
- 1996
|
|
512
|
A comparative study on feature selection in text categorization
– Yang, Pedersen
- 1997
|
|
369
|
Stochastic Complexity
– Rissanen
- 1989
|
|
261
|
Concrete mathematics : a foundation for computer science
– Graham, Knuth, et al.
- 1989
|
|
225
|
Naive (bayes) at forty: The independence assumption in information retrieval
– Lewis
- 1998
|
|
178
|
Divergence Measures Based on the Shannon Entropy
– Lin
- 1991
|
|
152
|
Distributional Clustering of Words for Text Classification
– Baker, McCallum
- 1998
|
|
111
|
A.S.Weigend. A neural network approach to topic spotting
– Wiener
- 1995
|
|
110
|
A toolkit for statistical language modeling, text retrieval, classification and clustering. http://www.cs.cmu.edu/ mccallum/bow
– Bow
- 1996
|
|
96
|
Reuters-21578 text categorization test collection distribution 1.0. http://www.research.att.com/∼lewis
– Lewis
- 1999
|
|
89
|
Document Clustering Using Word Clusters via the Information Bottleneck Method
– Slonim, Tishby
- 2000
|
|
85
|
M.E.: Computer evaluation of indexing and text processing
– Salton, Lesk
- 1968
|
|
80
|
Agglomerative Information Bottleneck
– Slonim, Tishby
- 1999
|
|
77
|
Baeza-Yates and Berthier A. Ribeiro-Neto. Modern Information Retrieval
– Ricardo
- 1999
|
|
73
|
Natural Language Processing for Information Retrieval
– Lewis, Jones
- 1996
|
|
72
|
Feature selection and feature extraction for text categorization
– Lewis
- 1992
|
|
64
|
Using latent semantic analysis to improve access to textual information
– DUMAIS, FURNAS, et al.
- 1988
|
|
52
|
The minimum description length principle and reasoning under uncertainty
– Grunwald
- 1998
|
|
44
|
Principal Component Analysis
– Jollie
- 1986
|
|
37
|
Text categorization: a symbolic approach
– Moulinier, Ra˘skinis, et al.
- 1996
|
|
25
|
Present position and potential developments: some personal views, statistical theory, the prequential approach
– Dawid
- 1984
|
|
22
|
A minimum description length approach to grammar inference
– Grunwald
- 1996
|
|
20
|
Text Representation for Intelligent Text Retrieval: A Classification-Oriented View
– Lewis
- 1992
|
|
13
|
A.Eikvil, Text Categorization: A survey
– Aas
- 1999
|
|
10
|
A minimal encoding approach to feature discovery
– Derthick
- 1991
|
|
7
|
Cross-validation methods
– Browne
- 2000
|
|
7
|
Arti Intelligence - A modern Approach
– Russel, Peter
- 1995
|
|
4
|
Symbolic, Connectionist, and Statistical Approaches to Learning for Natural Language Processing
– Wermter, Riloff
- 1996
|
|
2
|
J.Sheinvald. Feature selection for classi using the MDL principle
– Dom, Niblack
- 1989
|
|
2
|
On bias, viariance 0/1 loss, and the curse-of-dimensionality. Data Mining and Knowledge Discovery
– Friedman
- 1997
|
|
2
|
On optimal number of features in classi
– Rissanen
- 1988
|
|
2
|
Over using the MDL Principle
– Verbeek
- 1998
|
|
1
|
On the optimality of the simple baysian classi under zero-one loss
– Domingos, Pazzani
- 1997
|
|
1
|
20 newsgroups data set. fetched May 24
– Lang
|
|
1
|
Human Behavor and the Principle of Least Eort, an Introduction to Human Ecology
– Zipf
- 1949
|