| Alternate document: Details Improving Text Classification by Shrinkage in a Hierarchy of Classes (98) Andrew McCallum, Ronald Rosenfeld, Tom Mitchell, Andrew Y. Ng |
(Enter summary)
Abstract: When documents are organized in a large number of topic categories, the categories are often arranged in a hierarchy. The U.S. patent database and Yahoo are two examples. (Update)
Cited by: More
A Roadmap for Web Mining: - From Web To
(Correct)
The Trajectory Mixture Model for Learning Collections of .. - Shon, Baker, Grimes, Rao (2003)
(Correct)
Extracting Social Networks and Contact Information.. - Culotta, Bekkerman.. (2004)
(Correct)
Similar documents (at the sentence level):
62.6%: Improving Text Classification by Shrinkage in a.. - McCallum, Rosenfeld, .. (1998)
(Correct)
Active bibliography (related documents): More All
0.1: Towards a Comprehensive Topic Hierarchy for News - Boyapati (2000)
(Correct)
0.1: Context-Sensitive Modeling of Web-Surfing Behaviour using.. - Acharyya, Ghosh
(Correct)
0.0: New Techniques In Intelligent Information Filtering - Macskassy (2003)
(Correct)
Similar documents based on text: More All
0.3: Learning Hidden Markov Model Structure for Information Extraction - Seymore (1999)
(Correct)
0.3: Building Domain-Specific Search Engines with Machine .. - McCallum, Nigam.. (1999)
(Correct)
0.3: Learning to Classify Text from Labeled and Unlabeled.. - Nigam, McCallum, Thrun, .. (1998)
(Correct)
Related documents from co-citation: More All
24: Hierarchically classifying documents using very few words
- Koller, Sahami - 1997
15: Maximum Likelihood from Incomplete Data via the EM Algorithm (context) - Dempster, Laird et al. - 1977
14: Text categorization with Support Vector Machines: Learning with many relevant fe..
- Joachims - 1998
BibTeX entry: (Update)
Andrew McCallum, Ronald Rosenfeld, Tom Mitchell, and Andrew Ng. Improving text classification by shrinkage in a hierarchy of classes. In ICML-98, pages 359--367, 1998. http://citeseer.ist.psu.edu/mccallum98improving.html More
@inproceedings{ mccallum98improving,
author = "Andrew K. McCallum and Ronald Rosenfeld and Tom M. Mitchell and Andrew Y. Ng",
title = "Improving text classification by shrinkage in a hierarchy of classes",
booktitle = "Proceedings of {ICML}-98, 15th International Conference on Machine Learning",
publisher = "Morgan Kaufmann Publishers, San Francisco, US",
address = "Madison, US",
editor = "Jude W. Shavlik",
pages = "359--367",
year = "1998",
url = "citeseer.ist.psu.edu/mccallum98improving.html" }
Citations (may not include all citations):
2528
Maximum likelihood from incomplete data via the EM (context) - Dempster, Laird et al. - 1977
149
Interpolated estimation of markov source parameters from spa.. (context) - Jelinek, Mercer - 1980
135
Hierarchically classifying documents using very few words
- Koller, Sahami - 1997
130
A probabilistic analysis of the Rocchio algorithm with TFIDF..
- Joachims - 1997
116
Beyond independence: Conditions for the optimality of the si..
- Domingos, Pazzani - 1997
97
A comparison of two learning algorithms for text categorizat..
- Lewis, Ringuette - 1994
88
Bayes and Empirical Bayes Methods for Data Analysis (context) - Carlin, Louis - 1996
81
Developments in automatic text retrieval (context) - Salton - 1991
80
Learning to classify text from labeled and unlabeled documen..
- Nigam, McCallum et al. - 1998
64
A tree-based statistical language model for natural language.. (context) - Bahl, Brown et al. - 1989
45
Estimation with quadratic loss (context) - James, Stein - 1961
41
Feature selection in statistical learning of text categoriza.. (context) - Yang, Pederson - 1997
30
Inadmissibility of the usual estimator for the mean of a mul.. (context) - Stein - 1955
23
Using story topics for language model adaptation
- Seymore, Rosenfeld - 1997
20
An evaluation of statistical approaches to text categorizati.. (context) - Yang - 1997
16
Adaptive mixtures of probabilistic transducers
- Singer - 1997
6
Conditions for the equivalence of hierarchical and flat Baye.. (context) - Mitchell - 1998
The graph only includes citing articles where the year of publication is known.
Documents on the same site (http://www.cs.umass.edu/~mccallum/papers/): More
A Note on the Unification of Information Extraction and Data .. - McCallum, Jensen (2003)
(Correct)
Toward Conditional Models of Identity Uncertainty with.. - McCallum, Wellner (2003)
(Correct)
Employing EM in Pool-Based Active Learning for Text.. - McCallum, Nigam (1998)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC