Alternate document:   Details   Improving Text Classification by Shrinkage in a Hierarchy of Classes (98) Andrew McCallum, Ronald Rosenfeld, Tom Mitchell, Andrew Y. Ng

See this document in CiteSeerX!

Improving Text Classification by Shrinkage in a Hierarchy of Classes (1998)  (Make Corrections)  (61 citations)
Andrew McCallum, Ronald Rosenfeld, Tom Mitchell, Andrew Ng
Proceedings of ICML-98, 15th International Conference on Machine Learning



  Home/Search   Context   Related

 
View or download:
umass.edu/~mccallu...hiericml98s.ps.gz
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  umass.edu/~mccallum/papers/ (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: When documents are organized in a large number of topic categories, the categories are often arranged in a hierarchy. The U.S. patent database and Yahoo are two examples. (Update)

Cited by:   More
A Roadmap for Web Mining: - From Web To   (Correct)
The Trajectory Mixture Model for Learning Collections of .. - Shon, Baker, Grimes, Rao (2003)   (Correct)
Extracting Social Networks and Contact Information.. - Culotta, Bekkerman.. (2004)   (Correct)

Similar documents (at the sentence level):
62.6%:   Improving Text Classification by Shrinkage in a.. - McCallum, Rosenfeld, .. (1998)   (Correct)

Active bibliography (related documents):   More   All
0.1:   Towards a Comprehensive Topic Hierarchy for News - Boyapati (2000)   (Correct)
0.1:   Context-Sensitive Modeling of Web-Surfing Behaviour using.. - Acharyya, Ghosh   (Correct)
0.0:   New Techniques In Intelligent Information Filtering - Macskassy (2003)   (Correct)

Similar documents based on text:   More   All
0.3:   Learning Hidden Markov Model Structure for Information Extraction - Seymore (1999)   (Correct)
0.3:   Building Domain-Specific Search Engines with Machine .. - McCallum, Nigam.. (1999)   (Correct)
0.3:   Learning to Classify Text from Labeled and Unlabeled.. - Nigam, McCallum, Thrun, .. (1998)   (Correct)

Related documents from co-citation:   More   All
24:   Hierarchically classifying documents using very few words - Koller, Sahami - 1997
15:   Maximum Likelihood from Incomplete Data via the EM Algorithm (context) - Dempster, Laird et al. - 1977
14:   Text categorization with Support Vector Machines: Learning with many relevant fe.. - Joachims - 1998

BibTeX entry:   (Update)

Andrew McCallum, Ronald Rosenfeld, Tom Mitchell, and Andrew Ng. Improving text classification by shrinkage in a hierarchy of classes. In ICML-98, pages 359--367, 1998. http://citeseer.ist.psu.edu/mccallum98improving.html   More

@inproceedings{ mccallum98improving,
    author = "Andrew K. McCallum and Ronald Rosenfeld and Tom M. Mitchell and Andrew Y. Ng",
    title = "Improving text classification by shrinkage in a hierarchy of classes",
    booktitle = "Proceedings of {ICML}-98, 15th International Conference on Machine Learning",
    publisher = "Morgan Kaufmann Publishers, San Francisco, US",
    address = "Madison, US",
    editor = "Jude W. Shavlik",
    pages = "359--367",
    year = "1998",
    url = "citeseer.ist.psu.edu/mccallum98improving.html" }
Citations (may not include all citations):
2528   Maximum likelihood from incomplete data via the EM (context) - Dempster, Laird et al. - 1977
149   Interpolated estimation of markov source parameters from spa.. (context) - Jelinek, Mercer - 1980
135   Hierarchically classifying documents using very few words - Koller, Sahami - 1997
130   A probabilistic analysis of the Rocchio algorithm with TFIDF.. - Joachims - 1997
116   Beyond independence: Conditions for the optimality of the si.. - Domingos, Pazzani - 1997
97   A comparison of two learning algorithms for text categorizat.. - Lewis, Ringuette - 1994
88   Bayes and Empirical Bayes Methods for Data Analysis (context) - Carlin, Louis - 1996
81   Developments in automatic text retrieval (context) - Salton - 1991
80   Learning to classify text from labeled and unlabeled documen.. - Nigam, McCallum et al. - 1998
64   A tree-based statistical language model for natural language.. (context) - Bahl, Brown et al. - 1989
45   Estimation with quadratic loss (context) - James, Stein - 1961
41   Feature selection in statistical learning of text categoriza.. (context) - Yang, Pederson - 1997
30   Inadmissibility of the usual estimator for the mean of a mul.. (context) - Stein - 1955
23   Using story topics for language model adaptation - Seymore, Rosenfeld - 1997
20   An evaluation of statistical approaches to text categorizati.. (context) - Yang - 1997
16   Adaptive mixtures of probabilistic transducers - Singer - 1997
6   Conditions for the equivalence of hierarchical and flat Baye.. (context) - Mitchell - 1998



The graph only includes citing articles where the year of publication is known.


Documents on the same site (http://www.cs.umass.edu/~mccallum/papers/):   More
A Note on the Unification of Information Extraction and Data .. - McCallum, Jensen (2003)   (Correct)
Toward Conditional Models of Identity Uncertainty with.. - McCallum, Wellner (2003)   (Correct)
Employing EM in Pool-Based Active Learning for Text.. - McCallum, Nigam (1998)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC