See this document in CiteSeerX!

Hierarchical Classification of Web Content (2000)  (Make Corrections)  (36 citations)
Susan Dumais, Hao Chen
Proceedings of SIGIR-00, 23rd ACM International Conference on Research and Development in Information Retrieval



  Home/Search   Context   Related

 
View or download:
berkeley.edu/~hchen/publi...sigir00.pdf
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  berkeley.edu/~hchen...publication (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: This paper explores the use of hierarchical structure for classifying a large, heterogeneous collection of web content. The hierarchical structure is initially used to train different second-level classifiers. In the hierarchical case, a model is learned to distinguish a second-level category from other categories within the same top level. In the flat non-hierarchical case, a model distinguishes a second-level category from all other second-level categories. Scoring rules can further take... (Update)

Cited by:   More
A Personalized Collaborative Digital Library Environment: a.. - Renda, Straccia (2002)   (Correct)
BINGO! and DAFFODIL: Personalized Exploration of Digital.. - Theobald, Klas   (Correct)
The Organisation and Retrieval of Document Collections: A.. - Vinokourov (2003)   (Correct)

Active bibliography (related documents):   More   All
0.4:   Predicting Library of Congress Classifications from Library.. - Frank, Paynter (2004)   (Correct)
0.4:   The Effect of Using Hierarchical Classifiers in Text.. - D'Alessio, Murray.. (2000)   (Correct)
0.3:   Combining Machine Learning and Hierarchical Structures for Text.. - Ruiz (2001)   (Correct)

Similar documents based on text:   More   All
0.2:   Cognitive Science 27 (2003) 491--524 - Data-Driven Approaches To   (Correct)
0.2:   Model Checking One Million Lines of C Code - Hao Chen Drew (2004)   (Correct)
0.1:   Evaluation of Decision Forests on Text Categorization - Chen, Ho (2000)   (Correct)

Related documents from co-citation:   More   All
13:   Text categorization with Support Vector Machines: Learning with many relevant fe.. - Joachims - 1998
12:   Inductive learning algorithms and representations for text categorization (context) - Dumais, Platt et al. - 1998
11:   Machine learning in automated text categorization - Sebastiani - 1999

BibTeX entry:   (Update)

S. T. Dumais and H. Chen. Hierarchical classification of web content. In Proc. of the 23rd Int'l ACM Conf. on Research and Development in Information Retrieval (SIGIR), pages 256--263, Athens, Greece, August 2000. http://citeseer.ist.psu.edu/dumais00hierarchical.html   More

@inproceedings{ dumais00hierarchical,
    author = "Susan T. Dumais and Hao Chen",
    title = "Hierarchical classification of {W}eb content",
    booktitle = "Proceedings of {SIGIR}-00, 23rd {ACM} International Conference on Research and Development in Information Retrieval",
    publisher = "ACM Press, New York, US",
    address = "Athens, GR",
    editor = "Nicholas J. Belkin and Peter Ingwersen and Mun-Kew Leong",
    pages = "256--263",
    year = "2000",
    url = "citeseer.ist.psu.edu/dumais00hierarchical.html" }
Citations (may not include all citations):
2319   Elements of Information Theory (context) - Cover, Thomas - 1991
1291   The Nature of Statistical Learning Theory (context) - Vapnik - 1995
376   Text categorization with support vector machines: Learning w.. - Joachims - 1998
215   A comparative study on feature selection in text categorizat.. - Yang, Pedersen - 1997
191   Fast training of support vector machines using sequential mi.. (context) - Platt - 1999
166   A re-examination of text categorization methods - Yang, Lui - 1999
135   Hierarchically classifying documents using very few words - Koller, Sahami - 1997
120   Inductive learning algorithms and representations for text c.. (context) - Dumais, Platt et al. - 1998
110   Context-sensitive learning methods for text categorization P.. - Cohen, Singer - 1996
97   A comparison of two learning algorithms for text categorizat.. - Lewis, Ringuette - 1994
79   Web document clustering: A feasibility demonstration - Zamir, Etzioni - 1998
63   Automated learning of decision rules for text categorization - Apte, Damerau et al. - 1994
61   Improving text classification by shrinkage in a hierarchy of.. - McCallum, Rosenfeld et al. - 1998
59   Reexamining the cluster hypothesis: Scatter/Gather on retrie.. - Hearst, Pedersen - 1996
57   A comparison of classifiers and document representations for.. - Schtze, Hull et al. - 1995
55   Expert network: Effective and efficient learning from human .. (context) - Yang - 1994
28   Bringing order to the web: Automatically categorizing search.. (context) - Chen, Dumais - 2000
25   classification and signature generation for organizing large.. (context) - Chakrabarti, Dom et al. - 1998
18   Feature selection for classification based on text hierarchy - Mladenic, Grobelnik - 1998
17   Enhancing the usability of text through computer delivery an.. (context) - Landauer, Egan et al. - 1993
9   Exploiting hierarchy in text categorization (context) - Weigend, Wiener et al. - 1999
7   Hierarchical neural networks for text categorization - Ruiz, Srinivasan - 1999
3   Category levels in hierarchical text categorization (context) - D'Alessio, Murray et al. - 1998
3   Some issues in the automatic classification of U (context) - Larkey - 1998
3   CONSTRUE: A System for Content-Based Indexing of a Database .. (context) - Hayes, Weinstein - 1990
3   Searching and browsing text collections with large category .. (context) - Hearst, Karadi - 1997
1   and Low (context) - Ng, Goh - 1997
1   A rule-based multi-stage indexing system for lage subject fi.. (context) - Fuhr, Hartmanna et al. - 1991



The graph only includes citing articles where the year of publication is known.


Documents on the same site (http://www.cs.berkeley.edu/~hchen/publication/publication.html):   More
Emu: An E-Mail Preprocessor For Text-To-Speech - Sproat Hu Bell   (Correct)
Piecewise Linear Modulation Model of Handwriting - Chen, Agazzi, Suen (1997)   (Correct)
Evaluation of Decision Forests on Text Categorization - Chen, Ho (2000)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC