See this document in CiteSeerX!

An Analysis of the Relative Hardness of Reuters-21578 Subsets (2004)  (Make Corrections)  (3 citations)
Franca Debole, Fabrizio Sebastiani



  Home/Search   Context   Related

 
View or download:
faure.iei.pi.cnr.it/~fab...JASIST04.pdf
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  faure.iei.pi.cnr.i...Publications (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: benchmark for a given information retrieval (IR) task are beneficial to research on this task, since they allow di#erent researchers to experimentally compare their own systems by comparing the results they have obtained on this benchmark. The Reuters-21578 test collection, together with its earlier variants, has been such a standard benchmark for the text categorization (TC) task throughout the last ten years. However, the benefits that this has brought about have somehow been limited ... (Update)

Similar documents based on text:   More   All
1.0:   An Analysis of the Relative Difficulty of Reuters-21578 Subsets - Debole, Sebastiani   (Correct)
0.5:   Supervised Term Weighting for Automated Text Categorization - Debole, Sebastiani (2002)   (Correct)
0.4:   Text Categorization - Sebastiani (2005)   (Correct)

BibTeX entry:   (Update)

Franca Debole and Fabrizio Sebastiani. An analysis of the relative hardness of Reuters-21578 subsets. In Proceedings of LREC-04, 4th International Conference on Language Resources and Evaluation, pages 971--974, Lisbon, PT, 2004. http://citeseer.ist.psu.edu/debole04analysis.html   More

@misc{ debole04analysis,
  author = "F. Debole and F. Sebastiani",
  title = "An analysis of the relative hardness of Reuters-21578 subsets",
  text = "Franca Debole and Fabrizio Sebastiani. An analysis of the relative hardness
    of Reuters-21578 subsets. In Proceedings of LREC-04, 4th International Conference
    on Language Resources and Evaluation, pages 971--974, Lisbon, PT, 2004.",
  year = "2004",
  url = "citeseer.ist.psu.edu/debole04analysis.html" }
Citations (may not include all citations):
376   Text categorization with support vector machines: learning w.. - Joachims - 1998
268   Making large-scale SVM learning practical - Joachims - 1999
166   A re-examination of text categorization methods - Yang, Liu - 1999
140   A comparison of event models for naive Bayes text classifica.. - McCallum, Nigam - 1998
140   Text classification from labeled and unlabeled documents usi.. - Nigam, McCallum et al. - 2000
139   Machine learning in automated text categorization - Sebastiani - 2002
110   Training algorithms for linear text classifiers - Lewis, Schapire et al. - 1996
73   An evaluation of phrasal and clustered representations on a .. (context) - Lewis - 1992
61   Department of Computer Science (context) - Lewis, learning et al. - 1992
58   Distributional clustering of words for text classification - Baker, McCallum - 1998
52   Improving text retrieval for the routing problem using laten.. (context) - Hull - 1994
35   Support vector machine active learning with applications to .. - Tong, Koller - 2001
19   A learner-independent evaluation of the usefulness of statis.. - Caropreso, Matwin et al. - 2001
17   A study on thresholding strategies for text categorization - Yang - 2001
12   A new family of online algorithms for category ranking - Crammer, Singer - 2002

[Article contains additional citations not shown here]

Documents on the same site (http://faure.iei.pi.cnr.it/~fabrizio/Publications/Publications.html):   More
Categorisation by Context - Attardi, Di Marco, Salvi, Sebastiani (1998)   (Correct)
A Note on Logic and Information Retrieval - Sebastiani (1996)   (Correct)
Incremental Knowledge Acquisition for Non-Monotonic Reasoning - Sebastiani, Straccia   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC