Abstract: The existence, public availability, and widespread acceptance of a
standard benchmark for a given information retrieval (IR) task are beneficial
to research on this task, since they allow different researchers to experimentally
compare their own systems by comparing the results they have
obtained on this benchmark. The Reuters-21578 test collection, together
with its earlier variants, has been such a standard benchmark for the
text categorization (TC) task throughout the last ten years. However,
the benefits that this has brought about have somehow been limited
...
Similar documents based on text:
1.0: An Analysis of the Relative Difficulty of Reuters-21578 Subsets - Debole, Sebastiani
0.5: Supervised Term Weighting for Automated Text Categorization - Debole, Sebastiani (2002)
0.4: Text Categorization - Sebastiani (2005)
BibTeX entry:
Franca Debole and Fabrizio Sebastiani. An analysis of the relative hardness of Reuters-21578 subsets. In Proceedings of LREC-04, 4th International Conference on Language Resources and Evaluation, pages 971--974, Lisbon, PT, 2004. http://citeseer.ist.psu.edu/debole04analysis.html
@misc{ debole04analysis,
author = "F. Debole and F. Sebastiani",
title = "An analysis of the relative hardness of Reuters-21578 subsets",
text = "Franca Debole and Fabrizio Sebastiani. An analysis of the relative hardness
of Reuters-21578 subsets. In Proceedings of LREC-04, 4th International Conference
on Language Resources and Evaluation, pages 971--974, Lisbon, PT, 2004.",
year = "2004",
url = "citeseer.ist.psu.edu/debole04analysis.html" }
Citations (may not include all citations):
376: Text categorization with support vector machines: learning w.. - Joachims - 1998
268: Making large-scale SVM learning practical - Joachims - 1999
166: A re-examination of text categorization methods - Yang, Liu - 1999
140: A comparison of event models for naive Bayes text classifica.. - McCallum, Nigam - 1998
140: Text classification from labeled and unlabeled documents usi.. - Nigam, McCallum et al. - 2000
139: Machine learning in automated text categorization - Sebastiani - 2002
110: Training algorithms for linear text classifiers - Lewis, Schapire et al. - 1996
73: An evaluation of phrasal and clustered representations on a .. - Lewis - 1992
61: Department of Computer Science - Lewis, learning et al. - 1992
58: Distributional clustering of words for text classification - Baker, McCallum - 1998
52: Improving text retrieval for the routing problem using laten.. - Hull - 1994
35: Support vector machine active learning with applications to .. - Tong, Koller - 2001
19: A learner-independent evaluation of the usefulness of statis.. - Caropreso, Matwin et al. - 2001
17: A study on thresholding strategies for text categorization - Yang - 2001
12: A new family of online algorithms for category ranking - Crammer, Singer - 2002
[Article contains additional citations not shown here]
Documents on the same site (http://faure.iei.pi.cnr.it/~fabrizio/Publications/Publications.html):
Categorisation by Context - Attardi, Di Marco, Salvi, Sebastiani (1998)
A Note on Logic and Information Retrieval - Sebastiani (1996)
Incremental Knowledge Acquisition for Non-Monotonic Reasoning - Sebastiani, Straccia