(Enter summary)
Abstract: Real-world applications of text categorization often require
a system to deal with tens of thousands of categories defined
over a large taxonomy. This paper addresses the problem
with respect to a set of popular algorithms in text categorization,
including Support Vector Machines, k-nearest
neighbor, ridge regression, linear least square fit and logistic
regression. By providing a formal analysis of the computational
complexity of each classification method, followed
by an investigation on the... (Update)
Cited by: More
Text Categorization - Sebastiani (2005)
(Correct)
Active bibliography (related documents): More All
0.9: A Scalability Analysis of Classifiers in Text Categorization - Yang, Zhang, Kisiel (2003)
(Correct)
0.4: Classification Techniques for Categorization of Hypertext Documents - Arumugam
(Correct)
0.2: Margin-based Local Regression for Adaptive Filtering - Yiming Yang Bryan (2003)
(Correct)
Similar documents based on text: More All
0.4: Robustness of Regularized Linear Classification Methods in Text .. - Zhang, Yang (2003)
(Correct)
0.3: Probabilistic Score Estimation with Piecewise Logistic Regression - Zhang, Yang
(Correct)
0.3: Robustness of Regularized Linear Classification Methods - In Text Categorization
(Correct)
BibTeX entry: (Update)
Yang, Y., A scalability analysis of classifiers in text categorization. Proceedings of SIGIR-03, 26th ACM International Conference on Research and Development in Information Retrieval,ACM Press, New York, US: Toronto, CA, 2003. http://citeseer.ist.psu.edu/article/yang03scalability.html More
@misc{ yang03scalability,
author = "Y. Yang",
title = "A scalability analysis of classifiers in text categorization",
text = "Yang, Y., A scalability analysis of classifiers in text categorization.
Proceedings of SIGIR-03, 26th ACM International Conference on Research and
Development in Information Retrieval,ACM Press, New York, US: Toronto, CA,
2003.",
year = "2003",
url = "citeseer.ist.psu.edu/article/yang03scalability.html" }
Citations (may not include all citations):
2441
Johns Hopkins University Press (context) - Golub, Loan et al. - 1996
947
Statistical Learning Theory (context) - Vapnik - 1998
376
Text Categorization with Support Vector Machines: Learning w..
- Joachims - 1998
375
On power-law relationships of the internet topology
- Faloutsos, Faloutsos et al. - 1999
215
A comparative study on feature selection in text categorizat..
- Yang, Pedersen - 1997
166
A re-examination of text categorization methods
- Yang, Liu - 1999
149
An evaluation of statistical approaches to text categorizati..
- Yang - 1999
127
Lanczos algorithm for large symmetric eigenvalue computation.. (context) - Cullum, Willoughby - 1985
112
An improved training algorithm for support vector machines
- Osuna, Girosi - 1997
110
Training algorithms for linear text classifiers
- Lewis, Schapire et al. - 1996
40
Large-scale singular value computations (context) - Berry - 1992
29
Mechanisms of skill acquisition and the law of practice (context) - Newell, Rosenbloom - 1981
29
A study of approaches to hypertext categorization
- Yang, Slattery et al. - 2002
25
Noise reduction in a statistical approach to text categoriza..
- Yang - 1995
24
Text categorization based on regularized linear classificati..
- Zhang, Oles - 2001
21
cient learning from human decisions in text categorization a.. (context) - Yang, ective - 1994
18
The Maximum-Margin Approach to Learning Text Classifiers: Me..
- Joachims - 2000
10
Microsoft cambridge at trec
- Robertson, Walker - 2001
6
A loss function analysis for classification methods in text ..
- Li, Yang - 2003
6
The reuters corpus volume i as a text categorization test co.. (context) - Lewis, Li et al. - 2003
6
Sequetial minimal optimization: A fast algorithm for trainin.. (context) - Platt - 1998
Documents on the same site (http://www-2.cs.cmu.edu/~yiming/publications.html): More
High-Performing Feature Selection for Text Classification - Rogati, Yang (2002)
(Correct)
Modified Logistic Regression: An Approximation to SVM.. - Zhang, Jin, Yang.. (2003)
(Correct)
A Study of Approaches to Hypertext Categorization - Yang, Slattery, Ghani (2002)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC