See this document in CiteSeerX!

A Scalability Analysis of Classifiers in Text Categorization (2003)  (Make Corrections)  (1 citation)
Yiming Yang Carnegie Mellon University 5000 Forbes Avenue Pittsburgh, PA...



  Home/Search   Context   Related

 
View or download:
cmu.edu/~yiming/papers....sigir03.ps.gz
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  cmu.edu/~yiming/publications (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: Real-world applications of text categorization often require a system to deal with tens of thousands of categories defined over a large taxonomy. This paper addresses the problem with respect to a set of popular algorithms in text categorization, including Support Vector Machines, k-nearest neighbor, ridge regression, linear least square fit and logistic regression. By providing a formal analysis of the computational complexity of each classification method, followed by an investigation on the... (Update)

Cited by:   More
Text Categorization - Sebastiani (2005)   (Correct)

Active bibliography (related documents):   More   All
0.9:   A Scalability Analysis of Classifiers in Text Categorization - Yang, Zhang, Kisiel (2003)   (Correct)
0.4:   Classification Techniques for Categorization of Hypertext Documents - Arumugam   (Correct)
0.2:   Margin-based Local Regression for Adaptive Filtering - Yiming Yang Bryan (2003)   (Correct)

Similar documents based on text:   More   All
0.4:   Robustness of Regularized Linear Classification Methods in Text .. - Zhang, Yang (2003)   (Correct)
0.3:   Probabilistic Score Estimation with Piecewise Logistic Regression - Zhang, Yang   (Correct)
0.3:   Robustness of Regularized Linear Classification Methods - In Text Categorization   (Correct)

BibTeX entry:   (Update)

Yang, Y., A scalability analysis of classifiers in text categorization. Proceedings of SIGIR-03, 26th ACM International Conference on Research and Development in Information Retrieval,ACM Press, New York, US: Toronto, CA, 2003. http://citeseer.ist.psu.edu/article/yang03scalability.html   More

@misc{ yang03scalability,
  author = "Y. Yang",
  title = "A scalability analysis of classifiers in text categorization",
  text = "Yang, Y., A scalability analysis of classifiers in text categorization.
    Proceedings of SIGIR-03, 26th ACM International Conference on Research and
    Development in Information Retrieval,ACM Press, New York, US: Toronto, CA,
    2003.",
  year = "2003",
  url = "citeseer.ist.psu.edu/article/yang03scalability.html" }
Citations (may not include all citations):
2441   Johns Hopkins University Press (context) - Golub, Loan et al. - 1996
947   Statistical Learning Theory (context) - Vapnik - 1998
376   Text Categorization with Support Vector Machines: Learning w.. - Joachims - 1998
375   On power-law relationships of the internet topology - Faloutsos, Faloutsos et al. - 1999
215   A comparative study on feature selection in text categorizat.. - Yang, Pedersen - 1997
166   A re-examination of text categorization methods - Yang, Liu - 1999
149   An evaluation of statistical approaches to text categorizati.. - Yang - 1999
127   Lanczos algorithm for large symmetric eigenvalue computation.. (context) - Cullum, Willoughby - 1985
112   An improved training algorithm for support vector machines - Osuna, Girosi - 1997
110   Training algorithms for linear text classifiers - Lewis, Schapire et al. - 1996
40   Large-scale singular value computations (context) - Berry - 1992
29   Mechanisms of skill acquisition and the law of practice (context) - Newell, Rosenbloom - 1981
29   A study of approaches to hypertext categorization - Yang, Slattery et al. - 2002
25   Noise reduction in a statistical approach to text categoriza.. - Yang - 1995
24   Text categorization based on regularized linear classificati.. - Zhang, Oles - 2001
21   cient learning from human decisions in text categorization a.. (context) - Yang, ective - 1994
18   The Maximum-Margin Approach to Learning Text Classifiers: Me.. - Joachims - 2000
10   Microsoft cambridge at trec - Robertson, Walker - 2001
6   A loss function analysis for classification methods in text .. - Li, Yang - 2003
6   The reuters corpus volume i as a text categorization test co.. (context) - Lewis, Li et al. - 2003
6   Sequetial minimal optimization: A fast algorithm for trainin.. (context) - Platt - 1998

Documents on the same site (http://www-2.cs.cmu.edu/~yiming/publications.html):   More
High-Performing Feature Selection for Text Classification - Rogati, Yang (2002)   (Correct)
Modified Logistic Regression: An Approximation to SVM.. - Zhang, Jin, Yang.. (2003)   (Correct)
A Study of Approaches to Hypertext Categorization - Yang, Slattery, Ghani (2002)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC