A Scalability Analysis of Classifiers in Text Categorization (2003)
Yiming Yang Carnegie Mellon University 5000 Forbes Avenue Pittsburgh, PA...



View or download:
cmu.edu/~jianzhan/./pa...sigir03yang.ps

From: cmu.edu/~jianzhan/publications

Abstract: Real-world applications of text categorization often require a system to deal with tens of thousands of categories defined over a large taxonomy. This paper addresses the problem with respect to a set of popular algorithms in text categorization, including Support Vector Machines, k-nearest neighbor, ridge regression, linear least square fit and logistic regression. By providing a formal analysis of the computational complexity of each classification method, followed by an investigation on the...

Cited by:
Text Categorization - Sebastiani (2005)

Active bibliography (related documents):
1.3:   A Scalability Analysis of Classifiers in Text Categorization - Yang, Zhang, Kisiel (2003)
0.2:   Margin-based Local Regression for Adaptive Filtering - Yiming Yang Bryan (2003)
0.2:   Application of K-NN and FPTC Based Text Categorization Algorithms.. - Ilhan (2001)

Similar documents based on text:
0.3:   Robustness of Regularized Linear Classification Methods in Text .. - Zhang, Yang (2003)
0.3:   Robustness of Regularized Linear Classification Methods - In Text Categorization
0.2:   A Comparative Study on Feature Selection in Text Categorization - Yang, Pedersen (1997)

BibTeX entry:

Yang, Y., Zhang, J., and Kisiel, B. A scalability analysis of classifiers in text categorization. Proceedings of SIGIR-03, 26th ACM International Conference on Research and Development in Information Retrieval, ACM Press, New York, US: Toronto, CA, 2003. http://citeseer.ist.psu.edu/article/yang03scalability.html

@misc{ yang03scalability,
  author = "Y. Yang and J. Zhang and B. Kisiel",
  title = "A scalability analysis of classifiers in text categorization",
  text = "Yang, Y., Zhang, J., and Kisiel, B. A scalability analysis of classifiers in text categorization.
    Proceedings of SIGIR-03, 26th ACM International Conference on Research and
    Development in Information Retrieval, ACM Press, New York, US: Toronto, CA,
    2003.",
  year = "2003",
  url = "citeseer.ist.psu.edu/article/yang03scalability.html" }
Citations (may not include all citations):
2441   Johns Hopkins University Press - Golub, Loan et al. - 1996
947   Statistical Learning Theory - Vapnik - 1998
376   Text Categorization with Support Vector Machines: Learning w.. - Joachims - 1998
375   On power-law relationships of the internet topology - Faloutsos, Faloutsos et al. - 1999
215   A comparative study on feature selection in text categorizat.. - Yang, Pedersen - 1997
166   A re-examination of text categorization methods - Yang, Liu - 1999
149   An evaluation of statistical approaches to text categorizati.. - Yang - 1999
127   Lanczos algorithm for large symmetric eigenvalue computation.. - Cullum, Willoughby - 1985
112   An improved training algorithm for support vector machines - Osuna, Girosi - 1997
40   Large-scale singular value computations - Berry - 1992
29   Mechanisms of skill acquisition and the law of practice - Newell, Rosenbloom - 1981
29   A study of approaches to hypertext categorization - Yang, Slattery et al. - 2002
25   Noise reduction in a statistical approach to text categoriza.. - Yang - 1995
24   Text categorization based on regularized linear classification.. - Zhang, Oles - 2001
21   Training algorithms for linear text classifiers - Lewis, Schapire et al. - 1996
18   The Maximum-Margin Approach to Learning Text Classifiers: Met.. - Joachims - 2000
10   Microsoft Cambridge at TREC - Robertson, Walker - 2001
9   Effective and efficient learning from human decisions in text cate.. - Yang - 1994
6   The Reuters Corpus Volume I as a text categorization test co.. - Lewis, Li et al. - 2003
6   Sequential minimal optimization: A fast algorithm for trainin.. - Platt - 1998
2   A loss function analysis for classification methods in text c.. - Li, Yang - 2003

Documents on the same site (http://www.cs.cmu.edu/~jianzhan/publications.html):
A Smoothed Boosting Algorithm Using Probabilistic Output Codes - Jin, Zhang (2005)
Learning Multiple Related Tasks Using Latent Independent.. - Zhang, Ghahramani, Yang (2005)
Topic-conditioned Novelty Detection - Yiming Yang Jian (2002)
