See this document in CiteSeerX!

Text Categorization with Support Vector Machines: Learning with Many Relevant Features (1997)  (Make Corrections)  (376 citations)
Thorsten Joachims
Proceedings of ECML-98, 10th European Conference on Machine Learning



  Home/Search   Context   Related

 
View or download:
uta.edu/~alp/ix/re...Categorization.pdf
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  uta.edu/~alp/ix/readings/ (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: This paper explores the use of Support Vector Machines (SVMs) for learning text classifiers from examples. It analyzes the particular properties of learning with text data and identifies, why SVMs are appropriate for this task. Empirical results support the theoretical findings. SVMs achieve substantial improvements over the currently best performing methods and they behave robustly over a variety of different learning tasks. Furthermore, they are fully automatic, eliminating the need for... (Update)

Cited by:   More
Journal of Machine Learning Research 3 (2003) 1265-1287.. - Algorithm For Text   (Correct)
Journal of Machine Learning Research 3 (2003) 1307-1331.. - Amir Globerson Gamir   (Correct)
Concept Drift and the Importance of Examples - Klinkenberg, Rüping (2002)   (Correct)

Similar documents (at the sentence level):
73.2%:   Text Categorization with Support Vector Machines: Learning with.. - Joachims (1998)   (Correct)
5.7%:   The Maximum-Margin Approach to Learning Text Classifiers -.. - Joachims (2000)   (Correct)
5.2%:   Transductive Inference for Text Classification using Support.. - Joachims (1999)   (Correct)

Active bibliography (related documents):   More   All
0.4:   Automated Modeling and Nonlinear Axis Scaling - Leejay Wu (2005)   (Correct)
0.4:   WebMate: A Personal Agent for Browsing and Searching - Chen, Sycara (1998)   (Correct)
0.2:   Combining Machine Learning and Hierarchical Structures for Text.. - Ruiz (2001)   (Correct)

Similar documents based on text:   More   All
0.5:   Estimating the Generalization Performance of an SVM Efficiently - Joachims (1999)   (Correct)
0.5:   Achieving Intelligence in Mobility - Incorporating Learning.. - Kaiser, al. (1994)   (Correct)
0.4:   Inferring Probabilistic Automata from Sensor Data for Robot.. - Rieger (1995)   (Correct)

Related documents from co-citation:   More   All
31:   The Nature of Statistical Learning Theory (context) - Vapnik - 1995
24:   An evaluation of statistical approaches to text categorization - Yang - 1999
24:   A sequential algorithm for training text classifiers: Corrigendum and additional.. - Lewis - 1995

BibTeX entry:   (Update)

Joachims, T. (1998). Text categorization with Support Vector Machines: Learning with many relevant features. In Machine Learning: ECML-98, Tenth European Conference on Machine Learning, pp. 137--142. http://citeseer.ist.psu.edu/joachims97text.html   More

@inproceedings{ joachims98text,
    author = "Thorsten Joachims",
    title = "Text categorization with support vector machines: learning with many relevant features",
    booktitle = "Proceedings of {ECML}-98, 10th European Conference on Machine Learning",
    number = "1398",
    publisher = "Springer Verlag, Heidelberg, DE",
    address = "Chemnitz, DE",
    editor = "Claire N{\'{e}}dellec and C{\'{e}}line Rouveirol",
    pages = "137--142",
    year = "1998",
    url = "citeseer.ist.psu.edu/joachims97text.html" }
Citations (may not include all citations):
2177   Programs for Machine Learning (context) - Quinlan - 1993
1291   The Nature of Statistical Learning Theory (context) - Vapnik - 1995
976   Machine Learning (context) - Mitchell - 1997
524   Support-vector networks - Cortes, Vapnik - 1995
463   Term weighting approaches in automatic text retrieval (context) - Salton, Buckley - 1988
372   An algorithm for suffix stripping (context) - Porter - 1980
288   Relevance feedback in information retrieval (context) - Rocchio - 1971
255   A training algorithm for optimal margin classifiers - Boser, Guyon et al. - 1992
225   Newsweeder: Learning to filter netnews - Lang - 1995
215   A comparative study on fea- ture selection in text categoriz.. - Yang, Pedersen - 1997
189   Webwatcher: A tour guide for the world wide web - Joachims, Freitag et al. - 1997
149   An evaluation of statistical approaches to text categoriza- .. - Yang - 1997
130   A probabilistic analysis of the rocchio algorithm with tfidf.. - Joachims - 1997
124   Learning infor- mation retrieval agents: Experiments with au.. - Balabanovic, Shoham - 1995
117   Estimation of Dependencies Based on Empirical Data (context) - Vapnik - 1982
112   An improved training algorithm for support vector machines - Osuna, Freund et al. - 1997
81   Developments in automatic text retrieval (context) - Salton - 1991
59   A neural net- work approach to topic spotting - Wiener, Pedersen et al. - 1995
57   A comparison of classifiers and document representations for.. - Schfitze, Hull et al. - 1995
51   Classifying news stories using memory based reasoning (context) - Masand, Linoff - 1992
41   Improving the accu- racy and speed of support vector machine.. - Burges, Sch - 1997
36   Structural risk minimization over data-dependent hierarchies (context) - Shawe-Taylor, Bartlett et al. - 1996
24   Text cate- gorization: A symbolic approach (context) - Moulinier, Raskinis et al. - 1996
23   A critical investigation of recall and precision as measures.. (context) - Raghavan, Bollmann et al. - 1989
16   Construeti system content based indexing database new storie (context) - Weinstein, Weinstein et al. - 1990
13   Using corpus statistics to re- move redundant words in text .. (context) - Yang, Wilbur - 1996
7   A comparison of two learning algorithms for text classificat.. (context) - Lewis, Ringuette - 1994
1   The perceptton algorithm vs (context) - Kivinen, Warmuth et al. - 1995



The graph only includes citing articles where the year of publication is known.


Documents on the same site (http://ranger.uta.edu/~alp/ix/readings/):   More
Dynamic Topic Identification: Towards Combination of.. - Bigi, Brun, Haton..   (Correct)
A Statistical Information Extraction System for Turkish - Tür (2000)   (Correct)
Semantic Wrappers. Using Web Agents to Extract Knowledge.. - Arjona, Corchuelo, Toro   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC