(Enter summary)
Abstract: This paper explores the use of Support Vector Machines (SVMs) for learning text classifiers from examples. It analyzes the particular properties of learning with text data and identifies, why SVMs are appropriate for this task. Empirical results support the theoretical findings. SVMs achieve substantial improvements over the currently best performing methods and they behave robustly over a variety of different learning tasks. Furthermore, they are fully automatic, eliminating the need for... (Update)
Cited by: More
Journal of Machine Learning Research 3 (2003) 1265-1287.. - Algorithm For Text
(Correct)
Journal of Machine Learning Research 3 (2003) 1307-1331.. - Amir Globerson Gamir
(Correct)
Concept Drift and the Importance of Examples - Klinkenberg, Rüping (2002)
(Correct)
Similar documents (at the sentence level):
73.2%: Text Categorization with Support Vector Machines: Learning with.. - Joachims (1998)
(Correct)
5.7%: The Maximum-Margin Approach to Learning Text Classifiers -.. - Joachims (2000)
(Correct)
5.2%: Transductive Inference for Text Classification using Support.. - Joachims (1999)
(Correct)
Active bibliography (related documents): More All
0.4: Automated Modeling and Nonlinear Axis Scaling - Leejay Wu (2005)
(Correct)
0.4: WebMate: A Personal Agent for Browsing and Searching - Chen, Sycara (1998)
(Correct)
0.2: Combining Machine Learning and Hierarchical Structures for Text.. - Ruiz (2001)
(Correct)
Similar documents based on text: More All
0.5: Estimating the Generalization Performance of an SVM Efficiently - Joachims (1999)
(Correct)
0.5: Achieving Intelligence in Mobility - Incorporating Learning.. - Kaiser, al. (1994)
(Correct)
0.4: Inferring Probabilistic Automata from Sensor Data for Robot.. - Rieger (1995)
(Correct)
Related documents from co-citation: More All
31: The Nature of Statistical Learning Theory (context) - Vapnik - 1995
24: An evaluation of statistical approaches to text categorization
- Yang - 1999
24: A sequential algorithm for training text classifiers: Corrigendum and additional..
- Lewis - 1995
BibTeX entry: (Update)
Joachims, T. (1998). Text categorization with Support Vector Machines: Learning with many relevant features. In Machine Learning: ECML-98, Tenth European Conference on Machine Learning, pp. 137--142. http://citeseer.ist.psu.edu/joachims97text.html More
@inproceedings{ joachims98text,
author = "Thorsten Joachims",
title = "Text categorization with support vector machines: learning with many relevant features",
booktitle = "Proceedings of {ECML}-98, 10th European Conference on Machine Learning",
number = "1398",
publisher = "Springer Verlag, Heidelberg, DE",
address = "Chemnitz, DE",
editor = "Claire N{\'{e}}dellec and C{\'{e}}line Rouveirol",
pages = "137--142",
year = "1998",
url = "citeseer.ist.psu.edu/joachims97text.html" }
Citations (may not include all citations):
2177
Programs for Machine Learning (context) - Quinlan - 1993
1291
The Nature of Statistical Learning Theory (context) - Vapnik - 1995
976
Machine Learning (context) - Mitchell - 1997
524
Support-vector networks
- Cortes, Vapnik - 1995
463
Term weighting approaches in automatic text retrieval (context) - Salton, Buckley - 1988
372
An algorithm for suffix stripping (context) - Porter - 1980
288
Relevance feedback in information retrieval (context) - Rocchio - 1971
255
A training algorithm for optimal margin classifiers
- Boser, Guyon et al. - 1992
225
Newsweeder: Learning to filter netnews
- Lang - 1995
215
A comparative study on fea- ture selection in text categoriz..
- Yang, Pedersen - 1997
189
Webwatcher: A tour guide for the world wide web
- Joachims, Freitag et al. - 1997
149
An evaluation of statistical approaches to text categoriza- ..
- Yang - 1997
130
A probabilistic analysis of the rocchio algorithm with tfidf..
- Joachims - 1997
124
Learning infor- mation retrieval agents: Experiments with au..
- Balabanovic, Shoham - 1995
117
Estimation of Dependencies Based on Empirical Data (context) - Vapnik - 1982
112
An improved training algorithm for support vector machines
- Osuna, Freund et al. - 1997
81
Developments in automatic text retrieval (context) - Salton - 1991
59
A neural net- work approach to topic spotting
- Wiener, Pedersen et al. - 1995
57
A comparison of classifiers and document representations for..
- Schfitze, Hull et al. - 1995
51
Classifying news stories using memory based reasoning (context) - Masand, Linoff - 1992
41
Improving the accu- racy and speed of support vector machine..
- Burges, Sch - 1997
36
Structural risk minimization over data-dependent hierarchies (context) - Shawe-Taylor, Bartlett et al. - 1996
24
Text cate- gorization: A symbolic approach (context) - Moulinier, Raskinis et al. - 1996
23
A critical investigation of recall and precision as measures.. (context) - Raghavan, Bollmann et al. - 1989
16
Construeti system content based indexing database new storie (context) - Weinstein, Weinstein et al. - 1990
13
Using corpus statistics to re- move redundant words in text .. (context) - Yang, Wilbur - 1996
7
A comparison of two learning algorithms for text classificat.. (context) - Lewis, Ringuette - 1994
1
The perceptton algorithm vs (context) - Kivinen, Warmuth et al. - 1995
The graph only includes citing articles where the year of publication is known.
Documents on the same site (http://ranger.uta.edu/~alp/ix/readings/): More
Dynamic Topic Identification: Towards Combination of.. - Bigi, Brun, Haton..
(Correct)
A Statistical Information Extraction System for Turkish - Tür (2000)
(Correct)
Semantic Wrappers. Using Web Agents to Extract Knowledge.. - Arjona, Corchuelo, Toro
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC