See this document in CiteSeerX!

The Maximum-Margin Approach to Learning Text Classifiers Methods, Theory, and Algorithms (2000)  (Make Corrections)  (18 citations)
Thorsten Joachims



  Home/Search   Context   Related

 
View or download:
ai.informatik.uni...achims_2000b.ps.gz
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  ai.informatik.unidor...DOKUMENTE (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: this dissertation would have turned out di erently. Or maybe, it would never have been started nor nished (Update)

Cited by:   More
Novel Learning Tasks from Practical Applications - Klinkenberg, Ritthoff, Morik (2002)   (Correct)
Concept Drift and the Importance of Examples - Klinkenberg, Rüping (2002)   (Correct)
A Scalability Analysis of Classifiers in Text Categorization - Yang, Zhang, Kisiel (2003)   (Correct)

Similar documents (at the sentence level):
5.2%:   Estimating the Generalization Performance of an SVM Efficiently - Joachims (2000)   (Correct)

Active bibliography (related documents):   More   All
2.2:   A Statistical Learning Model of Text Classification for Support.. - Joachims (2001)   (Correct)
1.5:   WebMate: A Personal Agent for Browsing and Searching - Chen, Sycara (1998)   (Correct)
1.1:   Combining Machine Learning and Hierarchical Structures for Text.. - Ruiz (2001)   (Correct)

Similar documents based on text:
0.0:   Unknown -   (Correct)

Related documents from co-citation:   More   All
12:   Statistical Learning Theory (context) - Vapnik
9:   Text categorization with Support Vector Machines: Learning with many relevant fe.. - Joachims - 1998
5:   Text Categorization Based on Regularized Linear Classification Methods - Zhang, Oles - 2000

BibTeX entry:   (Update)

T. Joachims. The Maximum-Margin Approach to Learning Text Classiers: Methods, Theory, and Algorithms. PhD thesis, Universitat Dortmund, 2001. Kluwer, to appear. http://citeseer.ist.psu.edu/joachims00maximummargin.html   More

@misc{ joachims01maximummargin,
  author = "T. Joachims",
  title = "The Maximum-Margin Approach to Learning Text Classiers: Methods",
  text = "T. Joachims. The Maximum-Margin Approach to Learning Text Classiers: Methods,
    Theory, and Algorithms. PhD thesis, Universitat Dortmund, 2001. Kluwer,
    to appear.",
  year = "2001",
  url = "citeseer.ist.psu.edu/joachims00maximummargin.html" }
Citations (may not include all citations):
2319   Elements of Information Theory (context) - Cover, Thomas - 1991
2177   Programs for Machine Learning (context) - Quinlan - 1993
1359   Induction of decision trees (context) - Quinlan - 1986
1291   The Nature of Statistical Learning Theory (context) - Vapnik - 1995
976   Machine Learning (context) - Mitchell - 1997
947   Statistical Learning Theory (context) - Vapnik - 1998
686   Practical Methods of Optimization (context) - Fletcher - 1987
663   Practical Optimization (context) - Gill, Murray et al. - 1981
568   Indexing by latent semantic analysis - Deerwester, Dumais et al. - 1990
546   An Introduction to the Bootstrap (context) - Efron, Tibshirani - 1993
500   Experiments with a new boosting algorithm - Freund, Schapire - 1996
480   An overview of the kl-one knowledge representation system (context) - Brachman, Schmolze - 1985
463   Term weighting approaches in automatic text retrieval (context) - Salton, Buckley - 1988
431   A tutorial on support vector machines for pattern recognitio.. - Burges - 1998
416   Information Retrieval - van Rijsbergen - 1979
376   Text categorization with support vector machines: Learning w.. - Joachims - 1998
372   Modern Information Retrieval - Baeza-Yates, Ribeiro-Neto - 1999
296   A Probabilistic Theory of Pattern Recognition (context) - Devroye, Gy et al. - 1996
288   Relevance feedback in information retrieval (context) - Rocchio - 1971
268   Making Large-Scale SVM Learning Practical - Joachims - 1999
268   Making large-scale svm learning practical - Joachims - 1998
244   Letizia: An agent that assists Web browsing - Lieberman - 1995
215   A comparative study on feature selection in text categorizat.. - Yang, Pedersen - 1997
207   WebWatcher: A learning apprentice for the World Wide Web - Armstrong, Freitag et al. - 1998
202   Introduction to wordnet: An on-line lexical database (context) - Miller, Fellbaum et al. - 1990
200   Training support vector machines: An application to face det.. - Osuna, Freund et al. - 1997
191   Fast training of support vector machines using sequential mi.. (context) - Platt - 1999
191   The SMART Retrieval System: Experiments in Automatic Documen.. (context) - Salton - 1971
189   WebWatcher: a tour guide for the world wide web - Joachims, Freitag et al. - 1997
180   Combining labeled and unlabeled data with co-training - Blum, Mitchell - 1998
166   A re-examination of text categorization methods - Yang, Liu - 1999
164   A study of cross-validation and bootstrap for accuracy estim.. - Kohavi - 1995
164   webert: Identifying interesting web sites (context) - Pazzani, Muramatsu et al. - 1996
157   Probability inequalities for sums of bounded random variable.. (context) - ding - 1963
149   An evaluation of statistical approaches to text categorizati.. - Yang - 1997
149   An evaluation of statistical approaches to text categorizati.. - Yang - 1999
143   An Introduction to Support Vector Machines and Other Kernel-.. (context) - Cristianini, Shawe-Taylor - 2000
134   Philosophical Investigations (context) - Wittgenstein - 1967
130   A probabilistic analysis of the Rocchio algorithm with TFIDF.. - Joachims - 1997
124   Learning information retrieval agents: Experiments with auto.. - Balabanovic, Shoham - 1995
121   Classi cation and regression trees (context) - Breiman, Friedman et al. - 1984
120   Greedy attribute selection - Caruana, Freitag - 1994
120   Inductive learning algorithms and representations for text c.. (context) - Dumais, Platt et al. - 1998
117   Estimation of Dependencies Based on Empirical Data (context) - Vapnik - 1982
110   Context-sensitive learning methods for text categorization - Cohen - 1996
106   Foil: A midterm report - Quinlan, Cameron-Jones - 1993
103   at forty: The independence assumption in information retriev.. (context) - Lewis - 1998
101   An algorithm for sux stripping (context) - Porter - 1980
100   Learning with Kernels (context) - Smola - 1998
100   Personalized information delivery: An analysis of informatio.. (context) - Foltz, Dumais - 1992
96   Introduction to the Theory of Statistics (context) - Mood, Graybill et al. - 1974
94   Loqo: An interior point code for quadratic programming - Vanderbei - 1994
89   Machine Learning (context) - Michie, Spiegelhalter et al. - 1994
81   Developments in automatic text retrieval (context) - Salton - 1991
80   Support vector machines: Training and applications - Osuna, Freund et al. - 1996
80   Learning to classify text from labeled and unlabeled documen.. - Nigam, McCallum et al. - 1998
78   Boosting the margin: a new explanation for the e ectiveness .. (context) - Schapire, Freund et al. - 1997
77   Probabilistic outputs for support vector machines and compar.. - Platt - 1999
77   Boosting in the limit: Maximizing the margin of learned ense.. - Grove, Schuurmans - 1998
76   BoosTexter: a boosting-based system for text categorization - Schapire, Singer - 2000
75   The probability ranking principle in ir (context) - Robertson - 1977
73   An evaluation of phrasal and clustered representations on a .. (context) - Lewis - 1992
66   An experimental and theoretical comparison of model selectio.. - Kearns, Mansour et al. - 1997
59   A neural network approach to topic spotting - Wiener, Pedersen et al. - 1995
55   Multi-class support vector machines - Weston, Watkins - 1998
52   Latent semantic indexing (context) - Dumais - 1994
51   Experiments in Automatic Phrase Indexing for Document Retrie.. (context) - Fagan - 1987
51   Classifying news stories using memory based reasoning (context) - Masand, Lino et al. - 1992
51   Convolution kernels on discrete structures - Haussler - 1999
48   Newsweeder: Learning to lter netnews (context) - Lang - 1995
48   Semi-supervised support vector machines - Bennett, Demiriz - 1998
47   Sparse greedy matrix approximation for machine learning - Smola, Sch - 2000
45   The Jackknife and Bootstrap (context) - Shao, Tu - 1995
44   Multitask learning - Caruana, Pratt et al. - 1997
43   Algorithmic stability and sanitycheck bounds for leave-one-o.. - Kearns, Ron - 1997
42   Learning to classify english text with ilp methods - Cohen - 1995
42   Dynamic alignment kernels - Watkins - 2000
41   Automated learning of decision rules for text categorization (context) - Damerau - 1994
41   Estimating the error rate of a prediction rule: Improvements.. (context) - Efron - 1983
39   Probabilistic kernel regression models - Jaakkola, Haussler - 1999
38   Information Retrieval: Computational and Theoretical Aspects (context) - Heaps - 1978
36   Structural risk minimization over data-dependent hierarchies (context) - Shawe-Taylor, Bartlett et al. - 1996
36   A machine learning architecture for optimizing web search en.. - Boyan, Freitag et al. - 1996
35   and Other Resampling Plans (context) - Efron - 1982
34   A probabilistic learning approach for document indexing - Fuhr, Buckley - 1991
34   Estimation of error rates in discriminant analysis (context) - Lachenbruch, Mickey - 1968
32   Sequential minimal optimization: A fast algorithm for traini.. - Platt - 1998
32   A comparison of event models for naive bayes text classi cat.. (context) - McCallum, Nigam - 1998
29   Maximizing text-mining performance (context) - Weiss, Apt et al. - 1999
29   A theoretical basis for the use of cooccurrence data in info.. (context) - van Rijsbergen - 1977
27   the optimality of the simple bayesian classi er under zero-o.. - Domingos, Pazzani - 1997
26   Support vector machines for spam categorization (context) - Drucker, Wu et al. - 1999
26   Term clustering of syntactic phrases - Croft, Lewis - 1990
25   Solving the quadratic programming problem arising in support.. (context) - Kaufman - 1999
25   Large text searching allowing errors (context) - ujo, Navarro et al. - 1997
25   Noise reduction in a statistical approach to text categoriza.. - Yang - 1995
25   Successive overrelaxation for support vector machines - Mangasarian, Musicant - 1999
24   Distribution-free performance bounds for potential function .. (context) - Devroye, Wagner - 1979
24   Combining support vector and mathematical programming method.. - Bennett - 1999
24   Estimating the generalization performance of a SVM eciently - Joachims - 2000
24   Text categorization: A symbolic approach (context) - Moulinier, Raskinis et al. - 1996
23   A critical investigation of recall and precision as measures.. (context) - Raghavan, Bollmann et al. - 1989
23   Automatic indexing based on bayesian inference networks - Tzeras, Hartmann - 1993
23   Automatic indexing: An experimental inquiry (context) - Maron - 1961
23   Asymptotics for and against cross-validation (context) - Stone - 1977
22   Detecting concept drift with support vector machines - Klinkenberg, Joachims - 2000
22   Using sparseness and analytic qp to speed training of suppor.. (context) - Platt - 1999
21   Learning by transduction - Gammerman, Vapnik et al. - 1998
20   New support vector algorithms - olkopf, Smola et al. - 2000
20   Optimized rule induction (context) - Weiss, Indurkhya - 1993
19   Learning from a mixture of labeled and unlabeled examples wi.. - Ratsaby, Venkatesh - 1995
19   Distributional clustering of words for text classi cation (context) - Baker, McCallum - 1998
18   A fast iterative nearest point algorithm for support vector .. - Keerthi, Shevade et al. - 1999
17   Autoclass: A bayesian classi cation system (context) - Cheeseman, Kelly et al. - 1988
17   Some inconsistencies and misnomers in probabilistic informat.. (context) - Cooper - 1991
17   A bound on the error of cross validation using the approxima.. - Kearns - 1996
17   Support vector machines (context) - Wahba - 1999
16   Using Machine Learning to Improve Information Access - Sahami - 1998
16   ect of adding relevance information in a relevance feedback .. (context) - Buckley, Salton et al. - 1994
16   Construeti system content based indexing database new storie (context) - Weinstein, Weinstein et al. - 1990
16   The KernelAdatron algorithm: a fast and simple learning proc.. - Cristianini, Campbell - 1998
16   Cross-validatory choice and assesment of statistical predict.. (context) - Stone - 1974
16   A case study in using linguistic phrases for text categoriza.. (context) - urnkranz, Mitchell et al. - 1998
15   The analysis of decomposition methods for support vector mac.. - Chang, Hsu et al. - 1999
15   Estimating the accuracy of learned concepts - Bailey, Elkan - 1993
14   Support Vector Learning - olkopf - 1997
14   the convergence of the decomposition method for support vect.. - Lin - 2000
14   A cluster-based approach to thesaurus construction (context) - Crouch - 1988
13   Combining statistical learning with a knowledge-based approa.. - Morik, Brockhausen et al. - 1999
13   Optimum polynomial retrieval functions based on the probabil.. (context) - Fuhr - 1989
12   Uniqueness of the SVM solution - Burges, Crisp - 1999
11   Transductive inference for text classi cation using support .. (context) - Joachims - 1999
11   Machinelearning applications of algorithmic randomness - Vovk, Gammerman et al. - 1999
11   An improved decomposition algorithm for regression support v.. - Laskov - 2000
11   Applying an existing machine learning algorithm to text cate.. - Moulinier, Ganascia - 1996
11   A note on a class of skew distribution functions: Analysis a.. (context) - Mandelbrot - 1959
10   Airx rule based multistage indexing system large subject e.. - Hartmann, Tzeras et al. - 1991
9   A comparison of classi ers and document representations for .. (context) - Schutze, Hull et al. - 1995
9   Feature selection in svm text categorization (context) - Taira, Haruno - 1999
9   Department of Computer and Information Science (context) - Lewis - 1992
9   Department of Computer and Information Science (context) - Lewis - 1992
9   Boosting and Rocchio applied to text ltering (context) - Schapire, Singer et al. - 1998
9   Information extraction as a basis for high-precision text cl.. - Lehnert - 1994
9   A simple decomposition method for support vector machines - Hsu, Lin - 2000
9   One term or two (context) - Church - 1995
8   Distribution-free inequalities for the deleted and holdout e.. (context) - Devroye, Wagner - 1979
8   A case study of latent semantic indexing - Berry, Dumais et al. - 1995
8   Introduction to Nonlinear Optimization (context) - Wismer, Chattergy - 1978
8   Evaluation of attributes obtained in statistical decision ru.. (context) - Lunts, Brailovskiy - 1967
8   and Reality (context) - Whorf - 1959
8   Large margin dags for multiclass classi cation (context) - Platt, Cristianini et al. - 2000
7   Semi-supervised support vector machines for unlabeled data c.. - Fung, Mangasarian - 1999
7   A quadratic programming procedure (context) - Hildreth - 1957
7   A comparison of two learning algorithms for text classi cati.. (context) - Lewis, Ringuette - 1994
7   Theorie der Zeichenerkennung (context) - Wapnik, Tscherwonenkis - 1979
7   Knowledge discovery and knowledge validation in intensive ca.. - Morik, Imho et al. - 2000
7   Bayesian transduction - Graepel, Herbrich et al. - 2000
7   Lengthfrequency statistics for written English (context) - Miller, Newman et al. - 1958
6   A probabilistic description-oriented approach for categorisi.. (context) - overt, Lalmas et al. - 1999
6   Block addressing indices for approximate text retrieval (context) - Baeza-Yates, Navarro - 1997
6   Retrieval test evaluation of a rule based automatic indexing (context) - Fuhr, Knorz - 1984
5   Incorporating test inputs into learning - Cataltepe, Magdon-Ismail - 1998
5   Expected error analysis for model selection - er, Joachims - 1999
5   Evaluating and optmizing autonomous text classi cation syste.. (context) - Lewis - 1995
5   Optimization - Theory and Applications (context) - Werner - 1984
5   ect of unlabeled samples in reducing the small sample size p.. (context) - Shahshahani, Landgrebe et al. - 1994
5   Human Behavior and the Principle of Least E ort: An Introduc.. (context) - Zipf - 1949
5   Pairwise classi cation and support vector machines (context) - el - 1999
5   WebWatcher: Machine learning and hypertext - Joachims, Mitchell et al. - 1995
5   Probabilistic learning approaches for indexing and retrieval.. - Fuhr, Pfeifer et al. - 1994
4   A probabilistic approach to automated keyword indexing (context) - Harter - 1975
4   A probabilistic approach to automated keyword indexing (context) - Harter - 1975
4   the convergence of gradient methods under constraint (context) - Wolfe - 1972
4   Topic characterization of full length texts using direct and.. - Fisher - 1994
4   Shrinking the tube: a new support vector regression algorith.. - olkopf, Smola et al. - 1999
4   A distribution-free performance bound in error estimation (context) - Devroye, Wagner - 1976
4   Words or concepts: the features of indexing units and their .. (context) - Yang, Chute - 1993
3   Mathematical Programming Methods (context) - Zoutendijk - 1970
3   The selection of good search terms (context) - van Rijsbergen, Harper et al. - 1981
3   Gaussian process classi cation and SVM: Mean eld results and.. (context) - Opper, Winther - 2000
3   Combining shallow text processing and machine learning in re.. (context) - Neumann, Schmeier - 1999
3   A study of support vectors on model independent example sele.. - Syed, Liu et al. - 1999
3   Key concepts in model selection: Performance and generalizat.. - Forster - 2000
3   Algorithms for recognizing contour-traced handprinted charac.. (context) - Toussaint, Donaldson - 1970
3   On structural risk minimization or overall risk in a problem.. (context) - Vapnik, Sterin - 1977
2   Improving learning accuracy in information ltering (context) - de Kroon, Mitchell et al. - 1996
2   Learning classi cation with unlabeled data (context) - de Sa - 1993
2   Large margin trees for induction and transduction - Wu, Bennett et al. - 1999
2   Probabilistic models for automated indexing (context) - Bookstein, Swanson - 1974
2   Active support vector machine classi cation (context) - Mangasarian, Musicant - 2000
2   A traininig algorithm for optimal margin classi ers (context) - Boser, Guyon et al. - 1992
2   tour guides und adaptive www-server (context) - Joachims, Mladeni - 1998
2   Message classi cation in the call center (context) - Busemann, Schmeier et al. - 2000
2   Learning for Text Categorization (context) - Sahami, Craven et al. - 1998
2   Machine Learning Journal (context) - Cortes, Vapnik - 1995
2   Wissenserlangung aus grossen datenbanken (context) - Joachims - 1999
2   Analysis of molecular pro le data using generative and discr.. (context) - Moler, Chow et al. - 2000
2   cation from labeled and unlabeled documents using EM (context) - Nigam, McCallum et al. - 2000
1   Aktuelles schlagwort: Support vector machines (context) - Joachims - 1999
1   Collection properties (context) - Spark-Jones - 1973

[Article contains additional citations not shown here]



The graph only includes citing articles where the year of publication is known.


Documents on the same site (http://www-ai.informatik.uni-dortmund.de/DOKUMENTE):   More
Efficient Kernel Calculation for Multirelational Data - Rüping (2002)   (Correct)
Domain Knowledge and Data Mining Process Decisions - Knobbe, Schipper, Brockhausen (2000)   (Correct)
Text Categorization with Support Vector Machines: Learning with.. - Joachims (1998)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC