(Enter summary)
Abstract: . The naive Bayes classifier, currently experiencing a renaissance in machine learning, has long been a core technique in information retrieval. We review some of the variations of naive Bayes models used for text retrieval and classification, focusing on the distributional assumptions made about word occurrences in documents. 1 Introduction The naive Bayes classifier, long a favorite punching bag of new classification techniques, has recently emerged as a focus of research itself in machine... (Update)
Cited by: More
Classifying under Computational Resource Constraints.. - Georey Webb Geoff (2005)
(Correct)
The Organisation and Retrieval of Document Collections: A.. - Vinokourov (2003)
(Correct)
Exploiting Structural Information in Semi-structured Document .. - Bratko, Filipic (2004)
(Correct)
Active bibliography (related documents): More All
3.5: Naive (Bayes) at Forty: The Independence Assumption in Information .. - Lewis (1998)
(Correct)
0.6: A Probabilistic Learning Approach for Document Indexing - Fuhr, Buckley (1991)
(Correct)
0.5: Optimizing Ranking Functions: A Connectionist Approach to.. - Bartell (1994)
(Correct)
Similar documents based on text: More All
0.3: On the Naive Bayes Model for Text Categorization - Eyheramendy, Lewis (2003)
(Correct)
0.0: Air traffic control communication at Portland International.. - Ward, Novick, Sousa (1990)
(Correct)
0.0: Renaissance User Interface Implementation Guide - August National Aeronautics
(Correct)
Related documents from co-citation: More All
4: Maximum Likelihood from Incomplete Data via the EM Algorithm (context) - Dempster, Laird et al. - 1977
3: Learning limited dependence Bayesian classifiers
- Sahami - 1996
3: Text categorization with Support Vector Machines: Learning with many relevant fe..
- Joachims - 1998
BibTeX entry: (Update)
Lewis, D. (1998). Naive Bayes at forty: The independence assumption in information retrieval. Conference proceedings of European Conference on Machine Learning (pp. 4--15). http://citeseer.ist.psu.edu/article/lewis98naive.html More
@inproceedings{ lewis98naive,
author = "David D. Lewis",
title = "Naive ({B}ayes) at forty: The independence assumption in information retrieval.",
booktitle = "Proceedings of {ECML}-98, 10th European Conference on Machine Learning",
number = "1398",
publisher = "Springer Verlag, Heidelberg, DE",
address = "Chemnitz, DE",
editor = "Claire N{\'{e}}dellec and C{\'{e}}line Rouveirol",
pages = "4--15",
year = "1998",
url = "citeseer.ist.psu.edu/article/lewis98naive.html" }
Citations (may not include all citations):
2133
Pattern Classification and Scene Analysis (context) - Duda, Hart - 1973
1256
Introduction to Modern Information Retrieval (context) - Salton, McGill - 1983
416
Information Retrieval
- van Rijsbergen - 1979
376
Text categorization with support vector machines: Learning w..
- Joachims - 1997
288
Relevance feedback in information retrieval (context) - Rocchio - 1971
271
Improving retrieval performance by relevance feedback (context) - Salton, Buckley - 1990
243
Information Retrieval: Data Structures and Algorithms (context) - Frakes, Baeza-Yates - 1992
221
Perceptrons: An Introduction to Computational Geometry (context) - Minsky, Papert - 1988
201
Relevance weighting of search terms (context) - Robertson, Jones - 1976
135
A sequential algorithm for training text classifiers
- Lewis, Gale - 1994
128
the optimality of the simple bayesian classifier under zero-..
- Domingos, Pazzani - 1997
110
Context-sensitive learning methods for text categorization
- Cohen, Singer - 1996
101
Evaluation of an inference network-based retrieval model (context) - Turtle, Croft - 1991
94
A method for disambiguating word senses in a large corpus (context) - Gale, Church et al. - 1993
51
Information Storage and Retrieval (context) - Korfhage - 1997
44
Natural language processing for information retrieval
- Lewis, Jones - 1996
40
Scaling up the accuracy of Naive-Bayes classifiers: a decisi..
- Kohavi - 1996
37
Models for retrieval with probabilistic indexing (context) - Fuhr - 1989
32
Relevance feedback and other query modification techniques (context) - Harman - 1992
32
Evaluating and optimizing autonomous text classification sys..
- Lewis - 1995
29
A theoretical basis for the use of co-occurrence data in inf.. (context) - van Rijsbergen - 1977
28
Information Retrieval Systems: Theory and Implementation (context) - Kowalski - 1997
27
Text categorization of low quality images (context) - Ittner, Lewis et al. - 1995
23
Automatic indexing: An experimental inquiry (context) - Maron - 1961
23
and signatures for navigating in text databases (context) - Chakrabarti, Dom et al. - 1997
20
Distribution of content words and phrases in text and langua.. (context) - Katz - 1996
20
A probabilistic approach to automatic keyword indexing (context) - Harter - 1975
20
A probabilistic approach to automatic keyword indexing (context) - Harter - 1975
19
Probabilistic models of indexing and searching (context) - Robertson, van Rijsbergen et al. - 1981
17
Experiments with representation in a document retrieval syst.. (context) - Croft - 1983
16
Pivoted document length normalization (context) - Singhal, Buckley et al. - 1996
14
Document classification using a finite mixture model
- Li, Yamanishi - 1997
11
Parameter estimation for probabilistic document-retrieval mo.. (context) - Losee - 1988
11
Boolean queries and term dependencies in probabilistic retri.. (context) - Croft - 1986
11
Text representation for intelligent text retrieval: A classi.. (context) - Lewis - 1992
8
A decision theoretic foundation for indexing (context) - Bookstein, Swanson - 1975
8
Search term relevance weighting given little relevance infor.. (context) - Jones - 1979
8
Some inconsistencies and misidentified modeling assumptions .. (context) - Cooper - 1995
8
An evaluation of feedback in document retrieval using co-occ.. (context) - Harper, van Rijsbergen - 1978
6
on and J. L. Kuhns. On relevance, probabilistic indexing, an.. (context) - Mar - 1960
5
Two learning schemes in information retrieval (context) - Yu, Mizuno - 1998
5
One term or two
- Church - 1995
5
Document classification by machine: Theory and practice
- Guthrie, Walker et al. - 1994
3
Applied Bayesian and Classical Inference (context) - Mosteller, Wallace - 1984
2
Modelling documents with multiple Poisson distributions (context) - Margulis - 1993
2
Operations research applied to document indexing and retriev.. (context) - Bookstein, Kraft - 1977
2
Bayesian inference with node aggregation for information ret.. (context) - Favero, Fung - 1994
The graph only includes citing articles where the year of publication is known.
Documents on the same site (http://www-dbs.cs.uni-sb.de/public_html/lehre/prosem00lit.html): More
Authoritative Sources in a Hyperlinked Environment - Kleinberg (1999)
(Correct)
Integrating Keyword Search into XML Query Processing - Florescu, al. (2000)
(Correct)
Using Linear Algebra for Intelligent Information Retrieval - Berry, Dumais, O'Brien (1995)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC