See this document in CiteSeerX!

Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval (1998)  (Make Corrections)  (12 citations)
David Lewis
Proceedings of ECML-98, 10th European Conference on Machine Learning



  Home/Search   Context   Related

 
View or download:
dbs.cs.unisb.de/p...lewis98becml98.ps
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  dbs.cs.unisb.de/pu...prosem00lit (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: . The naive Bayes classifier, currently experiencing a renaissance in machine learning, has long been a core technique in information retrieval. We review some of the variations of naive Bayes models used for text retrieval and classification, focusing on the distributional assumptions made about word occurrences in documents. 1 Introduction The naive Bayes classifier, long a favorite punching bag of new classification techniques, has recently emerged as a focus of research itself in machine... (Update)

Cited by:   More
Classifying under Computational Resource Constraints.. - Georey Webb Geoff (2005)   (Correct)
The Organisation and Retrieval of Document Collections: A.. - Vinokourov (2003)   (Correct)
Exploiting Structural Information in Semi-structured Document .. - Bratko, Filipic (2004)   (Correct)

Active bibliography (related documents):   More   All
3.5:   Naive (Bayes) at Forty: The Independence Assumption in Information .. - Lewis (1998)   (Correct)
0.6:   A Probabilistic Learning Approach for Document Indexing - Fuhr, Buckley (1991)   (Correct)
0.5:   Optimizing Ranking Functions: A Connectionist Approach to.. - Bartell (1994)   (Correct)

Similar documents based on text:   More   All
0.3:   On the Naive Bayes Model for Text Categorization - Eyheramendy, Lewis (2003)   (Correct)
0.0:   Air traffic control communication at Portland International.. - Ward, Novick, Sousa (1990)   (Correct)
0.0:   Renaissance User Interface Implementation Guide - August National Aeronautics   (Correct)

Related documents from co-citation:   More   All
4:   Maximum Likelihood from Incomplete Data via the EM Algorithm (context) - Dempster, Laird et al. - 1977
3:   Learning limited dependence Bayesian classifiers - Sahami - 1996
3:   Text categorization with Support Vector Machines: Learning with many relevant fe.. - Joachims - 1998

BibTeX entry:   (Update)

Lewis, D. (1998). Naive Bayes at forty: The independence assumption in information retrieval. Conference proceedings of European Conference on Machine Learning (pp. 4--15). http://citeseer.ist.psu.edu/article/lewis98naive.html   More

@inproceedings{ lewis98naive,
    author = "David D. Lewis",
    title = "Naive ({B}ayes) at forty: The independence assumption in information retrieval.",
    booktitle = "Proceedings of {ECML}-98, 10th European Conference on Machine Learning",
    number = "1398",
    publisher = "Springer Verlag, Heidelberg, DE",
    address = "Chemnitz, DE",
    editor = "Claire N{\'{e}}dellec and C{\'{e}}line Rouveirol",
    pages = "4--15",
    year = "1998",
    url = "citeseer.ist.psu.edu/article/lewis98naive.html" }
Citations (may not include all citations):
2133   Pattern Classification and Scene Analysis (context) - Duda, Hart - 1973
1256   Introduction to Modern Information Retrieval (context) - Salton, McGill - 1983
416   Information Retrieval - van Rijsbergen - 1979
376   Text categorization with support vector machines: Learning w.. - Joachims - 1997
288   Relevance feedback in information retrieval (context) - Rocchio - 1971
271   Improving retrieval performance by relevance feedback (context) - Salton, Buckley - 1990
243   Information Retrieval: Data Structures and Algorithms (context) - Frakes, Baeza-Yates - 1992
221   Perceptrons: An Introduction to Computational Geometry (context) - Minsky, Papert - 1988
201   Relevance weighting of search terms (context) - Robertson, Jones - 1976
135   A sequential algorithm for training text classifiers - Lewis, Gale - 1994
128   the optimality of the simple bayesian classifier under zero-.. - Domingos, Pazzani - 1997
110   Context-sensitive learning methods for text categorization - Cohen, Singer - 1996
101   Evaluation of an inference network-based retrieval model (context) - Turtle, Croft - 1991
94   A method for disambiguating word senses in a large corpus (context) - Gale, Church et al. - 1993
51   Information Storage and Retrieval (context) - Korfhage - 1997
44   Natural language processing for information retrieval - Lewis, Jones - 1996
40   Scaling up the accuracy of Naive-Bayes classifiers: a decisi.. - Kohavi - 1996
37   Models for retrieval with probabilistic indexing (context) - Fuhr - 1989
32   Relevance feedback and other query modification techniques (context) - Harman - 1992
32   Evaluating and optimizing autonomous text classification sys.. - Lewis - 1995
29   A theoretical basis for the use of co-occurrence data in inf.. (context) - van Rijsbergen - 1977
28   Information Retrieval Systems: Theory and Implementation (context) - Kowalski - 1997
27   Text categorization of low quality images (context) - Ittner, Lewis et al. - 1995
23   Automatic indexing: An experimental inquiry (context) - Maron - 1961
23   and signatures for navigating in text databases (context) - Chakrabarti, Dom et al. - 1997
20   Distribution of content words and phrases in text and langua.. (context) - Katz - 1996
20   A probabilistic approach to automatic keyword indexing (context) - Harter - 1975
20   A probabilistic approach to automatic keyword indexing (context) - Harter - 1975
19   Probabilistic models of indexing and searching (context) - Robertson, van Rijsbergen et al. - 1981
17   Experiments with representation in a document retrieval syst.. (context) - Croft - 1983
16   Pivoted document length normalization (context) - Singhal, Buckley et al. - 1996
14   Document classification using a finite mixture model - Li, Yamanishi - 1997
11   Parameter estimation for probabilistic document-retrieval mo.. (context) - Losee - 1988
11   Boolean queries and term dependencies in probabilistic retri.. (context) - Croft - 1986
11   Text representation for intelligent text retrieval: A classi.. (context) - Lewis - 1992
8   A decision theoretic foundation for indexing (context) - Bookstein, Swanson - 1975
8   Search term relevance weighting given little relevance infor.. (context) - Jones - 1979
8   Some inconsistencies and misidentified modeling assumptions .. (context) - Cooper - 1995
8   An evaluation of feedback in document retrieval using co-occ.. (context) - Harper, van Rijsbergen - 1978
6   on and J. L. Kuhns. On relevance, probabilistic indexing, an.. (context) - Mar - 1960
5   Two learning schemes in information retrieval (context) - Yu, Mizuno - 1998
5   One term or two - Church - 1995
5   Document classification by machine: Theory and practice - Guthrie, Walker et al. - 1994
3   Applied Bayesian and Classical Inference (context) - Mosteller, Wallace - 1984
2   Modelling documents with multiple Poisson distributions (context) - Margulis - 1993
2   Operations research applied to document indexing and retriev.. (context) - Bookstein, Kraft - 1977
2   Bayesian inference with node aggregation for information ret.. (context) - Favero, Fung - 1994



The graph only includes citing articles where the year of publication is known.


Documents on the same site (http://www-dbs.cs.uni-sb.de/public_html/lehre/prosem00lit.html):   More
Authoritative Sources in a Hyperlinked Environment - Kleinberg (1999)   (Correct)
Integrating Keyword Search into XML Query Processing - Florescu, al. (2000)   (Correct)
Using Linear Algebra for Intelligent Information Retrieval - Berry, Dumais, O'Brien (1995)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC