See this document in CiteSeerX!

A Bayesian Approach to Filtering Junk E-Mail (1998)  (Make Corrections)  (76 citations)
Mehran Sahami, Susan Dumais, David Heckerman, Eric Horvitz
Learning for Text Categorization: Papers from the 1998 Workshop



  Home/Search   Context   Related

 
View or download:
microsoft.com/~heckerman/spam98.ps
microsoft.com/~sdumais/spam98.ps
stanford.edu/users/sahami/pap...spam.ps
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  microsoft.com/~heckerman/ (more)
From:  stanford.edu/users/saham...papers
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: In addressing the growing problem of junk E-mail on the Internet, we examine methods for the automated construction of filters to eliminate such unwanted messages from a user's mail stream. By casting this problem in a decision theoretic framework, we are able to make use of probabilistic learning methods in conjunction with a notion of differential misclassification cost to produce filters which are especially appropriate for the nuances of this task. While this may appear, at first, to be a... (Update)

Cited by:   More
Filtron: A Learning-Based Anti-Spam Filter - Eirinaios Michelakis Ion (2004)   (Correct)
RE: Reliable Email - Scott Garriss Michael   (Correct)
Leave a Reply: An Analysis of Weblog Comments - Mishne, Glance (2006)   (Correct)

Active bibliography (related documents):   More   All
0.2:   Using Machine Learning To Improve Information Access - Sahami (1999)   (Correct)
0.1:   Learning Limited Dependence Bayesian Classifiers - Sahami (1996)   (Correct)
0.1:   Exploring the Robustness of Bayesian and.. - Kontkanen.. (1999)   (Correct)

Similar documents based on text:   More   All
0.3:   Boosting Trees for Anti-Spam Email Filtering - Carreras, Marquez (2001)   (Correct)
0.3:   Representation of Electronic Mail Filtering Profiles: A User Study - Pazzani (2000)   (Correct)
0.2:   Filtering Junk Mail with A Maximum Entropy Model - Le Zhang, Yao (2003)   (Correct)

Related documents from co-citation:   More   All
21:   Learning rules that classify e-mail - Cohen
19:   Machine Learning (context) - Mitchell - 1997
17:   Text categorization with Support Vector Machines: Learning with many relevant fe.. - Joachims - 1998

BibTeX entry:   (Update)

Mehran Sahami, Susan Dumais, David Heckerman, and Eric Horvitz. A bayesian approach to filtering junk e-mail. In AAAI-98 Workshop on Learning for Text Categorization, 1998. http://citeseer.ist.psu.edu/sahami98bayesian.html   More

@inproceedings{ sahami98bayesian,
    author = "Mehran Sahami and Susan Dumais and David Heckerman and Eric Horvitz",
    title = "A Bayesian Approach to Filtering Junk {E}-Mail",
    booktitle = "Learning for Text Categorization: Papers from the 1998 Workshop",
    publisher = "AAAI Technical Report WS-98-05",
    address = "Madison, Wisconsin",
    year = "1998",
    url = "citeseer.ist.psu.edu/sahami98bayesian.html" }
Citations (may not include all citations):
2319   Elements of Information Theory (context) - Cover, Thomas - 1991  ACM
1543   Probabilistic Reasoning in Intelligent Systems: Networks of .. (context) - Pearl - 1988  ACM
1291   The Nature of Statistical Learning Theory (context) - Vapnik - 1995  ACM
1256   Introduction to Modern Information Retrieval (context) - Salton, McGill - 1983  ACM
976   Machine Learning (context) - Mitchell - 1997  ACM   DBLP
416   A bayesian method for the induction of probabilistic network.. (context) - Cooper, Herskovits - 1992  ACM   DBLP
217   Human Behavior and the Principle of Least Effort (context) - Zipf - 1949
147   Boolean feature discovery in empirical learning (context) - Pagallo, Haussler - 1990  ACM   DBLP
138   Bayesian network classifiers - Friedman, Geiger et al. - 1997  ACM   DBLP
123   Toward optimal feature selection - Koller, Sahami - 1996  DBLP
97   Comparison of two learning algorithms for text categorizatio.. - Lewis, Ringuette - 1994
66   Learning rules that classify email - Cohen - 1996
61   Improving text classification by shrinkage in a hierarchy of.. - McCallum, Rosenfeld et al. - 1998  ACM   DBLP
58   The Estimation of Probabilities: An Essay on Modern Bayesian.. (context) - Good - 1965
41   Feature selection in statistical learning of text categoriza.. (context) - Yang, Pedersen - 1997
28   Learning limited dependence bayesian classifiers - Sahami - 1996  DBLP
9   Smokey: Automatic recognition of hostile messages - Spertus - 1997  DBLP
7   and Chickering (context) - Heckerman, Geiger - 1995
1   Text categorization with support vector machines: Learning w.. (context) - networks, of et al. - 1997



The graph only includes citing articles where the year of publication is known.


Documents on the same site (http://research.microsoft.com/~heckerman/):
An MDP-Based Recommender System - Shani, Brafman, Heckerman (2002)   (Correct)
David Heckerman Microsoft Research Valencia 7 June 1, 2002 - Tutorial On Graphical   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC