See this document in CiteSeerX!

The Enron Corpus:  (Make Corrections)  
A New Dataset for Email Classification Research Bryan Klimt and Yiming Yang...



  Home/Search   Context   Related

 
View or download:
cmu.edu/yiming/Publi...klimtecml04.pdf
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  cmu.edu/~yiming/publications (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: Automated classification of email messages into user-specific folders and information extraction from chronologically ordered email streams have become interesting areas in text learning research. However, the lack of large benchmark collections has been an obstacle for studying the problems and evaluating the solutions. In this paper, we introduce the Enron corpus as a new test bed. We analyze its suitability with respect to email folder prediction, and provide the baseline results of a... (Update)

Active bibliography (related documents):   More   All
0.5:   Structured Ontology and Information Retrieval for Email Search .. - Eklund, Cole (2002)   (Correct)
0.5:   A Comparative Study of Classification Based Personal E-mail .. - Yanlei Diao Hongjun (2000)   (Correct)
0.0:   On Integrating Catalogs - Agrawal, Srikant (2001)   (Correct)

Similar documents based on text:
0.0:   Unknown -   (Correct)

BibTeX entry:   (Update)

@misc{ for-enron,
  author = "New Dataset For",
  title = "The Enron Corpus:",
  url = "citeseer.ist.psu.edu/751763.html" }
Citations (may not include all citations):
23   MailCat: An Intelligent Assistant for Organizing E-Mail - Segal, Kephart - 1999
2   Meek: Challenges of the Email Domain for Text Classification (context) - Brutlag - 2000
2   Knowles: Threading Electronic Mail: A Preliminary Study (context) - Lewis - 1997
1   Ochimizu: Construction of Deliberation Structure in Email Co.. (context) - Murakoshi, Shimazu - 1999
1   Wu: A comparative study of classification-based personal ema.. (context) - Diao, Lu - 2000
1   McCreath: Automatic Induction of Rules for e-mail Classifica.. (context) - Crawford, Kay - 2001
1   individual research project report (context) - Hung, Procmail et al. - 2001
1   Matwin: Email classification with co-training (context) - Kiritchenko - 2001
1   Tagarelli: Towards an Adaptive Mail Classifier (context) - Manco, Masciari et al. - 2002

Documents on the same site (http://www.cs.cmu.edu/~yiming/publications.html):   More
Topic Detection and Tracking Pilot Study - Allan, Carbonell, Doddington.. (1998)   (Correct)
Report on the CONALD Workshop on Learning from Text.. - Carbonell, Craven.. (1998)   (Correct)
Noise Reduction in a Statistical Approach to Text Categorization - Yang (1995)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC