
Statistical Models for Text Segmentation (1999)  (60 citations)
Doug Beeferman, Adam Berger, John Lafferty
Machine Learning



View or download:
cmu.edu/~lafferty/ps/mlfinal.ps

From:  cmu.edu/~dougb/research

Abstract: This paper introduces a new statistical approach to automatically partitioning text into coherent segments. The approach is based on a technique that incrementally builds an exponential model to extract features that are correlated with the presence of boundaries in labeled training text. The models use two classes of features: topicality features that use adaptive language models in a novel way to detect broad changes of topic, and cue-word features that detect occurrences of specific words, ...

Cited by:
Using Context History for Data Collection in the Home - Daniel Wilson Robotics
Using Discrete PCA on Web Pages - Buntine, Perttu, Tuulos (2004)
Automatic Segmentation of Multiparty Dialogue - Hsueh, Moore, Renals (2006)

Similar documents (at the sentence level):
10.8%:   Text Segmentation Using Exponential Models - Beeferman, Berger, Lafferty (1997)

Related documents from co-citation:
17:   Text Segmentation by Topic - Ponte, Croft - 1997
15:   A Tutorial on Hidden Markov Models and Selected Applications in Speech Recogniti.. - Rabiner - 1989
14:   Topic detection and tracking pilot study: Final report - Allan, Carbonell et al. - 1998

BibTeX entry:

D. Beeferman, A. Berger, and J. Lafferty. Statistical models for text segmentation. Machine Learning, 1999. To appear. http://citeseer.ist.psu.edu/beeferman99statistical.html

@article{ beeferman99statistical,
    author = "Doug Beeferman and Adam Berger and John D. Lafferty",
    title = "Statistical Models for Text Segmentation",
    journal = "Machine Learning",
    volume = "34",
    number = "1-3",
    pages = "177-210",
    year = "1999",
    url = "citeseer.ist.psu.edu/beeferman99statistical.html" }

Documents on the same site (http://www.cs.cmu.edu/~dougb/research.html):
Link Grammar Prefix Measures for Spontaneous Speech Recognition - Beeferman (1995)
Evaluation Metrics For Language Models - Chen, Beeferman, Rosenfeld (1998)
Statistical Models for Text Segmentation - Beeferman, Berger, Lafferty (1999)

CiteSeer.IST - Copyright Penn State and NEC