(Enter summary)
Abstract: This paper introduces new methods based
on exponential families for modeling the
correlations between words in text and
speech. While previous work assumed the
effects of word co-occurrence statistics to
be constant over a window of several hun-
dred words, we show that their influence
is nonstationary on a much smaller time
scale. Empirical data drawn from English
and Japanese text, as well as conversational
speech,' reveals that the "attraction
" between words decays
... (Update)
Context of citations to this paper: More
.... use linguistic criteria such as cue phrases, punctuation marks, prosodic features, reference, syntax and lexical attraction [1, 2, 3, 15, 24, 25]. Another approach utilizes statistical similarity measures such as word cooccurrence (according to Halliday and Hasan [8]...
.... by Beeferman et al. uses n grams within a language model, but does not explicitly track the frequency of multiple word phrases [Beeferman et al. 1997b] The number of repeated bigrams in text is smaller than the number of repeated words and the number of trigrams which...
Cited by: More
Bayesian Methods for Frequent Terms in Text: Models of.. - Airoldi, Cohen, Fienberg (2005)
(Correct)
Eliminating Noisy Information in Web Pages for Data Mining - Yi, al. (2003)
(Correct)
Identifying the Interactions of Multi-Criteria in Turkish.. - Yöndem (2001)
(Correct)
Similar documents (at the sentence level):
65.9%: A Model of Lexical Attraction and Repulsion - Doug Beeferman Adam (1997)
(Correct)
42.3%: Accounting for Distance in a - Long-Range Language Model
(Correct)
Similar documents based on text: More All
0.5: CYBERPUNC: A lightweight punctuation annotation system.. - Beeferman, Berger.. (1998)
(Correct)
0.4: Statistical Models for Text Segmentation - Beeferman, Berger, Lafferty (1999)
(Correct)
0.3: Text Segmentation Using Exponential Models - Beeferman, Berger, Lafferty (1997)
(Correct)
Related documents from co-citation: More All
8: A Maximum Entropy Approach to Natural Language Processing
- Berger, Pietra et al. - 1996
7: Estimation of probabilities from sparse data for the language model component of.. (context) - Katz - 1997
6: Query Expansion Using Local and Global Document Analysis
- Xu, Croft - 1996
BibTeX entry: (Update)
D. Beeferman, A. Berger, and J. Lafferty, A model of lexical attraction and repulsion, In Proceedings of the ACL, Madrid, Spain, 1997. http://citeseer.ist.psu.edu/beeferman97model.html More
@inproceedings{ beeferman97model,
author = "Doug Beeferman and Adam Berger and John Lafferty",
title = "A Model of Lexical Attraction and Repulsion",
booktitle = "Proceedings of the Thirty-Fifth Annual Meeting of the {A}ssociation for {C}omputational {L}inguistics and Eighth Conference of the {E}uropean Chapter of the {A}ssociation for {C}omputational {L}inguistics",
publisher = "Association for Computational Linguistics",
address = "Somerset, New Jersey",
editor = "Philip R. Cohen and Wolfgang Wahlster",
pages = "373--380",
year = "1997",
url = "citeseer.ist.psu.edu/beeferman97model.html" }
Citations not processed or no citations identified.
The graph only includes citing articles where the year of publication is known.
Documents on the same site (http://acl.ldc.upenn.edu/P/P97/): More
Finite State Transducers Approximating Hidden Markov Models - Kempe (1997)
(Correct)
Planning Reference Choices for Argumentative Texts - Huang
(Correct)
Morphological Disambiguation by Voting Constraints - Oflazer, Tür (1997)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC