An Empirical Study of Smoothing Techniques for Language Modeling (1998)

by Stanley F. Chen
Citations: 1224 (21 self)
BibTeX

@TECHREPORT{Chen98anempirical,
    author = {Stanley F. Chen},
    title = {An Empirical Study of Smoothing Techniques for Language Modeling},
    institution = {},
    year = {1998}
}

Abstract

We present an extensive empirical comparison of several smoothing techniques in the domain of language modeling, including those described by Jelinek and Mercer (1980), Katz (1987), and Church and Gale (1991). We investigate for the first time how factors such as training data size, corpus (e.g., Brown versus Wall Street Journal), and n-gram order (bigram versus trigram) affect the relative performance of these methods, which we measure through the cross-entropy of test data. In addition, we introduce two novel smoothing techniques, one a variation of Jelinek-Mercer smoothing and one a very simple linear interpolation technique, both of which outperform existing methods.
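As a rough illustration of the evaluation the abstract describes, the sketch below smooths a bigram model by linear interpolation with a unigram model and scores it by the cross-entropy of held-out text, in bits per word. It is not either of the techniques introduced in the report: the fixed weight `lam`, the add-one unigram estimate, the toy sentences, and all function names are assumptions made for this example only.

```python
import math
from collections import Counter


def train(tokens):
    """Collect the counts needed for an interpolated bigram model."""
    return {
        "unigrams": Counter(tokens),
        "bigrams": Counter(zip(tokens, tokens[1:])),
        "contexts": Counter(tokens[:-1]),  # how often each word is a history
        "total": len(tokens),
    }


def prob(model, prev, word, vocab_size, lam=0.7):
    """P(word | prev) as a fixed-weight mix of bigram and unigram estimates."""
    # Add-one unigram estimate (an assumption) so unseen words keep some mass.
    p_uni = (model["unigrams"][word] + 1) / (model["total"] + vocab_size)
    seen = model["contexts"][prev]
    if seen == 0:
        return p_uni  # unseen history: rely on the unigram estimate alone
    p_bi = model["bigrams"][(prev, word)] / seen
    return lam * p_bi + (1 - lam) * p_uni


def cross_entropy(model, test_tokens, vocab_size, lam=0.7):
    """Average negative log2 probability per predicted word of the test text."""
    pairs = list(zip(test_tokens, test_tokens[1:]))
    log_sum = sum(math.log2(prob(model, p, w, vocab_size, lam)) for p, w in pairs)
    return -log_sum / len(pairs)


# Toy data, invented for the example.
train_tokens = "the cat sat on the mat the cat ate".split()
test_tokens = "the mat sat on the cat".split()
vocab_size = len(set(train_tokens) | set(test_tokens))

model = train(train_tokens)
print(cross_entropy(model, test_tokens, vocab_size))
```

Lower cross-entropy means the model assigns higher probability to the test text, which is how the abstract says the methods are ranked. Without the unigram term, the unseen bigram "mat sat" would receive probability zero and the cross-entropy would be infinite, which is the problem smoothing addresses.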

Keyphrases

language modeling    smoothing technique    empirical study    brown versus wall street journal    test data    training data size    bigram versus trigram    relative performance    simple linear interpolation technique    first time    extensive empirical comparison    n-gram order    jelinek-mercer smoothing   
