
Latent Dirichlet Allocation (2003)  (164 citations)
David M. Blei, Andrew Y. Ng, Michael I. Jordan
Journal of Machine Learning Research 3




 
View or download:
jmlr.org/papers/volume3...blei03a.ps.gz
From:  mit.edu/papers/v3/

Abstract: We describe latent Dirichlet allocation (LDA), a generative probabilistic model for collections of discrete data such as text corpora. LDA is a three-level hierarchical Bayesian model, in which each item of a collection is modeled as a finite mixture over an underlying set of topics. Each topic is, in turn, modeled as an infinite mixture over an underlying set of topic probabilities. In the context of text modeling, the topic probabilities provide an explicit representation of a document. We...
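The generative view in the abstract can be illustrated with a small sketch. The Python below is only an illustration under stated assumptions, not the authors' code: the toy topics, the `ALPHA` values, the fixed document length, and the helper names `sample_dirichlet` and `generate_document` are all hypothetical. It draws a per-document mixture over topics from a Dirichlet distribution, then for each word picks a topic from that mixture and a word from the chosen topic.

```python
import random

def sample_dirichlet(alpha, rng):
    """Draw one sample from Dirichlet(alpha) by normalizing Gamma draws."""
    draws = [rng.gammavariate(a, 1.0) for a in alpha]
    total = sum(draws)
    return [d / total for d in draws]

def generate_document(topics, alpha, num_words, rng):
    """Generate one document; returns (theta, list of (topic_index, word)).

    topics: list of dicts mapping word -> probability, one dict per topic.
    alpha:  Dirichlet parameter, one positive value per topic.
    num_words is fixed here for simplicity (an assumption of this sketch).
    """
    # Document level: draw this document's mixture over topics.
    theta = sample_dirichlet(alpha, rng)
    doc = []
    for _ in range(num_words):
        # Word level: pick a topic from theta, then a word from that topic.
        z = rng.choices(range(len(topics)), weights=theta)[0]
        words = list(topics[z].keys())
        probs = list(topics[z].values())
        w = rng.choices(words, weights=probs)[0]
        doc.append((z, w))
    return theta, doc

# Hypothetical toy topics and prior, invented for illustration only.
TOPICS = [
    {"gene": 0.5, "cell": 0.3, "dna": 0.2},
    {"ball": 0.6, "team": 0.3, "goal": 0.1},
]
ALPHA = [0.5, 0.5]

rng = random.Random(0)
theta, doc = generate_document(TOPICS, ALPHA, num_words=8, rng=rng)
```

Here `theta` plays the role of the topic probabilities that, per the abstract, give an explicit representation of a document. Inference, which runs in the opposite direction from observed words to topics, is not shown.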

Cited by:
Journal of Machine Learning Research 7 (2006) 2189--2213.. - And Its Application
Variational Learning for Noisy-OR Component Analysis - Tomas Singliar And
Bayesian Methods for Frequent Terms in Text: Models of.. - Airoldi, Cohen, Fienberg (2005)

Active bibliography (related documents):
0.8:   Variational Inference for Dirichlet Process Mixtures - David M. Blei, Michael I.. (2006)
0.5:   Graphical Models - Michael Jordan Computer (2003)
0.3:   BAYESIAN STATISTICS 7, pp. 25--43 - Bernardo Bayarri Berger

Similar documents based on text:
0.5:   Journal of Machine Learning Research 3 (2003) 993-1022.. - David Blei Blei
0.5:   Latent Dirichlet Allocation - Blei, Ng, Jordan (2003)
0.3:   The Author-Topic Model for Authors and Documents - Rosen-Zvi, Griffiths..

Related documents from co-citation:
1345:   Mixtures of Dirichlet processes with applications to Bayesian nonparametric prob.. - Antoniak - 1974
1284:   A variational Bayesian framework for graphical models - Attias - 2000
1223:   Matching Words and Pictures - Barnard, Duygulu et al. - 2002

BibTeX entry:

Blei, D., Ng, A., Jordan, M.: Latent Dirichlet allocation. In: NIPS*14. (2002) to appear. http://citeseer.ist.psu.edu/blei03latent.html

@article{ blei02latent,
  author = "D. Blei and A. Ng and M. Jordan",
  title = "Latent Dirichlet allocation",
  journal = "Journal of Machine Learning Research",
  volume = "3",
  pages = "993--1022",
  year = "2003",
  url = "http://www.cs.princeton.edu/~blei/papers/BleiNgJordan2003.pdf",
  note = "citeseer.ist.psu.edu/blei03latent.html" }

Citations (may not include all citations):
1120   Handbook of Mathematical Functions - Abramowitz, Stegun - 1970
268   Making large-scale SVM learning practical - Joachims - 1999
261   Bayesian data analysis - Gelman, Carlin et al. - 1995
245   Introduction to variational methods for graphical models - Jordan, Ghahramani et al. - 1999
202   Statistical Methods for Speech Recognition - Jelinek - 1997
140   Text classification from labeled and unlabeled documents usi.. - Nigam, McCallum et al. - 2000
123   Probabilistic latent semantic indexing - Hofmann - 1999
120   A variational Bayesian framework for graphical models - Attias - 2000
110   Learning in Graphical Models - Jordan - 1999
64   Latent semantic indexing: A probabilistic analysis - Papadimitriou, Tamaki et al. - 1998
49   Overview of the first text retrieval conference - Harman - 1992
37   Theory of probability - de Finetti - 1990
30   Exchangeability and related topics - Aldous - 1985
27   An experimental comparison of several clustering and initial.. - Heckerman, Meila - 2001
22   Probabilistic models for unified collaborative and content-b.. - Popescul, Ungar et al. - 2001
15   Estimating a Dirichlet distribution - Minka - 2000
14   Expectation-propagation for the generative aspect model - Minka, Lafferty - 2002
14   Approximate Bayesian inference in conditionally independent .. - Kass, Steffey - 1989
12   Recent progress on de Finetti's notions of exchangeability - Diaconis - 1988
11   Improving multi-class text classification with naive Bayes - Rennie - 2001
5   Maximum likelihood estimation of Dirichlet distributions - Ronning - 1989
4   A probabilistic approach to semantic representation - Griffiths, Steyvers - 2002
3   Multiple hypergeometric functions: Probabilistic interpretat.. - Dickey - 1983
2   General lower bounds based on computer generated higher orde.. - Leisink, Kappen - 2002
1   Bayesian methods for censored categorical data - Dickey, Jiang et al. - 1987
1   Berkeley Computer Science Division - Blei, Jordan et al. - 2002
1   URL http://elegans - Avery, center - 1999
1   With discussion - Morris, Bayes et al. - 1999





Documents on the same site (http://jmlr.csail.mit.edu/papers/v3/):
Kernel Independent Component Analysis - Bach, Jordan (2002)
On Online Learning of Decision Lists - Nevo, El-Yaniv (2002)
Learning Precise Timing with LSTM Recurrent Networks - Gers, Schraudolph, Schmidhuber (2002)


CiteSeer.IST - Copyright Penn State and NEC