(Enter summary)
Abstract: We present a technique for constructing random fields from a set of training samples. The learning paradigm builds increasingly complex fields by allowing potential functions, or features, that are supported by increasingly large subgraphs. Each feature has a weight that is trained by minimizing the Kullback-Leibler divergence between the model and the empirical distribution of the training data. A greedy algorithm determines how features are incrementally added to the field and an iterative... (Update)
Cited by: More
A Model of Lexical Attraction and Repulsion - Doug Beeferman Adam
(Correct)
Large Scale Use of Common Sense for Activity Recognition and.. - Pentney (2005)
(Correct)
Unknown -
(Correct)
Similar documents (at the sentence level):
56.5%: Inducing Features of Random Fields - Pietra, Pietra, Lafferty (1997)
(Correct)
51.2%: IEEE TRANSACTIONS PATTERN ANALYSIS AND MACHINE.. - Stephen Della Pietra (1980)
(Correct)
22.4%: Maximum Entropy And Iterative Scaling - Pietra, Pietra, Lafferty
(Correct)
Active bibliography (related documents): More All
0.5: Selection And Information: A Class-based Approach to Lexical.. - Resnik (1993)
(Correct)
0.5: Enriching Object-Oriented Methods with Domain Specific Knowledge.. - Frank (1997)
(Correct)
0.3: Additive Models, Boosting, and Inference for Generalized.. - Lafferty (1999)
(Correct)
Similar documents based on text: More All
0.7: Grammatical Trigrams: A New Approach To Statistical Language .. - Sleator, Lafferty (1997)
(Correct)
0.6: Stochastic Attribute-Value Grammars - Abney (1997)
(Correct)
0.5: Duality and Auxiliary Functions for Bregman Distances - Pietra, Pietra, Lafferty (2001)
(Correct)
Related documents from co-citation: More All
41: A Maximum Entropy Approach to Natural Language Processing
- Berger, Pietra et al. - 1996
39: Generalized Iterative Scaling for Log-Linear Models (context) - Darroch, Ratcliff - 1972
19: A Maximum Entropy Approach to Adaptive Statistical Language Modeling
- Rosenfeld - 1996
BibTeX entry: (Update)
S. Della Pietra, V. Della Pietra, and J. Lafferty, "Inducing features of random fields," In IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 19, no. 4, pp. 380-393, April 1997. http://citeseer.ist.psu.edu/dellapietra95inducing.html More
@article{ pietra97inducing,
author = "Della Pietra, Stephen and Della Pietra, Vincent J. and John D. Lafferty",
title = "Inducing Features of Random Fields",
journal = "IEEE Transactions on Pattern Analysis and Machine Intelligence",
volume = "19",
number = "4",
pages = "380-393",
year = "1997",
url = "citeseer.ist.psu.edu/dellapietra95inducing.html" }
Citations (may not include all citations):
2528
Maximum likelihood from incomplete data via the EM algorithm (context) - Dempster, Laird et al. - 1977
1262
Classification and Regression Trees (context) - Breiman, Friedman et al. - 1984
548
Stochastic relaxation, Gibbs distributions, and the Bayesian.. (context) - Geman, Geman - 1984
376
A learning algorithm for Boltzmann machines (context) - Ackley, Hinton et al. - 1985
326
An inequality and associated maximization technique in stati.. (context) - Baum - 1972
219
A statistical approach to machine translation
- Brown, Cocke et al. - 1990
213
A maximum entropy approach to natural language processing
- Berger, Pietra et al. - 1995
165
Generalized iterative scaling for log-linear models (context) - Darroch, Ratcliff - 1972
88
Class-based n-gram models of natural language
- Brown, Pietra et al. - 1992
56
Divergence geometry of probability distributions and minimiz.. (context) - Csisz'ar - 1975
52
Random fields and inverse problems in imaging (context) - Geman - 1990
44
The power of amnesia
- Ron, Singer et al. - 1994
38
Best-first model merging for hidden Markov model induction
- Stolcke, Omohundro
35
Information geometry and alternating minimization procedures (context) - Csisz'ar, Tusn'ady - 1984
35
Data compression using dynamic Markov modelling
- Cormack, Horspool - 1987
14
A note on approximations to discrete probability distributio.. (context) - Brown - 1959
14
Alternating minimization and Boltzmann machine learning (context) - Byrne - 1992
13
Optimal spectral structure of reversible stochastic matrices.. (context) - Frigessi, Hwang et al. - 1992
12
Noncausal Gauss Markov random fields: Parameter structure an.. (context) - Balram, Moura - 1993
7
Convergence of some partially parallel Gibbs samplers with a.. (context) - Ferrari, Frigessi et al. - 1993
7
Higher order Boltzmann machines (context) - Sejnowski - 1986
7
A variational method for estimating the parameters of MRF fr.. (context) - Almeida, Gidas - 1993
7
Reidel Publishing Co (context) - Jaynes, Probability et al. - 1983
4
Institute of Mathematical Statistics Lecture Notes--Monograp.. (context) - Brown, Statistical et al. - 1986
4
A geometric interpretation of Darroch and Ratcliff's general.. (context) - Csisz'ar - 1989
4
Partition function estimation of Gibbs random field images u.. (context) - Potamianos, Goutsias - 1993
3
Automatic word classification using features of spellings (context) - Lafferty, Mercer - 1993
2
Englewood Cliffs: Prentice Hall (context) - Bell, Cleary et al. - 1990
2
Parsing as statistical pattern recognition
- Magerman - 1994
The graph only includes citing articles where the year of publication is known.
Documents on the same site (http://www.cs.cmu.edu/People/clamen/reports/1995.html): More
A Programming Interface for Application-Aware.. - Noble, Price.. (1995)
(Correct)
NESL User's Manual (For NESL Version 3.1) - Blelloch, Sipelstein, Hardwick, .. (1995)
(Correct)
Clustering Learning Tasks and the Selective Cross-Task.. - Thrun, O'Sullivan (1995)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC