(Enter summary)
Abstract: Real-world learning tasks often involve high-dimensional data sets with complex patterns of missing
features. In this paper we review the problem of learning from incomplete data from two statistical
perspectives---the likelihood-based and the Bayesian. The goal is two-fold: to place current neural network
approaches to missing data within a statistical framework, and to describe a set of algorithms, derived
from the likelihood-based framework, that handle clustering, classification, and... (Update)
Context of citations to this paper: More
...the E step, the expectation of Znj is E[znj Itr Oj] rnj (equation (1. 2) measured only on tr (the observed dimensions of 4 Following [5], we introduce t = E(tlzj = 1, t2,0j) y )Od. moo.o l(t (y)old) 1.3) which is the least squares regression between tn TM and tr...
Cited by: More
A Comparison of Informative and Discriminative Estimation of.. - Goodman (2000)
(Correct)
Radar Esm With A What-And-Where Fusion Neural Network - Granger, Rubin, Grossberg.. (2001)
(Correct)
Scale and Translation Invariant Collaborative Filtering Systems - Lemire (2003)
(Correct)
Similar documents (at the sentence level):
9.1%: Supervised learning from incomplete data via an EM approach - Ghahramani, Jordan (1994)
(Correct)
7.8%: Supervised Learning From Incomplete - Data Via An
(Correct)
Active bibliography (related documents): More All
0.5: A Comprehensive Case Study: An Examination of Machine Learning.. - Zarndt (1995)
(Correct)
0.5: Bayesian Motion Estimation and Segmentation - Weiss (1998)
(Correct)
0.2: On Computing the Largest Fraction of Missing Information for the.. - Fraley (1998)
(Correct)
Similar documents based on text: More All
1.6: Unknown -
(Correct)
0.6: Active Learning with Statistical Models - Cohn, al. (1995)
(Correct)
0.5: Factorial Hidden Markov Models - Ghahramani, Jordan (1996)
(Correct)
Related documents from co-citation: More All
8: Maximum Likelihood from Incomplete Data via the EM Algorithm (context) - Dempster, Laird et al. - 1977
6: Neural Networks for Pattern Recognition (context) - Bishop - 1995
4: Pattern classification and scene analysis (context) - Duda, Hart - 1973
BibTeX entry: (Update)
Z. Ghahramani and M. I. Jordan, "Learning from Incomplete Data," by ftp to publications. ai.mit.edu. Massachusetts Institute of Technology, Artificial Intelligence Laboratory and Center for Biological and Computational Learning, Department of Brain and Cognitive Sciences, December 10, 1995. http://citeseer.ist.psu.edu/ghahramani95learning.html More
@techreport{ ghahramani94learning,
author = "Zoubin Ghahramani and Michael I. Jordan",
title = "Learning from Incomplete Data",
number = "AIM-1509",
pages = "11",
year = "1994",
url = "citeseer.ist.psu.edu/ghahramani95learning.html" }
Citations (may not include all citations):
1359
Induction of decision trees (context) - Quinlan - 1986
1262
Classification and Regression Trees (context) - Breiman, Friedman et al. - 1984
643
Equation of state calculations by fast computing machines (context) - Metropolis, Rosenbluth et al. - 1953
472
Hierarchical mixtures of experts and the EM algorithm
- Jordan, Jacobs - 1994
317
Monte carlo sampling methods using markov chains and their a.. (context) - Hastings - 1970
303
Stochastic relaxation (context) - Geman, Geman - 1984
273
Multivariate adaptive regression splines (context) - Friedman - 1991
205
Mixture models: Inference and applications to clustering (context) - McLachlan, Basford - 1988
203
Statistical Analysis with Missing Data (context) - Little, Rubin - 1987
196
Bayesian Inference in Statistical Analysis (context) - Box, Tiao - 1973
138
Learning and relearning in Boltzmann machines (context) - --, Sejnowski - 1986
99
Learning in artificial neural networks: A statistical perspe.. (context) - White - 1989
82
UCI Repository of machine learning databases [Machine-readab.. (context) - Murphy, Aha - 1992
71
A theory of networks for approximation and learning
- Poggio, Girosi - 1989
59
Unknown attribute values in induction
- Quinlan - 1989
58
A general regression neural network (context) - Specht - 1991
57
Discriminant analysis by Gaussian mixtures
- Hastie, Tibshirani - 1994
48
Soft Competitive Adaptation: Neural Network Learning Algorit.. (context) - Nowlan - 1991
20
Mixture density networks
- Bishop - 1994
9
and Rubin (context) - Dempster, Laird - 1977
6
Maximum likelihood estimation for mixed continuous and categ.. (context) - Little, Schluchter - 1985
4
Analysis of Incomplete Multivariate Data by Simulation (context) - Schafer - 1994
2
Multi-valued standard regularization theory (context) - Shizawa - 1993
2
Classification of incomplete data using neural networks (context) - Southcott, Bogner - 1993
1
Training neural networks with deficient data (context) - American, Association et al. - 1994
1
Royal Statistical Society Series B (context) - from, via et al. - 1973
1
Fast learning in networks of locally-tuned processing units (context) - Chemical, -- et al. - 1989
1
The calculation of posterior distributions by data augmentat.. (context) - Trans, -- et al. - 1987
1
Autoclass: A bayesian classification system (context) - Systems, Cheeseman et al. - 1988
1
Bayesian backpropagation (context) - Group, CA et al. - 1991
The graph only includes citing articles where the year of publication is known.
Documents on the same site (http://www.neurop2.ruhr-uni-bochum.de/MIT_lit.html): More
MIT SchMUSE: Class-Based Remote Delegation in a.. - Blair, Cohen.. (1993)
(Correct)
The Logical Problem of Language Change - Niyogi (1995)
(Correct)
Aeroacoustics on Non-Dedicated Workstations - Skordos (1995)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC