(Enter summary)
Abstract: We present a tree-structured architecture for supervised learning. The statistical
model underlying the architecture is a hierarchical mixture model in which both
the mixture coefficients and the mixture components are generalized linear models
(GLIM's). Learning is treated as a maximum likelihood problem; in particular, we
present an Expectation-Maximization (EM) algorithm for adjusting the parameters
of the architecture. We also develop an on-line learning algorithm in which the... (Update)
Cited by: More
Pattern Analysis and Applications manuscript No. - Will Be Inserted
(Correct)
Sensitivity Analysis of the Result in Binary Decision Trees - Alvarez (2004)
(Correct)
Correlation Clustering for Learning Mixtures of Canonical.. - Fern Brodley Friedl
(Correct)
Active bibliography (related documents): More All
2.0: Hierarchical mixtures of experts and the EM algorithm - Jordan, Jacobs (1994)
(Correct)
0.5: A Statistical Approach to Decision Tree Modeling - Jordan (1994)
(Correct)
0.2: Constructive Learning Techniques for Designing Neural Network.. - Campbell (1997)
(Correct)
Similar documents based on text:
94.0: Unknown -
(Correct)
Related documents from co-citation: More All
35: Maximum Likelihood from Incomplete Data via the EM Algorithm (context) - Dempster, Laird et al. - 1977
32: Adaptive Mixtures of Local Experts (context) - Jacobs, Jordan et al. - 1991
22: Classification and Regression Trees (context) - Breiman, Friedman et al. - 1984
BibTeX entry: (Update)
Jordan, M. I., & Jacobs, R. A. (1994). Hierarchical mixtures of experts and the EM algorithm. Neural Computation, 6, 181-214. http://citeseer.ist.psu.edu/jordan94hierarchical.html More
@techreport{ jordan93hierarchical,
author = "Michael I. Jordan and Robert A. Jacobs",
title = "Hierarchical Mixtures of Experts and the {EM} Algorithm",
number = "AIM-1440",
pages = "29",
year = "1993",
url = "citeseer.ist.psu.edu/jordan94hierarchical.html" }
Citations (may not include all citations):
2133
Pattern Classification and Scene Analysis (context) - Duda, Hart - 1973
1359
Induction of decision trees (context) - Quinlan - 1986
1262
Classification and Regression Trees (context) - Breiman, Friedman et al. - 1984
815
Adaptive Filter Theory (context) - Haykin - 1991
744
Matrix Computations (context) - Golub, Van Loan - 1989
623
Multilayer feedforward networks are universal approximators (context) - Hornik, Stinchcombe et al. - 1989
520
Generalized Linear Models (context) - McCullagh, Nelder - 1983
450
Generalized Additive Models
- Hastie, Tibshirani - 1990
413
Adaptive mixtures of local experts (context) - Jacobs, Jordan et al. - 1991
310
Statistical Analysis of Finite Mixture Distributions (context) - Titterington, Smith et al. - 1985
273
Multivariate adaptive regression splines (context) - Friedman - 1991
244
Theory and practice of recursive identification (context) - Ljung, Soderstrom - 1986
203
Statistical Analysis with Missing Data (context) - Little, Rubin - 1987
185
Inferring decision trees using the Minimum Description Lengt.. (context) - Quinlan, Rivest - 1989
153
Autoclass: A Bayesian classification system (context) - Cheeseman, Kelly et al. - 1988
149
Multivariate Density Estimation (context) - Scott - 1992
138
Learning and relearning in Boltzmann machines (context) - Hinton, Sejnowski - 1986
135
Neural networks and the bias/variance dilemma (context) - Geman, Bienenstock et al. - 1992
132
the convergence properties of the EM algorithm (context) - Wu - 1983
130
Probabilistic interpretation of feedforward classification n.. (context) - Bridle - 1989
98
Consistent nonparametric regression (context) - Stone - 1977
83
Learning classification trees
- Buntine - 1991
61
Theoretical Statistics (context) - Cox, Hinkley - 1977
58
A general regression neural network (context) - Specht - 1991
52
The Analysis of Binary Data (context) - Cox - 1970
49
Maximum likelihood competitive learning (context) - Nowlan - 1990
48
Soft competitive adaptation: Neural network learning algorit.. (context) - Nowlan - 1991
46
Hierarchies of adaptive experts (context) - Jordan, Jacobs - 1992
45
Auto-association by multilayer perceptrons and singular valu.. (context) - Bourlard, Kamp - 1988
42
Mixture densities (context) - Redner, Walker - 1984
26
Fast learning in multi-resolution hierarchies (context) - Moody - 1989
22
A tree-structured adaptive network for function approximatio.. (context) - Sanger - 1991
19
An incremental method for finding multivariate splits for de..
- Utgoff, Brodley - 1990
8
Soft classification (context) - Wahba, Gu et al. - 1993
7
A new approach to estimating switching regressions (context) - Quandt, Ramsey - 1972
6
Statistical Methods in Biological Assay (context) - Finney - 1973
5
Convergence properties of the EM approach to learning in mix.. (context) - Jordan, Xu - 1993
4
Neural trees---using neural nets in a tree classifier struct.. (context) - Stromberg, Zrida et al. - 1991
3
OC1: A randomized algorithm for building oblique decision tr.. (context) - Murthy, Kasif et al. - 1993
2
The moving basin: Effective action search in forward models (context) - Fun, Jordan - 1993
The graph only includes citing articles where the year of publication is known.
Documents on the same site (http://www.cs.berkeley.edu/~jordan/publications.html): More
Improving the Mean Field Approximation via the Use of.. - Jaakkola, Jordan (1998)
(Correct)
Variational probabilistic inference and the QMR-DT database - Jaakkola, Jordan (1998)
(Correct)
Hidden Markov decision trees - Jordan, Ghahramani, Saul (1997)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC