Download:
|
by Zoubin Ghahramani, Geoffrey E. Hinton
ftp://ftp.cs.toronto.edu/pub/zoubin/scaling.ps.gz
Add To MetaCart
Abstract:
A persistent worry with computational models of unsupervised learning is that learning will become more difficult as the problem is scaled. We examine this issue in the context of a novel hierarchical, generative model that can be viewed as a nonlinear generalization of factor analysis and can be implemented in a neural network. The model performs perceptual inference in a probabilistically consistent manner by using top-down, bottom-up and lateral connections. These connections can be learned using simple rules that require only locally available information. We first demonstrate that the model can extract a sparse, distributed, hierarchical representation of depth from simplified random-dot stereograms. We then investigate some of the scaling properties of the algorithm on this problem and find that: (1) Increasing the image size leads to faster and more reliable learning; (2) Increasing the depth of the network from one to two hidden layers leads to better representations at the first hidden layer, and (3) Once one part of the network has discovered how to represent depth, it "supervises " other parts of the network, greatly speeding up their learning.
Citations
|
2322
|
Stochastic relaxation, Gibbs distributions and the Bayesian restoration of images
– Geman, Geman
- 1984
|
|
750
|
Self organzed formation of topologically correct feature maps
– Kohonen
- 1982
|
|
419
|
Emergence of simple-cell receptive field properties by learning a sparse code for natural images
– Olshausen, Field
|
|
273
|
A simplified neuron model as a principal component analyzer
– Oja
- 1982
|
|
247
|
Self organization in a perceptual network
– Linsker
- 1988
|
|
211
|
Learning and relearning in Boltzmann machines
– Hinton, Sejnowski
- 1986
|
|
182
|
The wake-sleep algorithm for unsupervised neural networks
– Hinton, Dayan, et al.
- 1995
|
|
171
|
Unsupervised learning
– Barlow
- 1989
|
|
154
|
The Helmholtz machine
– Dayan, Hinton, et al.
- 1995
|
|
150
|
Connectionist learning of belief networks
– Neal
- 1992
|
|
134
|
An analogue approach to the travelling salesman problem using an elastic net method
– Durbin, Willshaw
- 1987
|
|
114
|
Hinton,“Self-organizing neural network that discovers surfaces in random-dot stereograms
– Becker, E
- 1992
|
|
111
|
The ART of Adaptive Pattern Recognition by a Self-Organizing Neural Network
– Carpenter, Grossberg
- 1988
|
|
94
|
An Introduction to Latent Variable Models
– Everitt
- 1984
|
|
52
|
Optimal perceptual inference
– Hinton, Sejnowski
- 1983
|
|
18
|
Bayesian unsupervised learning of higher order structure
– Lewicki, Sejnowski
- 1997
|