(Enter summary)
Abstract: Neural networks can be regarded as statistical models, and can be analysed in a
Bayesian framework. Generalisation is measured by the performance on independent
test data drawn from the same distribution as the training data. Such performance
can be quantified by the posterior average of the information divergence between the
true and the model distributions. Averaging over the Bayesian posterior guarantees
internal coherence; Using information divergence guarantees invariance with respect
to... (Update)
Similar documents based on text: More All
0.7: Bayesian Invariant Measurements of Generalisation for Discrete.. - Zhu, Rohwer (1995)
(Correct)
0.6: Bayesian Invariant Measurements of Generalisation for.. - Zhu, Rohwer (1995)
(Correct)
0.6: Gaussian Regression and Optimal Finite Dimensional Linear Models - al. (1997)
(Correct)
Related documents from co-citation: More All
12: Bayesian invariant measurements of generalisation for continuous distributions
- Zhu, Rohwer - 1995
11: Differential-Geometrical Methods in Statistics (context) - Amari - 1985
7: Second order efficiency of minimum contrast estimators in a curved exponential f.. (context) - Eguchi - 1983
BibTeX entry: (Update)
H. Zhu and R. Rohwer. Information geometric measurements of generalisation. Technical Report NCRG/4350, Dept. Comp. Sci. & Appl. Math., Aston University, August 1995. ftp://cs.aston.ac.uk/neural/zhuh/generalisation.ps.Z. http://citeseer.ist.psu.edu/zhu95information.html More
@misc{ zhu95information,
author = "H. Zhu and R. Rohwer",
title = "Information geometric measurements of generalisation",
text = "H. Zhu and R. Rohwer. Information geometric measurements of generalisation.
Technical Report NCRG/4350, Dept. Comp. Sci. & Appl. Math., Aston University,
August 1995. ftp://cs.aston.ac.uk/neural/zhuh/generalisation.ps.Z.",
year = "1995",
url = "citeseer.ist.psu.edu/zhu95information.html" }
Citations (may not include all citations):
2528
Maximum likelihood from incomplete data via the em algorithm (context) - Dempster, Laird et al. - 1977
1527
Optimization by simulated annealing
- Kirkpatrick, Gelat et al. - 1983
1447
A mathematical theory of communication (context) - Shannon - 1948
643
Equations of state calculations by fast computing machines (context) - Metropolis, Rosenbluth et al. - 1953
509
Sobolev Spaces (context) - Adams - 1975
452
Statistical Decision Theory and Bayesian Analysis (context) - Berger - 1985
376
A learning algorithm for Boltzmann machines (context) - Ackley, Hinton et al. - 1985
373
Cambridge University Press (context) - Hardy, Littlewood et al. - 1952
301
Neural networks and the bias/variance dilemma (context) - Geman, Bienenstock et al. - 1992
277
Information Theory and Statistics
- Kullback - 1959
196
Bayesian Inference in Statistical Analysis (context) - Box, Tiao - 1973
117
Chapman and Hall (context) - Cox, Hinkley - 1974
101
Differential-Geometrical Methods in Statistics (context) - Amari - 1985
99
Learning in artificial neural networks: A statistical perspe.. (context) - White - 1989
88
Conditional independence in statistical theory (context) - Dawid - 1979
84
Neuron-like elements that can solve difficult learning contr.. (context) - Barto, Sutton et al. - 1983
83
Theory of Function Spaces (context) - Triebel - 1983
63
California Institute of Technology (context) - MacKay, for et al. - 1992
62
Information geometry of the EM and em algorithms for neural ..
- Amari - 1994
59
and Applications (context) - Abraham, Marsden et al. - 1983
50
Modeling Brain Function: The World of Attractor Neural netwo.. (context) - Amit - 1989
45
Advances in Neural Information Processing Systems (context) - Hanson, Cowan et al. - 1993
34
Theory of statistical estimation (context) - Fisher - 1925
28
frequency and reasonable expectations (context) - Cox - 1946
24
Bayesian learning via stochastic dynamics (context) - Neal
23
Sequential decision problems and neural networks (context) - Barto, Sutton et al. - 1990
22
Differential geometry of curved exponential families---curva.. (context) - Amari - 1982
18
Applied Functional Analysis (context) - Aubin - 1979
17
Bayesian invariant measurements of generalisation for discre..
- Zhu, Rohwer - 1995
17
Bayesian invariant measurements of generalisation for contin..
- Zhu, Rohwer - 1995
15
The geometry of asymptotic inference (context) - Kass - 1989
14
The EM algorithm and information geometry in neural network .. (context) - Amari - 1995
12
Second order efficiency of minimum contrast estimators in a .. (context) - Eguchi - 1983
12
Contributions to Mathematical Statistics (context) - Fisher - 1950
12
Application of the Radon-Nikodym theorem to the theory of su.. (context) - Halmos, Savage - 1949
11
Differential Geometry in Statistical Inference (context) - Amari, Barndoff-Nieldon et al. - 1987
11
Two new properties of mathematical likelihood (context) - Fisher - 1934
10
Use of the Gibbs samplers in expert systems (context) - York - 1992
9
Parameterization of non-linear models (context) - Hougaard - 1982
7
Statistical manifolds (context) - Lauritzen
7
Marginalization paradoxes in Bayesian and structural inferen.. (context) - Dawid, Stone et al. - 1973
7
Differential geometrical theory of statistics (context) - Amari
7
The role of differential geometry in statistical theory (context) - Barndorff-Nielsen, Cox et al. - 1986
6
Canonical parameterization and zero parameter effects curvat.. (context) - Kass - 1984
5
The interpretation of improper prior distributions as limits.. (context) - Akaike - 1980
5
Wiley Series in Probability and Mathematical Statistics (context) - Zacks, of - 1971
5
Bayes estimation for the linear model (context) - Lindley, Smith - 1972
4
Neural Networks and Adaptive Computers: Theory and Methods o.. (context) - Zhu - 1993
4
the use of evidence in neural neworks (context) - Wolpert
4
Mathematical Statistics : A Decision Theoretic Approach (context) - Fersuson - 1967
3
Optimal Statistical Decisiosns (context) - DeGroot - 1970
3
Measurements of generalisation based on information geometry (context) - Zhu, Rohwer - 1995
3
Uncertain inference (context) - Fisher - 1936
3
The Advanced Theory of Statistics: Inference and Relationshi.. (context) - Kendall, Stuart - 1979
2
Inverse probability (context) - Fisher - 1930
2
or integrate out (context) - MacKay, Optimise - 1993
1
an and D. Geman. Stochastic relaxation, Gibbs distribution, .. (context) - Gem - 1984
1
Can we get something out of nothing (context) - Zhu - 1995
The graph only includes citing articles where the year of publication is known.
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC