(Enter summary)
Abstract: Non-parametric models and techniques enjoy a growing popularity in the field of
machine learning, and among these Bayesian inference for Gaussian process (GP)
models has recently received significant attention. We feel that GP priors should
be part of the standard toolbox for constructing models relevant to machine
learning in the same way as parametric linear models are, and the results in this
thesis help to remove some obstacles on the way towards this goal. (Update)
Cited by: More
Sparse Gaussian Process Classification with Multiple Classes - Seeger, Jordan (2004)
(Correct)
Extensions of the Informative Vector Machine - Lawrence, Platt, Jordan
(Correct)
Sparse Gaussian Processes using Pseudo-inputs - Edward Snelson Zoubin (2006)
(Correct)
Active bibliography (related documents): More All
6.3: Gaussian Processes for Machine Learning - Seeger (2004)
(Correct)
1.4: Variational Inference for Dirichlet Process Mixtures - David M. Blei, Michael I.. (2006)
(Correct)
1.2: PAC-Bayesian Theorems for Gaussian Process Classification - Seeger (2002)
(Correct)
Similar documents based on text: More All
0.3: PAC-Bayesian Generalisation Error Bounds for Gaussian Process.. - Seeger (2002)
(Correct)
0.1: Fast Forward Selection to Speed Up Sparse Gaussian.. - Seeger, Williams.. (2003)
(Correct)
0.1: PAC-Bayesian Generalization Error Bounds for Gaussian Process.. - Seeger (2002)
(Correct)
Related documents from co-citation: More All
4: Fast sparse Gaussian process methods: The informative vector machine
- Lawrence, Seeger et al.
3: Bayesian Gaussian Processes for Regression and Classification (context) - Gibbs - 1997
3: Covariance kernels from Bayesian generative models
- Seeger - 2000
BibTeX entry: (Update)
M. Seeger. Bayesian Gaussian Process Models: PAC-Bayesian Generalisation Error Bounds and Sparse Approximations. PhD thesis, University of Edinburgh, July 2003. See www.cs.berkeley.edu/~mseeger. http://citeseer.ist.psu.edu/seeger03bayesian.html More
@misc{ seeger03bayesian,
author = "M. Seeger",
title = "Bayesian Gaussian Process Models: PAC-Bayesian Generalisation Error Bounds
and Sparse Approximations",
text = "M. Seeger. Bayesian Gaussian Process Models: PAC-Bayesian Generalisation
Error Bounds and Sparse Approximations. PhD thesis, University of Edinburgh,
July 2003. See www.cs.berkeley.edu/~mseeger.",
year = "2003",
url = "citeseer.ist.psu.edu/seeger03bayesian.html" }
Citations (may not include all citations):
2528
Maximum likelihood from incomplete data via the EM algorithm (context) - Dempster, Laird et al. - 1977
2319
Elements of Information Theory (context) - Cover, Thomas - 1991 ACM
1662
Neural Networks for Pattern Recognition (context) - Bishop - 1995 ACM DBLP
947
Statistical Learning Theory (context) - Vapnik - 1998 ACM
640
Princeton University Press (context) - Rockafellar - 1970
525
Iterative Methods for Sparse Linear Systems (context) - Saad - 1996 ACM
524
Support vector networks
- Cortes, Vapnik - 1995
520
Generalized linear models (context) - Nelder, Wedderburn - 1972
500
Experiments with a new boosting algorithm
- Freund, Schapire - 1996 DBLP
493
Communications of the ACM (context) - Valiant, of et al. - 1984
472
Hierarchical mixtures of experts and the EM algorithm
- Jordan, Jacobs - 1994 ACM
466
Probability and Measure (context) - Billingsley - 1995
461
Markov Chain Monte Carlo in Practice (context) - Gilks, Richardson et al. - 1996
454
the uniform convergence of relative frequencies of events to.. (context) - Vapnik, Chervonenkis - 1971
452
Statistical Decision Theory and Bayesian Analysis (context) - Berger - 1985
422
Networks for approximation and learning (context) - Poggio, Girosi - 1990
335
Statistics for Spatial Data (context) - Cressie - 1993
269
Bayesian Learning for Neural Networks (context) - Neal - 1996 ACM
248
An Introduction to Computational Learning Theory (context) - Kearns, Vazirani - 1994 ACM
245
An extended set of FORTRAN basic linear algebra subprograms
- Dongarra, Croz et al. - 1988 ACM
236
Additive logistic regression: a statistical view of boosting
- Friedman, Hastie et al. - 1998
228
Nonlinear dimensionality reduction by locally linear embeddi..
- Roweis, Saul - 2000
212
Methods of Numerical Integration (context) - Davis, Rabinovitz - 1984
206
Cambridge University Press (context) - Press, Teukolsky et al. - 1992
199
Probabilistic inference using Markov chain Monte Carlo metho..
- Neal - 1993
196
Bayesian Inference in Statistical Analysis (context) - Box, Tiao - 1992
191
Fast training of support vector machines using sequential mi.. (context) - Platt
184
A global geometric framework for nonlinear dimensionality re.. (context) - Tenenbaum, de Silva et al. - 2000
175
Stochastic Simulation (context) - Ripley - 1987 ACM
165
Generalized iterative scaling for log-linear models (context) - Darroch, Ratcli - 1972
161
Nonparametric Regression and Generalized Linear Models (context) - Green, Silverman - 1994
149
Theory of reproducing kernels (context) - Aronszajn - 1950
148
The perceptron: A probabilistic model for information storag.. (context) - Rosenblatt - 1958
147
A Course in Probability Theory (context) - Chung - 1974
146
Monte Carlo Statistical Methods
- Robert, Casella - 1999
143
An Introduction to Support Vector Machines and Other Kernel .. (context) - Cristianini, Shawe-Taylor - 2000
132
Theory of Optimal Experiments (context) - Fedorov - 1972
129
Probability in Banach Spaces (context) - Ledoux, Talagrand - 1991
120
A variational Bayesian framework for graphical models
- Attias
119
Exploiting generative models in discriminative classifiers
- Jaakkola, Haussler ACM DBLP
113
Learning with Kernels (context) - Scholkopf, Smola - 2002 ACM
113
Tractable inference for complex stochastic processes
- Boyen, Koller - 1998 DBLP
111
Active learning with statistical models
- Cohn, Ghahramani et al. - 1996 ACM DBLP
110
Learning in Graphical Models (context) - Jordan - 1997 ACM
110
Geometry of Random Fields (context) - Adler - 1981
105
Information-based objective functions for active data select..
- MacKay - 1991 ACM
104
A new view on the EM algorithm that justifies incremental an..
- Hinton, Neal
98
maximum likelihood and the EM algorithm (context) - Redner, Walker - 1984
97
Computational Learning Theory (context) - Anthony, Biggs - 1997 ACM
96
Backpropagation applied to handwritten zip code recognition (context) - LeCun, Boser et al. - 1989
88
Propagation algorithms for variational Bayesian learning
- Ghahramani, Beal DBLP
88
Neural Network Learning: Theoretical Foundations (context) - Anthony, Bartlett - 1999
85
Bounds on the sample complexity of Bayesian learning using i..
- Haussler, Kearns et al. - 1994 ACM DBLP
85
Prediction with Gaussian processes: From linear regression t..
- Williams
83
Query by committee
- Seung, Opper et al. - 1992 ACM DBLP
82
Generalized belief propagation
- Yedidia, Freeman et al.
78
Boosting the margin: A new explanation for the e#ectiveness .. (context) - Schapire, Freund et al. - 1998
78
Boosting the margin: A new explanation for the e#ectiveness .. (context) - Schapire, Freund et al. - 1998
77
Probabilistic outputs for support vector machines and compar..
- Platt
74
Loopy belief propagation for approximate inference: An empir..
- Murphy, Weiss et al. - 1999 DBLP
72
Turbo decoding as an instance of Pearl's belief propagation .. (context) - McEliece, MacKay et al. - 1998
71
The connection between regularization operators and support ..
- Smola, Scholkopf et al. - 1998 ACM
67
Information-theoretic asymptotics of Bayes methods (context) - Clarke, Barron - 1990
66
Information Theory: Coding Theorems for Discrete Memoryless .. (context) - Csiszar, Korner - 1981
63
Weak Convergence and Empirical Processes (context) - van der Waart, Wellner - 1996
62
Selective sampling using the query by committee algorithm
- Freund, Seung et al. - 1997 ACM DBLP
61
On convergence properties of the EM algorithm for Gaussian m..
- Xu, Jordan - 1996 ACM
58
Dynamic Bayesian Networks: Representation (context) - Murphy - 2003
55
Probable networks and plausible predictions -- a review of p..
- MacKay - 1995
53
Evaluation of Gaussian Processes and Other Methods for Nonli..
- Rasmussen - 1996
51
Convolution kernels on discrete structures
- Haussler - 1999
51
Kernel independent component analysis
- Bach, Jordan - 2002 ACM DBLP
50
Sparse Bayesian learning and the relevance vector machine
- Tipping - 2001 ACM DBLP
49
A correspondence between Bayesian estimation of stochastic p.. (context) - Kimeldorf, Wahba - 1970
48
Advances in Kernel Methods: Support Vector Learning (context) - Scholkopf, Burges et al. - 1998
48
Maximum entropy discrimination
- Jaakkola, Meila et al. ACM DBLP
47
Sparse greedy matrix approximation for machine learning
- Smola, Scholkopf ACM DBLP
47
Monte Carlo implementation of Gaussian process models for Ba..
- Neal - 1997
46
Some PAC-Bayesian theorems
- McAllester - 1999 ACM DBLP
46
Selecting concise training sets from clean data (context) - Plutowski, White - 1993
42
Deterministic annealing for clustering (context) - Rose - 1998
41
A Family of Algorithms for Approximate Bayesian Inference
- Minka - 2001 ACM
41
Correlation Theory of Stationary and Related Random Function.. (context) - Yaglom - 1987
39
reproducing kernel Hilbert spaces and the randomized GACV (context) - Wahba, machines
39
Probabilistic kernel regression models
- Jaakkola, Haussler - 1999
39
Estimation and Tracking: Principles (context) - Bar-Shalom, Li - 1993
38
Information geometry and alternating minimization procedures (context) - Csiszar, Tusnady - 1984
35
Correctness of belief propagation in Gaussian graphical mode..
- Weiss, Freeman DBLP
35
Support vector machine active learning with applications to ..
- Tong, Koller - 2001 ACM DBLP
34
ciency of tests of a hypothesis based on the sum of observat.. (context) - Cherno, of - 1952
33
Foundations of the Theory of Probability (context) - Kolmogorov - 1933
32
A result of Vapnik with applications
- Anthony, Shawe-Taylor - 1993 ACM DBLP
32
Introduction to Gaussian processes (context) - MacKay - 1997
32
Incremental and decremental support vector machine learning
- Cauwenberghs, Poggio DBLP
31
BUGS: Bayesian inference using Gibbs sampling (context) - Spiegelhalter, Thomas et al. - 1995
26
Empirical margin distributions and bounding the generalizati..
- Koltchinskii, Panchenko - 2002
26
Variational Methods for Inference and Estimation in Graphica..
- Jaakkola - 1997 ACM
26
Bayesian Gaussian Processes for Regression and Classificatio.. (context) - Gibbs - 1997
25
Expectation propagation for approximate Bayesian inference (context) - Minka - 2001 ACM DBLP
25
A sharp concentration inequality with applications
- Boucheron, Lugosi et al. - 2000 DBLP
25
Classification using hierarchical mixtures of experts
- Waterhouse, Robinson - 1994
24
Model selection and error estimation
- Bartlett, Bocheron et al. - 2000 ACM DBLP
23
Principles of geostatistics (context) - Matheron - 1963
22
Fast Monte-Carlo algorithms for finding low-rank approximati..
- Frieze, Kannan et al. - 1998 ACM DBLP
22
Hybrid adaptive splines
- Luo, Wahba - 1997
21
Interpolation of Spatial Data: Some Theory for Kriging (context) - Stein - 1999
20
A Bayesian committee machine
- Tresp - 2000 DBLP
20
PAC-Bayesian model averaging
- McAllester - 1999 ACM DBLP
20
Sparse greedy Gaussian process regression
- Smola, Bartlett DBLP
19
Advances in Large Margin Classifiers
- Smola, Bartlett et al. - 1999 ACM
19
The Numerical Treatment of Integral Equations (context) - Baker - 1977
18
Hyperbolic Householder algorithms for factoring structured m.. (context) - Cybenko, Berry - 1990 ACM
18
The intrinsic random functions and their applications (context) - Matheron - 1973
17
A new class of upper bounds on the log partition function
- Wainwright, Jaakkola et al. DBLP
17
Learning Kernel Classifiers (context) - Herbrich - 2001 ACM
16
Bayesian model selection for support vector machines
- Seeger
16
Probabilistic methods for support vector machines
- Sollich DBLP
16
Bayesian numerical analysis (context) - Skilling - 1989
15
Cambridge University Press (context) - Horn, Johnson - 1985
15
Ensemble learning for multi-layer networks (context) - Barber, Bishop - 1998 ACM DBLP
15
Structural risk minimization over data-dependent hierarchies
- Shawe-Taylor, Bartlett et al. - 1926
15
Gaussian processes for classification: Mean field algorithms
- Opper, Winther - 2000
14
The infinite Gaussian mixture model
- Rasmussen DBLP
13
Tree-based modeling and estimation of Gaussian processes on ..
- Wainwright, Sudderth et al. DBLP
13
Expectation propagation for approximate inference in dynamic..
- Heskes, Zoeter
13
Modified Cholesky factorizations in interior-point algorithm..
- Wright - 1999
13
An introduction to probabilistic graphical models (context) - Jordan - 2003
12
cient SVM training using low-rank kernel representations (context) - Fine, Scheinberg - 2001
12
Applications of Mathematics: Stochastic Modelling and Applie.. (context) - Devroye, Gyorfi et al. - 1996
12
A Bennett concentration inequality and its application to su..
- Bousquet - 2002
12
Information Theory for Continuous Systems (context) - Ihara - 1993
12
Stability and generalization (context) - Bousquet, Elissee - 2002 ACM DBLP
12
Convex Analysis and Minimization Algorithms II (context) - Hiriart-Urruty, Lemarechal - 1993
12
Bayesian non-linear modeling for the energy prediction compe.. (context) - MacKay - 1994
11
Infinite mixtures of Gaussian process experts
- Rasmussen, Ghahramani DBLP
11
Relating data compression and learnability
- Littlestone, Warmuth - 1986
11
Advances in Neural Information Processing Systems (context) - Becker, Thrun et al. - 2003
10
Sparse online Gaussian processes (context) - Csato, Opper - 2002
9
PAC-Bayesian generalization error bounds for Gaussian proces..
- Seeger - 2002
9
Advances in Neural Information Processing Systems
- Kearns, Solla et al. - 1999
9
Construction of fully symmetric numerical integration formul.. (context) - McNamee, Stenger - 1967
9
Covariance kernels from Bayesian generative models
- Seeger DBLP
9
Advances in Neural Information Processing Systems
- Solla, Leen et al. - 2000
9
PAC-Bayesian generalization error bounds for Gaussian proces..
- Seeger - 2002
8
Gaussian regression and optimal finite dimensional linear mo..
- Zhu, Williams et al. - 1998
8
Gaussian process classification and SVM: Mean field results ..
- Opper, Winther
8
Bayesian methods for support vector machines and Gaussian pr.. (context) - Seeger - 1999
8
Advances in Neural Information Processing Systems (context) - Leen, Dietterich et al. - 2001
8
SIAM Society for Industrial and Applied Mathematics (context) - Wahba, for et al. - 1990
8
Practical Methods of Optimization: Unconstrained Optimizatio.. (context) - Fletcher - 1980
8
usion kernels on graphs and other discrete input spaces (context) - Kondor, La et al. - 2002
8
Advances in Neural Information Processing Systems (context) - Dietterich, Becker et al. - 2002
7
Learning a Gaussian process prior for automatically generati..
- Platt, Burges et al.
7
Cross-validated spline methods for the estimation of three d.. (context) - Nychka, Wahba et al. - 1984
7
PAC-Bayesian stochastic model selection
- McAllester - 2002 ACM DBLP
7
Probability and Random Processes (context) - rey, David - 2001
6
A review of Gaussian random fields and correlation functions (context) - Abrahamsen - 1997
6
Discovering hidden features with Gaussian process regression
- Vivarelli, Williams
6
A PAC-Bayesian margin bound for linear classifiers: Why SVMs..
- Herbrich, Graepel DBLP
6
Rational kernels
- Cortes, Ha et al.
6
Uncertainty in Artificial Intelligence (context) - Darwiche, Friedman - 2002 ACM
5
Concentration inequalities for the missing mass and for hist..
- McAllester, Ortiz ACM DBLP
5
Structured variational distributions in VIBES
- Bishop, Winn
5
Fast forward selection to speed up sparse Gaussian process r..
- Seeger, Williams et al.
5
Regularization with dot-product kernels
- Smola, Ovari et al. DBLP
5
A sparse Bayesian compression scheme - the informative vecto..
- Lawrence, Herbrich - 2001
5
Learning curves for Gaussian processes (context) - Sollich ACM DBLP
4
Practical Methods of Optimization: Constrained Optimization (context) - Fletcher - 1989
4
Spline functions and the problem of graduation (context) - Schonberg - 1964
4
Learning how to learn is learning with point sets
- Minka, Picard - 1997
4
The EP energy function and minimization schemes (context) - Minka - 2001
4
metric entropy and cumulative relative entropy risk (context) - Haussler, Opper - 1997
4
A statistical approach to some basic mine valuation problems.. (context) - Krige - 1951
4
Bernardo and Adrian F (context) - Jose - 1994
4
Some Bayesian numerical analysis (context) - O'Hagan - 1992
4
Using the Nystrom method to speed up kernel machines (context) - Williams, Seeger
4
Bayesian classification with Gaussian processes (context) - Williams, Barber - 1998
4
Analysis of sparse Bayesian learning
- Faul, Tipping DBLP
4
Gaussian process priors with uncertain inputs --- applicatio..
- Girard, Rasmussen et al.
4
Computation with infinite neural networks
- Williams - 1998 ACM DBLP
4
On properly positive hermitian matrices (context) - Moore - 1916
3
Gaussian Processes --- Iterative Sparse Approximations (context) - Csato - 2002
3
Number 37 in Monographs on Statistics and Applied Probabilit.. (context) - McCullach, Nelder et al. - 1983
3
Derivative observations in Gaussian process models of dynami..
- Solak, Murray-Smith et al.
3
Available online at www (context) - Boyd, Vandenberghe - 2002
3
The stability of kernel principal components analysis and it.. (context) - Shawe-Taylor, Williams
3
Gaussian process networks
- Friedman, Nachman - 2000 ACM DBLP
3
Sur un nouveau theoreme limite de la theorie des probabilite.. (context) - Cramer - 1938
3
A fast method for calculating the perceptron with maximal st.. (context) - Rujan - 1993
2
Markov chain Monte Carlo and related topics
- Liu - 1999
2
A PAC analysis of a Bayesian estimator (context) - Shawe-Taylor, Williamson - 1997 ACM DBLP
2
PAC-Bayes and margin (context) - Langford, Shawe-Taylor
2
bounding the true error (context) - Langford, Caruana
2
Data-dependent bounds for Bayesian mixture methods
- Meir, Zhang
2
belief propagation and sparsity (context) - Csato, Opper et al.
2
Sparse Bayesian learning: The informative vector machine (context) - Seeger, Lawrence et al. - 2002
1
Presented at Summer School Machine Learning (context) - Lugosi, Technical - 2003
1
ciency of the iterative proportional fitting procedure (context) - Teh, Welling et al.
1
Pattern classification and learning theory (context) - Lugosi
1
Number 305 in Grundlehren der mathematischen Wissenschaften (context) - Hiriart-Urruty, Lemarechal et al. - 1993
[Article contains additional citations not shown here]
Documents on the same site (http://www.inf.ed.ac.uk/publications/thesis/phd.html): More
An Evolutionary Algorithm Approach to Poetry Generation - Manurung (2003)
(Correct)
Improving Architectural 3D Reconstruction by Constrained Modelling - Cantzler (2003)
(Correct)
From Distributional to Semantic Similarity - Curran (2003)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC