See this document in CiteSeerX!

Bayesian Gaussian Process Models: PAC-Bayesian Generalisation Error Bounds and Sparse Approximations (2003)  (Make Corrections)  (5 citations)
Matthias Seeger



  Home/Search   Context   Related

 
View or download:
inf.ed.ac.uk/publication...IP030012.pdf
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  inf.ed.ac.uk/publications/t...phd (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: Non-parametric models and techniques enjoy a growing popularity in the field of machine learning, and among these Bayesian inference for Gaussian process (GP) models has recently received significant attention. We feel that GP priors should be part of the standard toolbox for constructing models relevant to machine learning in the same way as parametric linear models are, and the results in this thesis help to remove some obstacles on the way towards this goal. (Update)

Cited by:   More
Sparse Gaussian Process Classification with Multiple Classes - Seeger, Jordan (2004)   (Correct)
Extensions of the Informative Vector Machine - Lawrence, Platt, Jordan   (Correct)
Sparse Gaussian Processes using Pseudo-inputs - Edward Snelson Zoubin (2006)   (Correct)

Active bibliography (related documents):   More   All
6.3:   Gaussian Processes for Machine Learning - Seeger (2004)   (Correct)
1.4:   Variational Inference for Dirichlet Process Mixtures - David M. Blei, Michael I.. (2006)   (Correct)
1.2:   PAC-Bayesian Theorems for Gaussian Process Classification - Seeger (2002)   (Correct)

Similar documents based on text:   More   All
0.3:   PAC-Bayesian Generalisation Error Bounds for Gaussian Process.. - Seeger (2002)   (Correct)
0.1:   Fast Forward Selection to Speed Up Sparse Gaussian.. - Seeger, Williams.. (2003)   (Correct)
0.1:   PAC-Bayesian Generalization Error Bounds for Gaussian Process.. - Seeger (2002)   (Correct)

Related documents from co-citation:   More   All
4:   Fast sparse Gaussian process methods: The informative vector machine - Lawrence, Seeger et al.
3:   Bayesian Gaussian Processes for Regression and Classification (context) - Gibbs - 1997
3:   Covariance kernels from Bayesian generative models - Seeger - 2000

BibTeX entry:   (Update)

M. Seeger. Bayesian Gaussian Process Models: PAC-Bayesian Generalisation Error Bounds and Sparse Approximations. PhD thesis, University of Edinburgh, July 2003. See www.cs.berkeley.edu/~mseeger. http://citeseer.ist.psu.edu/seeger03bayesian.html   More

@misc{ seeger03bayesian,
  author = "M. Seeger",
  title = "Bayesian Gaussian Process Models: PAC-Bayesian Generalisation Error Bounds
    and Sparse Approximations",
  text = "M. Seeger. Bayesian Gaussian Process Models: PAC-Bayesian Generalisation
    Error Bounds and Sparse Approximations. PhD thesis, University of Edinburgh,
    July 2003. See www.cs.berkeley.edu/~mseeger.",
  year = "2003",
  url = "citeseer.ist.psu.edu/seeger03bayesian.html" }
Citations (may not include all citations):
2528   Maximum likelihood from incomplete data via the EM algorithm (context) - Dempster, Laird et al. - 1977
2319   Elements of Information Theory (context) - Cover, Thomas - 1991  ACM
1662   Neural Networks for Pattern Recognition (context) - Bishop - 1995  ACM   DBLP
947   Statistical Learning Theory (context) - Vapnik - 1998  ACM
640   Princeton University Press (context) - Rockafellar - 1970
525   Iterative Methods for Sparse Linear Systems (context) - Saad - 1996  ACM
524   Support vector networks - Cortes, Vapnik - 1995
520   Generalized linear models (context) - Nelder, Wedderburn - 1972
500   Experiments with a new boosting algorithm - Freund, Schapire - 1996  DBLP
493   Communications of the ACM (context) - Valiant, of et al. - 1984
472   Hierarchical mixtures of experts and the EM algorithm - Jordan, Jacobs - 1994  ACM
466   Probability and Measure (context) - Billingsley - 1995
461   Markov Chain Monte Carlo in Practice (context) - Gilks, Richardson et al. - 1996
454   the uniform convergence of relative frequencies of events to.. (context) - Vapnik, Chervonenkis - 1971
452   Statistical Decision Theory and Bayesian Analysis (context) - Berger - 1985
422   Networks for approximation and learning (context) - Poggio, Girosi - 1990
335   Statistics for Spatial Data (context) - Cressie - 1993
269   Bayesian Learning for Neural Networks (context) - Neal - 1996  ACM
248   An Introduction to Computational Learning Theory (context) - Kearns, Vazirani - 1994  ACM
245   An extended set of FORTRAN basic linear algebra subprograms - Dongarra, Croz et al. - 1988  ACM
236   Additive logistic regression: a statistical view of boosting - Friedman, Hastie et al. - 1998
228   Nonlinear dimensionality reduction by locally linear embeddi.. - Roweis, Saul - 2000
212   Methods of Numerical Integration (context) - Davis, Rabinovitz - 1984
206   Cambridge University Press (context) - Press, Teukolsky et al. - 1992
199   Probabilistic inference using Markov chain Monte Carlo metho.. - Neal - 1993
196   Bayesian Inference in Statistical Analysis (context) - Box, Tiao - 1992
191   Fast training of support vector machines using sequential mi.. (context) - Platt
184   A global geometric framework for nonlinear dimensionality re.. (context) - Tenenbaum, de Silva et al. - 2000
175   Stochastic Simulation (context) - Ripley - 1987  ACM
165   Generalized iterative scaling for log-linear models (context) - Darroch, Ratcli - 1972
161   Nonparametric Regression and Generalized Linear Models (context) - Green, Silverman - 1994
149   Theory of reproducing kernels (context) - Aronszajn - 1950
148   The perceptron: A probabilistic model for information storag.. (context) - Rosenblatt - 1958
147   A Course in Probability Theory (context) - Chung - 1974
146   Monte Carlo Statistical Methods - Robert, Casella - 1999
143   An Introduction to Support Vector Machines and Other Kernel .. (context) - Cristianini, Shawe-Taylor - 2000
132   Theory of Optimal Experiments (context) - Fedorov - 1972
129   Probability in Banach Spaces (context) - Ledoux, Talagrand - 1991
120   A variational Bayesian framework for graphical models - Attias
119   Exploiting generative models in discriminative classifiers - Jaakkola, Haussler  ACM   DBLP
113   Learning with Kernels (context) - Scholkopf, Smola - 2002  ACM
113   Tractable inference for complex stochastic processes - Boyen, Koller - 1998  DBLP
111   Active learning with statistical models - Cohn, Ghahramani et al. - 1996  ACM   DBLP
110   Learning in Graphical Models (context) - Jordan - 1997  ACM
110   Geometry of Random Fields (context) - Adler - 1981
105   Information-based objective functions for active data select.. - MacKay - 1991  ACM
104   A new view on the EM algorithm that justifies incremental an.. - Hinton, Neal
98   maximum likelihood and the EM algorithm (context) - Redner, Walker - 1984
97   Computational Learning Theory (context) - Anthony, Biggs - 1997  ACM
96   Backpropagation applied to handwritten zip code recognition (context) - LeCun, Boser et al. - 1989
88   Propagation algorithms for variational Bayesian learning - Ghahramani, Beal  DBLP
88   Neural Network Learning: Theoretical Foundations (context) - Anthony, Bartlett - 1999
85   Bounds on the sample complexity of Bayesian learning using i.. - Haussler, Kearns et al. - 1994  ACM   DBLP
85   Prediction with Gaussian processes: From linear regression t.. - Williams
83   Query by committee - Seung, Opper et al. - 1992  ACM   DBLP
82   Generalized belief propagation - Yedidia, Freeman et al.
78   Boosting the margin: A new explanation for the e#ectiveness .. (context) - Schapire, Freund et al. - 1998
78   Boosting the margin: A new explanation for the e#ectiveness .. (context) - Schapire, Freund et al. - 1998
77   Probabilistic outputs for support vector machines and compar.. - Platt
74   Loopy belief propagation for approximate inference: An empir.. - Murphy, Weiss et al. - 1999  DBLP
72   Turbo decoding as an instance of Pearl's belief propagation .. (context) - McEliece, MacKay et al. - 1998
71   The connection between regularization operators and support .. - Smola, Scholkopf et al. - 1998  ACM
67   Information-theoretic asymptotics of Bayes methods (context) - Clarke, Barron - 1990
66   Information Theory: Coding Theorems for Discrete Memoryless .. (context) - Csiszar, Korner - 1981
63   Weak Convergence and Empirical Processes (context) - van der Waart, Wellner - 1996
62   Selective sampling using the query by committee algorithm - Freund, Seung et al. - 1997  ACM   DBLP
61   On convergence properties of the EM algorithm for Gaussian m.. - Xu, Jordan - 1996  ACM
58   Dynamic Bayesian Networks: Representation (context) - Murphy - 2003
55   Probable networks and plausible predictions -- a review of p.. - MacKay - 1995
53   Evaluation of Gaussian Processes and Other Methods for Nonli.. - Rasmussen - 1996
51   Convolution kernels on discrete structures - Haussler - 1999
51   Kernel independent component analysis - Bach, Jordan - 2002  ACM   DBLP
50   Sparse Bayesian learning and the relevance vector machine - Tipping - 2001  ACM   DBLP
49   A correspondence between Bayesian estimation of stochastic p.. (context) - Kimeldorf, Wahba - 1970
48   Advances in Kernel Methods: Support Vector Learning (context) - Scholkopf, Burges et al. - 1998
48   Maximum entropy discrimination - Jaakkola, Meila et al.  ACM   DBLP
47   Sparse greedy matrix approximation for machine learning - Smola, Scholkopf  ACM   DBLP
47   Monte Carlo implementation of Gaussian process models for Ba.. - Neal - 1997
46   Some PAC-Bayesian theorems - McAllester - 1999  ACM   DBLP
46   Selecting concise training sets from clean data (context) - Plutowski, White - 1993
42   Deterministic annealing for clustering (context) - Rose - 1998
41   A Family of Algorithms for Approximate Bayesian Inference - Minka - 2001  ACM
41   Correlation Theory of Stationary and Related Random Function.. (context) - Yaglom - 1987
39   reproducing kernel Hilbert spaces and the randomized GACV (context) - Wahba, machines
39   Probabilistic kernel regression models - Jaakkola, Haussler - 1999
39   Estimation and Tracking: Principles (context) - Bar-Shalom, Li - 1993
38   Information geometry and alternating minimization procedures (context) - Csiszar, Tusnady - 1984
35   Correctness of belief propagation in Gaussian graphical mode.. - Weiss, Freeman  DBLP
35   Support vector machine active learning with applications to .. - Tong, Koller - 2001  ACM   DBLP
34   ciency of tests of a hypothesis based on the sum of observat.. (context) - Cherno, of - 1952
33   Foundations of the Theory of Probability (context) - Kolmogorov - 1933
32   A result of Vapnik with applications - Anthony, Shawe-Taylor - 1993  ACM   DBLP
32   Introduction to Gaussian processes (context) - MacKay - 1997
32   Incremental and decremental support vector machine learning - Cauwenberghs, Poggio  DBLP
31   BUGS: Bayesian inference using Gibbs sampling (context) - Spiegelhalter, Thomas et al. - 1995
26   Empirical margin distributions and bounding the generalizati.. - Koltchinskii, Panchenko - 2002
26   Variational Methods for Inference and Estimation in Graphica.. - Jaakkola - 1997  ACM
26   Bayesian Gaussian Processes for Regression and Classificatio.. (context) - Gibbs - 1997
25   Expectation propagation for approximate Bayesian inference (context) - Minka - 2001  ACM   DBLP
25   A sharp concentration inequality with applications - Boucheron, Lugosi et al. - 2000  DBLP
25   Classification using hierarchical mixtures of experts - Waterhouse, Robinson - 1994
24   Model selection and error estimation - Bartlett, Bocheron et al. - 2000  ACM   DBLP
23   Principles of geostatistics (context) - Matheron - 1963
22   Fast Monte-Carlo algorithms for finding low-rank approximati.. - Frieze, Kannan et al. - 1998  ACM   DBLP
22   Hybrid adaptive splines - Luo, Wahba - 1997
21   Interpolation of Spatial Data: Some Theory for Kriging (context) - Stein - 1999
20   A Bayesian committee machine - Tresp - 2000  DBLP
20   PAC-Bayesian model averaging - McAllester - 1999  ACM   DBLP
20   Sparse greedy Gaussian process regression - Smola, Bartlett  DBLP
19   Advances in Large Margin Classifiers - Smola, Bartlett et al. - 1999  ACM
19   The Numerical Treatment of Integral Equations (context) - Baker - 1977
18   Hyperbolic Householder algorithms for factoring structured m.. (context) - Cybenko, Berry - 1990  ACM
18   The intrinsic random functions and their applications (context) - Matheron - 1973
17   A new class of upper bounds on the log partition function - Wainwright, Jaakkola et al.  DBLP
17   Learning Kernel Classifiers (context) - Herbrich - 2001  ACM
16   Bayesian model selection for support vector machines - Seeger
16   Probabilistic methods for support vector machines - Sollich  DBLP
16   Bayesian numerical analysis (context) - Skilling - 1989
15   Cambridge University Press (context) - Horn, Johnson - 1985
15   Ensemble learning for multi-layer networks (context) - Barber, Bishop - 1998  ACM   DBLP
15   Structural risk minimization over data-dependent hierarchies - Shawe-Taylor, Bartlett et al. - 1926
15   Gaussian processes for classification: Mean field algorithms - Opper, Winther - 2000
14   The infinite Gaussian mixture model - Rasmussen  DBLP
13   Tree-based modeling and estimation of Gaussian processes on .. - Wainwright, Sudderth et al.  DBLP
13   Expectation propagation for approximate inference in dynamic.. - Heskes, Zoeter
13   Modified Cholesky factorizations in interior-point algorithm.. - Wright - 1999
13   An introduction to probabilistic graphical models (context) - Jordan - 2003
12   cient SVM training using low-rank kernel representations (context) - Fine, Scheinberg - 2001
12   Applications of Mathematics: Stochastic Modelling and Applie.. (context) - Devroye, Gyorfi et al. - 1996
12   A Bennett concentration inequality and its application to su.. - Bousquet - 2002
12   Information Theory for Continuous Systems (context) - Ihara - 1993
12   Stability and generalization (context) - Bousquet, Elissee - 2002  ACM   DBLP
12   Convex Analysis and Minimization Algorithms II (context) - Hiriart-Urruty, Lemarechal - 1993
12   Bayesian non-linear modeling for the energy prediction compe.. (context) - MacKay - 1994
11   Infinite mixtures of Gaussian process experts - Rasmussen, Ghahramani  DBLP
11   Relating data compression and learnability - Littlestone, Warmuth - 1986
11   Advances in Neural Information Processing Systems (context) - Becker, Thrun et al. - 2003
10   Sparse online Gaussian processes (context) - Csato, Opper - 2002
9   PAC-Bayesian generalization error bounds for Gaussian proces.. - Seeger - 2002
9   Advances in Neural Information Processing Systems - Kearns, Solla et al. - 1999
9   Construction of fully symmetric numerical integration formul.. (context) - McNamee, Stenger - 1967
9   Covariance kernels from Bayesian generative models - Seeger  DBLP
9   Advances in Neural Information Processing Systems - Solla, Leen et al. - 2000
9   PAC-Bayesian generalization error bounds for Gaussian proces.. - Seeger - 2002
8   Gaussian regression and optimal finite dimensional linear mo.. - Zhu, Williams et al. - 1998
8   Gaussian process classification and SVM: Mean field results .. - Opper, Winther
8   Bayesian methods for support vector machines and Gaussian pr.. (context) - Seeger - 1999
8   Advances in Neural Information Processing Systems (context) - Leen, Dietterich et al. - 2001
8   SIAM Society for Industrial and Applied Mathematics (context) - Wahba, for et al. - 1990
8   Practical Methods of Optimization: Unconstrained Optimizatio.. (context) - Fletcher - 1980
8   usion kernels on graphs and other discrete input spaces (context) - Kondor, La et al. - 2002
8   Advances in Neural Information Processing Systems (context) - Dietterich, Becker et al. - 2002
7   Learning a Gaussian process prior for automatically generati.. - Platt, Burges et al.
7   Cross-validated spline methods for the estimation of three d.. (context) - Nychka, Wahba et al. - 1984
7   PAC-Bayesian stochastic model selection - McAllester - 2002  ACM   DBLP
7   Probability and Random Processes (context) - rey, David - 2001
6   A review of Gaussian random fields and correlation functions (context) - Abrahamsen - 1997
6   Discovering hidden features with Gaussian process regression - Vivarelli, Williams
6   A PAC-Bayesian margin bound for linear classifiers: Why SVMs.. - Herbrich, Graepel  DBLP
6   Rational kernels - Cortes, Ha et al.
6   Uncertainty in Artificial Intelligence (context) - Darwiche, Friedman - 2002  ACM
5   Concentration inequalities for the missing mass and for hist.. - McAllester, Ortiz  ACM   DBLP
5   Structured variational distributions in VIBES - Bishop, Winn
5   Fast forward selection to speed up sparse Gaussian process r.. - Seeger, Williams et al.
5   Regularization with dot-product kernels - Smola, Ovari et al.  DBLP
5   A sparse Bayesian compression scheme - the informative vecto.. - Lawrence, Herbrich - 2001
5   Learning curves for Gaussian processes (context) - Sollich  ACM   DBLP
4   Practical Methods of Optimization: Constrained Optimization (context) - Fletcher - 1989
4   Spline functions and the problem of graduation (context) - Schonberg - 1964
4   Learning how to learn is learning with point sets - Minka, Picard - 1997
4   The EP energy function and minimization schemes (context) - Minka - 2001
4   metric entropy and cumulative relative entropy risk (context) - Haussler, Opper - 1997
4   A statistical approach to some basic mine valuation problems.. (context) - Krige - 1951
4   Bernardo and Adrian F (context) - Jose - 1994
4   Some Bayesian numerical analysis (context) - O'Hagan - 1992
4   Using the Nystrom method to speed up kernel machines (context) - Williams, Seeger
4   Bayesian classification with Gaussian processes (context) - Williams, Barber - 1998
4   Analysis of sparse Bayesian learning - Faul, Tipping  DBLP
4   Gaussian process priors with uncertain inputs --- applicatio.. - Girard, Rasmussen et al.
4   Computation with infinite neural networks - Williams - 1998  ACM   DBLP
4   On properly positive hermitian matrices (context) - Moore - 1916
3   Gaussian Processes --- Iterative Sparse Approximations (context) - Csato - 2002
3   Number 37 in Monographs on Statistics and Applied Probabilit.. (context) - McCullach, Nelder et al. - 1983
3   Derivative observations in Gaussian process models of dynami.. - Solak, Murray-Smith et al.
3   Available online at www (context) - Boyd, Vandenberghe - 2002
3   The stability of kernel principal components analysis and it.. (context) - Shawe-Taylor, Williams
3   Gaussian process networks - Friedman, Nachman - 2000  ACM   DBLP
3   Sur un nouveau theoreme limite de la theorie des probabilite.. (context) - Cramer - 1938
3   A fast method for calculating the perceptron with maximal st.. (context) - Rujan - 1993
2   Markov chain Monte Carlo and related topics - Liu - 1999
2   A PAC analysis of a Bayesian estimator (context) - Shawe-Taylor, Williamson - 1997  ACM   DBLP
2   PAC-Bayes and margin (context) - Langford, Shawe-Taylor
2   bounding the true error (context) - Langford, Caruana
2   Data-dependent bounds for Bayesian mixture methods - Meir, Zhang
2   belief propagation and sparsity (context) - Csato, Opper et al.
2   Sparse Bayesian learning: The informative vector machine (context) - Seeger, Lawrence et al. - 2002
1   Presented at Summer School Machine Learning (context) - Lugosi, Technical - 2003
1   ciency of the iterative proportional fitting procedure (context) - Teh, Welling et al.
1   Pattern classification and learning theory (context) - Lugosi
1   Number 305 in Grundlehren der mathematischen Wissenschaften (context) - Hiriart-Urruty, Lemarechal et al. - 1993

[Article contains additional citations not shown here]

Documents on the same site (http://www.inf.ed.ac.uk/publications/thesis/phd.html):   More
An Evolutionary Algorithm Approach to Poetry Generation - Manurung (2003)   (Correct)
Improving Architectural 3D Reconstruction by Constrained Modelling - Cantzler (2003)   (Correct)
From Distributional to Semantic Similarity - Curran (2003)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC