The problem of regression under Gaussian assumptions is treated generally. The relationship between Bayesian prediction, regularization and smoothing is elucidated. The ideal regression is the posterior mean and its computation scales as O(n^3), where n is the sample size. We show that the optimal m-dimensional linear model under a given prior is spanned by the first m eigenfunctions of a covariance operator, which is a trace-class operator. This is an infinite dimensional analogue of principal component analysis. The importance of Hilbert space methods to practical statistics is also discussed.

Christopher K. I. Williams

Huaiyu Zhu

Michal Morciniec

Richard Rohwer

English

Keywords: regression, Gaussian measures, linear model, principal component, spline, regularization, eigenfunctions.

CiteSeerX

Gaussian Regression and Optimal Finite Dimensional Linear Models

Aston Publications Explorer

1 Introduction

Many problems in computation and statistics can be generally described as fitting a "curve" from a discrete set of data. Here we allow a liberal interpretation of curve, which could be any mapping from a finite dimensional space to a finite dimensional space. Such problems are usually studied under the name "regression" in statistics or "approximation" in numerical analysis ...
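The abstract's statement that the posterior mean costs O(n^3) to compute can be illustrated with a minimal sketch. It assumes a zero-mean Gaussian prior with a squared-exponential covariance and independent Gaussian noise; the kernel, the noise variance, and all function names here are illustrative assumptions, not taken from the paper. The cubic cost comes from factorising the n-by-n covariance matrix of the training inputs.

```python
import math


def sq_exp_kernel(x, y, length_scale=1.0):
    """Squared-exponential covariance (an assumed prior, chosen for illustration)."""
    return math.exp(-0.5 * ((x - y) / length_scale) ** 2)


def cholesky(a):
    """Lower-triangular Cholesky factor of a symmetric positive definite matrix.

    The triple loop (two explicit, one inside sum) is the O(n^3) step.
    """
    n = len(a)
    l = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(l[i][k] * l[j][k] for k in range(j))
            if i == j:
                l[i][j] = math.sqrt(a[i][i] - s)
            else:
                l[i][j] = (a[i][j] - s) / l[j][j]
    return l


def solve_cholesky(l, b):
    """Solve (L L^T) x = b by forward then backward substitution."""
    n = len(b)
    z = [0.0] * n
    for i in range(n):
        z[i] = (b[i] - sum(l[i][k] * z[k] for k in range(i))) / l[i][i]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (z[i] - sum(l[k][i] * x[k] for k in range(i + 1, n))) / l[i][i]
    return x


def posterior_mean(x_train, y_train, x_test, noise_var=0.1):
    """Posterior mean k(x*, X) (K + noise_var * I)^{-1} y at each test input."""
    n = len(x_train)
    k = [
        [
            sq_exp_kernel(x_train[i], x_train[j]) + (noise_var if i == j else 0.0)
            for j in range(n)
        ]
        for i in range(n)
    ]
    alpha = solve_cholesky(cholesky(k), y_train)
    return [
        sum(sq_exp_kernel(xs, x_train[i]) * alpha[i] for i in range(n))
        for xs in x_test
    ]
```

With a very small noise variance the posterior mean nearly interpolates the training data, while larger values smooth it, which is one way to see the link between Bayesian prediction and regularization that the abstract mentions.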

Hammel, Michal

Edinburgh Research Explorer

http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.52.7108
