21,621 research outputs found
Modelling function-valued processes with complex structure
PhD ThesisExisting approaches to functional principal component analysis (FPCA) usually rely
on nonparametric estimation of the covariance structure. When function-valued processes
are observed on a multidimensional domain, the nonparametric estimation suffers from
the curse of dimensionality, forcing FPCA methods to make restrictive assumptions such
as covariance separability.
In this thesis, we discuss a general Bayesian framework on modelling function-valued
processes by using a Gaussian process (GP) as a prior, enabling us to handle nonseparable and/or nonstationary covariance structure. The nonstationarity is introduced by a
convolution-based approach through a varying kernel, whose parameters vary along the
input space and are estimated via a local empirical Bayesian method. For the varying
anisotropy matrix, we propose to use a spherical parametrisation, leading to unconstrained
and interpretable parameters and allowing for interaction between coordinate directions in
the covariance function. The unconstrained nature allows the parameters to be modelled
as a nonparametric function of time, spatial location and even additional covariates.
In the spirit of FPCA, the Bayesian framework can decompose the function-valued
processes using the eigenvalues and eigensurfaces calculated from the estimated covariance
structure. A finite number of the eigensurfaces can be used to extract some of the most
important information involved in data with complex covariance structure.
We also extend the methods to handle multivariate function-valued processes. The
estimated covariance structure is shown to be important to analyse joint variation in
the data and is further used in our proposed multiple functional partial least squares
regression model. We show that the interaction between the scalar response variable and
function-valued covariates can be explained by fewer terms than in a regression model
which uses multivariate functional principal components.
Simulation studies and applications to real data show that our proposed approaches
provide new insights into the data and excellent prediction results
Efficient Bayesian hierarchical functional data analysis with basis function approximations using Gaussian-Wishart processes
Functional data are defined as realizations of random functions (mostly
smooth functions) varying over a continuum, which are usually collected with
measurement errors on discretized grids. In order to accurately smooth noisy
functional observations and deal with the issue of high-dimensional observation
grids, we propose a novel Bayesian method based on the Bayesian hierarchical
model with a Gaussian-Wishart process prior and basis function representations.
We first derive an induced model for the basis-function coefficients of the
functional data, and then use this model to conduct posterior inference through
Markov chain Monte Carlo. Compared to the standard Bayesian inference that
suffers serious computational burden and unstableness for analyzing
high-dimensional functional data, our method greatly improves the computational
scalability and stability, while inheriting the advantage of simultaneously
smoothing raw observations and estimating the mean-covariance functions in a
nonparametric way. In addition, our method can naturally handle functional data
observed on random or uncommon grids. Simulation and real studies demonstrate
that our method produces similar results as the standard Bayesian inference
with low-dimensional common grids, while efficiently smoothing and estimating
functional data with random and high-dimensional observation grids where the
standard Bayesian inference fails. In conclusion, our method can efficiently
smooth and estimate high-dimensional functional data, providing one way to
resolve the curse of dimensionality for Bayesian functional data analysis with
Gaussian-Wishart processes.Comment: Under revie
Functional Regression
Functional data analysis (FDA) involves the analysis of data whose ideal
units of observation are functions defined on some continuous domain, and the
observed data consist of a sample of functions taken from some population,
sampled on a discrete grid. Ramsay and Silverman's 1997 textbook sparked the
development of this field, which has accelerated in the past 10 years to become
one of the fastest growing areas of statistics, fueled by the growing number of
applications yielding this type of data. One unique characteristic of FDA is
the need to combine information both across and within functions, which Ramsay
and Silverman called replication and regularization, respectively. This article
will focus on functional regression, the area of FDA that has received the most
attention in applications and methodological development. First will be an
introduction to basis functions, key building blocks for regularization in
functional regression methods, followed by an overview of functional regression
methods, split into three types: [1] functional predictor regression
(scalar-on-function), [2] functional response regression (function-on-scalar)
and [3] function-on-function regression. For each, the role of replication and
regularization will be discussed and the methodological development described
in a roughly chronological manner, at times deviating from the historical
timeline to group together similar methods. The primary focus is on modeling
and methodology, highlighting the modeling structures that have been developed
and the various regularization approaches employed. At the end is a brief
discussion describing potential areas of future development in this field
- …