695 research outputs found

    Geometric Data Analysis: Advancements of the Statistical Methodology and Applications

    Get PDF
    Data analysis has become fundamental to our society and comes in multiple facets and approaches. Nevertheless, in research and applications, the focus was primarily on data from Euclidean vector spaces. Consequently, the majority of methods that are applied today are not suited for more general data types. Driven by needs from fields like image processing, (medical) shape analysis, and network analysis, more and more attention has recently been given to data from non-Euclidean spaces–particularly (curved) manifolds. It has led to the field of geometric data analysis whose methods explicitly take the structure (for example, the topology and geometry) of the underlying space into account. This thesis contributes to the methodology of geometric data analysis by generalizing several fundamental notions from multivariate statistics to manifolds. We thereby focus on two different viewpoints. First, we use Riemannian structures to derive a novel regression scheme for general manifolds that relies on splines of generalized Bézier curves. It can accurately model non-geodesic relationships, for example, time-dependent trends with saturation effects or cyclic trends. Since Bézier curves can be evaluated with the constructive de Casteljau algorithm, working with data from manifolds of high dimensions (for example, a hundred thousand or more) is feasible. Relying on the regression, we further develop a hierarchical statistical model for an adequate analysis of longitudinal data in manifolds, and a method to control for confounding variables. We secondly focus on data that is not only manifold- but even Lie group-valued, which is frequently the case in applications. We can only achieve this by endowing the group with an affine connection structure that is generally not Riemannian. Utilizing it, we derive generalizations of several well-known dissimilarity measures between data distributions that can be used for various tasks, including hypothesis testing. Invariance under data translations is proven, and a connection to continuous distributions is given for one measure. A further central contribution of this thesis is that it shows use cases for all notions in real-world applications, particularly in problems from shape analysis in medical imaging and archaeology. We can replicate or further quantify several known findings for shape changes of the femur and the right hippocampus under osteoarthritis and Alzheimer's, respectively. Furthermore, in an archaeological application, we obtain new insights into the construction principles of ancient sundials. Last but not least, we use the geometric structure underlying human brain connectomes to predict cognitive scores. Utilizing a sample selection procedure, we obtain state-of-the-art results

    Inference on Riemannian Manifolds: Regression and Stochastic Differential Equations

    Get PDF
    Statistical inference for manifolds attracts much attention because of its power of working with more general forms of data or geometric objects. We study regression and stochastic differential equations on manifolds from the intrinsic point of view. Firstly, we are able to provide alternative parametrizations for data that lie on Lie group in the problem of fitting a regression model, by mapping this space intrinsically onto its Lie algebra, while we explore the behaviour of fitted values when this base point is chosen differently. Due to the nature of our data in the application of soft tissue artefacts, we employ two correlation structures, namely Matern and quasi-periodic correlation functions when using the generalized least squares, and show that some patterns of the residuals are removed. Secondly, we construct a generalization of the Ornstein-Uhlenbeck process on the cone of covariance matrices SP(n) endowed with two popular Riemannian metrics, namely Log-Euclidean (LE) and Affine-Invariant (AI) metrics. We show that the Riemannian Brownian motion on SP(n) has infinite explosion time as on the Euclidean space and establish the calculation for the horizontal lifts of smooth curves. Moreover, we provide Bayesian inference for discretely observed diffusion processes of covariance matrices associated with either the LE or the AI metrics, and present a novel diffusion bridge sampling method using guided proposals when equipping SP(n) with the AI metric. The estimation algorithms are illustrated with an application in finance, together with a goodness-of-fit test comparing models associated with different metrics. Furthermore, we explore the multivariate volatility models via simulation study, in which covariance matrices in the models are assumed to be unobservable
    • …
    corecore