
    Simplicial principal component analysis for density functions in Bayes spaces

    Probability density functions are frequently used to characterize the distributional properties of large-scale database systems. As functional compositions, densities primarily carry relative information, so standard methods of functional data analysis (FDA) are not appropriate for their statistical processing. The specific features of density functions are accounted for in Bayes spaces, which generalize the Aitchison geometry for compositional data to the infinite-dimensional setting. The aim is to build a concise methodology for functional principal component analysis of densities. A simplicial functional principal component analysis (SFPCA) is proposed, based on the geometry of the Bayes space B2 of functional compositions. SFPCA is performed by exploiting the centred log-ratio transform, an isometric isomorphism between B2 and L2 which enables one to resort to standard FDA tools. The advantages of the proposed approach over existing techniques are demonstrated using simulated data and a real-world example of population pyramids in Upper Austria.
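The pipeline this abstract describes (discretize densities, apply the centred log-ratio transform, then run standard PCA in the transformed space) can be sketched as follows. This is an illustrative NumPy mock-up, not the authors' code; the function names (`clr`, `inv_clr`) and the simulated Beta-like sample are assumptions made for the example.

```python
import numpy as np

def clr(density, eps=1e-12):
    """Centred log-ratio transform of a discretized density (illustrative)."""
    logd = np.log(density + eps)
    return logd - logd.mean()          # subtract the mean log -> clr coefficients

def inv_clr(z):
    """Map clr coefficients back to a (discrete) density."""
    d = np.exp(z)
    return d / d.sum()

rng = np.random.default_rng(0)
grid = np.linspace(0.01, 0.99, 50)

# Simulate a small sample of Beta-like densities (rows = observations).
params = rng.uniform(0.5, 3.0, size=(20, 2))
densities = np.array([grid**a * (1 - grid)**b for a, b in params])
densities /= densities.sum(axis=1, keepdims=True)   # normalize each row

Z = np.array([clr(d) for d in densities])           # clr-transformed sample
Zc = Z - Z.mean(axis=0)                             # centre in the L2-like space
U, s, Vt = np.linalg.svd(Zc, full_matrices=False)   # principal directions
scores = U * s                                      # SFPCA scores
explained = s**2 / np.sum(s**2)                     # variance proportions
```

Because clr is an isometric isomorphism, the PCA above is equivalent to a principal component analysis carried out directly in the Bayes-space geometry, and `inv_clr` maps principal directions back to density-like objects.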

    Object Oriented Geostatistical Simulation of Functional Compositions via Dimensionality Reduction in Bayes spaces

    We address the problem of geostatistical simulation of spatial complex data, with emphasis on functional compositions (FCs). We pursue an object-oriented geostatistical approach and interpret FCs as random points in a Bayes Hilbert space. This enables us to deal with data dimensionality and constraints by relying on a solid geometric basis, and to develop a simulation strategy consisting of: (i) optimal dimensionality reduction of the problem through a simplicial principal component analysis, and (ii) geostatistical simulation of random realizations of FCs via an approximate multivariate problem. We illustrate our methodology on a dataset of natural soil particle-size densities collected in an alluvial aquifer.
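A minimal sketch of the two-step strategy (i)-(ii) above, under strong simplifying assumptions: densities are reduced to a single simplicial principal component score, and that score is simulated at new spatial sites as a Gaussian field with an exponential covariance (the covariance model, range, and all names are illustrative choices, not from the paper).

```python
import numpy as np

def clr(d, eps=1e-12):
    z = np.log(d + eps)
    return z - z.mean()

def inv_clr(z):
    d = np.exp(z)
    return d / d.sum()

rng = np.random.default_rng(3)
grid = np.linspace(0.05, 0.95, 25)

# (i) Dimensionality reduction: PCA basis from a small observed sample
# of (simulated, Gaussian-shaped) densities.
locs_obs = rng.uniform(0.3, 0.7, size=12)
sample = np.array([np.exp(-0.5 * ((grid - m) / 0.1) ** 2) for m in locs_obs])
sample /= sample.sum(axis=1, keepdims=True)
Z = np.array([clr(d) for d in sample])
mu = Z.mean(axis=0)
_, _, Vt = np.linalg.svd(Z - mu, full_matrices=False)
v1 = Vt[0]                                          # leading simplicial PC

# (ii) Geostatistical simulation of the score at new spatial sites,
# via an exponential covariance and a Cholesky factorization.
sites = rng.uniform(0, 10, size=(30, 2))            # 2-D coordinates
dist = np.linalg.norm(sites[:, None] - sites[None, :], axis=-1)
C = np.exp(-dist / 3.0)                             # illustrative range = 3
L = np.linalg.cholesky(C + 1e-10 * np.eye(30))      # jitter for stability
scores = L @ rng.normal(size=30)                    # one Gaussian realization

# Map each simulated score back to a density (a functional composition).
simulated = np.array([inv_clr(mu + s * v1) for s in scores])
```

The point of working in the clr/PCA space is that the simulated curves automatically satisfy the positivity and unit-integral constraints once mapped back.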

    Projected Statistical Methods for Distributional Data on the Real Line with the Wasserstein Metric

    We present a novel class of projected methods to perform statistical analysis on a data set of probability distributions on the real line, with the 2-Wasserstein metric. We focus in particular on Principal Component Analysis (PCA) and regression. To define these models, we exploit a representation of the Wasserstein space closely related to its weak Riemannian structure, by mapping the data to a suitable linear space and using a metric projection operator to constrain the results in the Wasserstein space. By carefully choosing the tangent point, we are able to derive fast empirical methods, exploiting a constrained B-spline approximation. As a byproduct of our approach, we are also able to derive faster routines for previous work on PCA for distributions. By means of simulation studies, we compare our approaches to previously proposed methods, showing that our projected PCA has similar performance for a fraction of the computational cost and that the projected regression is extremely flexible even under misspecification. Several theoretical properties of the models are investigated and asymptotic consistency is proven. Two real-world applications to Covid-19 mortality in the US and wind speed forecasting are discussed.
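For distributions on the real line, the 2-Wasserstein distance equals the L2 distance between quantile functions, which makes a "map to a linear space, run PCA, project back" scheme concrete. The sketch below is a hedged illustration of that general idea (not the authors' constrained B-spline method): PCA on quantile functions, with reconstructions projected onto the cone of non-decreasing sequences by the pool-adjacent-violators algorithm, which is the L2 metric projection onto monotone sequences.

```python
import numpy as np

def pava(y):
    """L2 projection of y onto non-decreasing sequences (pool adjacent violators)."""
    out = []                                   # blocks as [mean, size]
    for v in y:
        out.append([float(v), 1])
        while len(out) > 1 and out[-2][0] > out[-1][0]:
            m2, n2 = out.pop()
            m1, n1 = out.pop()
            out.append([(m1 * n1 + m2 * n2) / (n1 + n2), n1 + n2])
    return np.concatenate([[m] * n for m, n in out])

rng = np.random.default_rng(1)
p = np.linspace(0.01, 0.99, 40)
# Sample of exponential distributions: quantile function Q(p) = -scale*log(1-p).
scales = rng.uniform(0.5, 2.0, size=15)
Q = np.array([-s * np.log(1 - p) for s in scales])

Qbar = Q.mean(axis=0)
U, sv, Vt = np.linalg.svd(Q - Qbar, full_matrices=False)
recon = Qbar + np.outer(U[:, 0] * sv[0], Vt[0])     # rank-1 reconstruction
recon_proj = np.array([pava(row) for row in recon]) # back into valid quantiles
```

Projection is what keeps truncated PCA reconstructions inside the Wasserstein space: a low-rank combination of quantile functions need not be monotone, and `pava` restores monotonicity at minimal L2 (hence Wasserstein) cost.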

    Profile Monitoring of Probability Density Functions via Simplicial Functional PCA with application to Image Data

    The advance of sensor and information technologies is leading to data-rich industrial environments, where large amounts of data are potentially available. This study focuses on industrial applications where image data are increasingly used for quality inspection and statistical process monitoring. In many cases of interest, acquired images consist of many similar features that are randomly distributed within a given region. Examples are pores in parts obtained via casting or additive manufacturing, voids in metal foams and lightweight components, grains in metallographic analysis, etc. The proposed approach summarizes the random occurrences of the observed features via their (empirical) probability density functions (PDFs). In particular, a novel approach for PDF monitoring is proposed. It is based on simplicial functional principal component analysis (SFPCA), which is performed within the space of density functions, that is, the Bayes space B2. A simulation study shows the enhanced monitoring performance provided by SFPCA-based profile monitoring against other competitors proposed in the literature. Finally, a real case study dealing with the quality control of foamed material production is discussed, to highlight a practical use of the proposed methodology. Supplementary materials for the article are available online.
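One plausible way to turn SFPCA into a monitoring scheme, sketched here under stated assumptions (not the paper's exact charting procedure): each part yields an empirical density of a feature (e.g. pore size), densities are clr-transformed, and a Hotelling-type T² statistic on the leading SFPCA scores is compared to an empirical control limit. The 0.99 quantile limit, the Gaussian-shaped densities, and all names are illustrative choices.

```python
import numpy as np

def clr(d, eps=1e-12):
    z = np.log(d + eps)
    return z - z.mean()

rng = np.random.default_rng(2)
grid = np.linspace(0.05, 0.95, 30)

def density(loc, scale):
    """Discretized Gaussian-shaped density (a stand-in for an empirical PDF)."""
    d = np.exp(-0.5 * ((grid - loc) / scale) ** 2)
    return d / d.sum()

# Phase I: in-control densities with small natural variation in loc and scale.
phase1 = np.array([density(rng.normal(0.5, 0.02), rng.normal(0.1, 0.005))
                   for _ in range(100)])
Z = np.array([clr(d) for d in phase1])
mu = Z.mean(axis=0)
_, _, Vt = np.linalg.svd(Z - mu, full_matrices=False)
k = 2                                              # retained components
scores = (Z - mu) @ Vt[:k].T
var = scores.var(axis=0, ddof=1)

def t2(d):
    """Hotelling-type statistic on the first k SFPCA scores."""
    sc = (clr(d) - mu) @ Vt[:k].T
    return float(np.sum(sc**2 / var))

# Empirical control limit from the Phase I sample (illustrative 0.99 quantile).
limit = np.quantile([t2(d) for d in phase1], 0.99)

in_control = t2(density(0.5, 0.1))                 # should fall below the limit
shifted = t2(density(0.7, 0.1))                    # mean-shifted process
```

A new image whose feature-size density drifts (here, a shift of the location from 0.5 to 0.7) produces a large score displacement and is flagged, while in-control densities stay under the limit.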

    Evidence functions: a compositional approach to information

    The discrete case of Bayes’ formula is considered the paradigm of information acquisition. Prior and posterior probability functions, as well as likelihood functions, called evidence functions, are compositions following the Aitchison geometry of the simplex, and thus have vector character. Bayes’ formula becomes a vector addition. The Aitchison norm of an evidence function is introduced as a scalar measurement of information. A fictitious fire scenario serves as illustration. Two different inspections of affected houses are considered. Two questions are addressed: (a) what information is provided by the outcomes of the inspections, and (b) which is the most informative inspection.
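The compositional reading of Bayes' formula can be made concrete in a few lines. In the sketch below (numbers and names are made up for illustration), the Bayes update is perturbation, the simplex's vector addition, and the Aitchison norm of the evidence function measures how much information an observation carries; a uniform evidence function has norm zero, i.e. it is uninformative.

```python
import numpy as np

def close(x):
    """Closure: rescale a positive vector to the unit simplex."""
    x = np.asarray(x, dtype=float)
    return x / x.sum()

def perturb(x, y):
    """Aitchison perturbation: the simplex's vector addition."""
    return close(close(x) * close(y))

def clr(x):
    z = np.log(close(x))
    return z - z.mean()

def aitchison_norm(x):
    """Scalar information content of a composition (Aitchison norm)."""
    return float(np.linalg.norm(clr(x)))

prior = [0.7, 0.2, 0.1]                    # three hypothetical hypotheses
evidence = [0.2, 0.3, 0.5]                 # likelihood of the observed outcome

posterior = perturb(prior, evidence)       # Bayes' formula as vector addition
classic = close(np.array(prior) * np.array(evidence))  # the usual update

uninformative = aitchison_norm([1/3, 1/3, 1/3])  # uniform evidence: norm 0
informative = aitchison_norm(evidence)
```

Comparing `aitchison_norm` across candidate inspections is one way to answer question (b): the inspection whose evidence function has the larger norm is the more informative one.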
