
    Independent component analysis for non-standard data structures

    Independent component analysis is a classical multivariate tool for estimating independent sources from collections of mixed signals. However, modern forms of data are typically too complex for the basic theory to handle adequately. In this thesis, extensions of independent component analysis are developed for three cases of non-standard data structures: noisy multivariate data, tensor-valued data, and multivariate functional data. In each case we define the corresponding independent component model along with the related assumptions and implications. The proposed estimators are mostly based on the use of kurtosis and its analogues for the considered structures, resulting in functionals of rather unified form regardless of the type of the data. We prove the Fisher consistency of each estimator, and particular weight is given to their limiting distributions, which also enable theoretical comparisons between the methods.
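    To illustrate the kurtosis-based approach the abstract describes, the classical FOBI estimator (one of the simplest fourth-moment ICA methods; a baseline sketch, not the thesis's own estimators) can be written in a few lines of NumPy. The function name `fobi` and the toy mixing matrix below are our own choices:

```python
import numpy as np

def fobi(X):
    """Kurtosis-based ICA via FOBI (Cardoso, 1989): a minimal sketch.

    X: (n, d) array of mixed signals; returns (n, d) estimated sources.
    """
    n, d = X.shape
    Xc = X - X.mean(axis=0)
    # Whitening: decorrelate the data and scale it to unit variance.
    vals, vecs = np.linalg.eigh(Xc.T @ Xc / n)
    W = vecs @ np.diag(vals ** -0.5) @ vecs.T   # symmetric whitening matrix
    Z = Xc @ W
    # Fourth-moment (kurtosis) matrix E[||z||^2 z z'] of the whitened data.
    B = (Z * (Z ** 2).sum(axis=1, keepdims=True)).T @ Z / n
    # Its eigenvectors recover the rotation left unidentified by whitening.
    _, U = np.linalg.eigh(B)
    return Z @ U

# Toy usage: two independent sources (distinct kurtoses) mixed linearly.
rng = np.random.default_rng(0)
S = np.column_stack([rng.uniform(-1, 1, 1000), rng.laplace(size=1000)])
A = np.array([[1.0, 0.5], [0.3, 1.0]])
S_hat = fobi(S @ A)
```

    FOBI requires the sources to have distinct kurtoses, which is why the toy example mixes a uniform with a Laplace source.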

    On the behavior of extreme d-dimensional spatial quantiles under minimal assumptions

    "Spatial" or "geometric" quantiles are the only multivariate quantiles coping with both high-dimensional data and functional data, also in the framework of multiple-output quantile regression. This work studies spatial quantiles in the finite-dimensional case, where the spatial quantile Όα,u(P)\mu_{\alpha,u}(P) of the distribution PP taking values in Rd\mathbb{R}^d is a point in Rd\mathbb{R}^d indexed by an order α∈[0,1)\alpha\in[0,1) and a direction uu in the unit sphere Sd−1\mathcal{S}^{d-1} of Rd\mathbb{R}^d --- or equivalently by a vector αu\alpha u in the open unit ball of Rd\mathbb{R}^d. Recently, Girard and Stupfler (2017) proved that (i) the extreme quantiles Όα,u(P)\mu_{\alpha,u}(P) obtained as α→1\alpha\to 1 exit all compact sets of Rd\mathbb{R}^d and that (ii) they do so in a direction converging to uu. These results help understanding the nature of these quantiles: the first result is particularly striking as it holds even if PP has a bounded support, whereas the second one clarifies the delicate dependence of spatial quantiles on uu. However, they were established under assumptions imposing that PP is non-atomic, so that it is unclear whether they hold for empirical probability measures. We improve on this by proving these results under much milder conditions, allowing for the sample case. This prevents using gradient condition arguments, which makes the proofs very challenging. We also weaken the well-known sufficient condition for uniqueness of finite-dimensional spatial quantiles

    Structure-preserving non-linear PCA for matrices

    We propose MNPCA, a novel non-linear generalization of $(2D)^2$-PCA, a classical linear method for the simultaneous dimension reduction of both the rows and the columns of a set of matrix-valued data. MNPCA is based on optimizing over separate non-linear mappings on the left and right singular spaces of the observations, essentially amounting to decoupling the two sides of the matrices. We develop a comprehensive theoretical framework for MNPCA by viewing it as an eigenproblem in reproducing kernel Hilbert spaces. We study the resulting estimators at both the population and sample levels, deriving their convergence rates and formulating a coordinate representation that allows the method to be used in practice. Simulations and a real-data example demonstrate MNPCA's good performance over its competitors. Comment: 23 pages, 4 figures.
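    For context, the linear $(2D)^2$-PCA baseline that MNPCA generalizes can be sketched directly: it projects each matrix from the left and the right with leading eigenvectors of two scatter matrices. This is the classical linear method, not MNPCA itself; the function and variable names are ours:

```python
import numpy as np

def two_d_squared_pca(A, k_row, k_col):
    """Linear (2D)^2-PCA: simultaneous row/column reduction of matrices.

    A: (n, p, q) stack of matrix observations. Returns the row basis
    (p, k_row), column basis (q, k_col), and reduced (n, k_row, k_col) data.
    """
    C = A - A.mean(axis=0)
    # Column-direction scatter: average of (A_i - Abar)^T (A_i - Abar).
    G_col = np.einsum('nij,nik->jk', C, C) / len(A)
    # Row-direction scatter: average of (A_i - Abar)(A_i - Abar)^T.
    G_row = np.einsum('nij,nkj->ik', C, C) / len(A)
    _, V = np.linalg.eigh(G_col)
    _, U = np.linalg.eigh(G_row)
    U = U[:, ::-1][:, :k_row]   # leading eigenvectors (eigh sorts ascending)
    V = V[:, ::-1][:, :k_col]
    reduced = np.einsum('pi,npq,qj->nij', U, A, V)   # U^T A_i V for each i
    return U, V, reduced

rng = np.random.default_rng(2)
A = rng.normal(size=(50, 6, 4))
U, V, R = two_d_squared_pca(A, 2, 2)
```

    MNPCA replaces the two linear projections with separate non-linear mappings learned in reproducing kernel Hilbert spaces, but the decoupling of the left and right sides is already visible in this linear version.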

    Asymptotic and bootstrap tests for the dimension of the non-Gaussian subspace

    Dimension reduction is often a preliminary step in the analysis of large data sets. So-called non-Gaussian component analysis searches for a projection onto the non-Gaussian part of the data, making it important to know the correct dimension of the non-Gaussian signal subspace. In this paper we develop both asymptotic and bootstrap tests for this dimension, based on the popular fourth-order blind identification (FOBI) method.
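    The quantity such tests are built on can be computed directly: for whitened data, the eigenvalues of the FOBI fourth-moment matrix equal $d+2$ in Gaussian directions, so the number of eigenvalues deviating from $d+2$ indicates the non-Gaussian subspace dimension. A rough NumPy sketch of the eigenvalues only (the asymptotic and bootstrap test statistics themselves are omitted; names are ours):

```python
import numpy as np

def fobi_eigenvalues(X):
    """Eigenvalues of the FOBI fourth-moment matrix of whitened data.

    Eigenvalues close to d + 2 correspond to Gaussian directions; the
    remaining ones span the non-Gaussian signal subspace.
    """
    n, d = X.shape
    Xc = X - X.mean(axis=0)
    vals, vecs = np.linalg.eigh(Xc.T @ Xc / n)
    Z = Xc @ vecs @ np.diag(vals ** -0.5) @ vecs.T   # whitened data
    B = (Z * (Z ** 2).sum(axis=1, keepdims=True)).T @ Z / n
    return np.linalg.eigvalsh(B)

rng = np.random.default_rng(3)
# One heavy-tailed signal direction plus two Gaussian noise directions,
# so here d = 3 and the non-Gaussian subspace has dimension 1.
X = np.column_stack([rng.laplace(size=5000),
                     rng.normal(size=5000),
                     rng.normal(size=5000)])
ev = fobi_eigenvalues(X)
```

    In this example two eigenvalues should sit near $d + 2 = 5$ and one clearly above it, since the Laplace direction has excess kurtosis; the paper's tests formalize how large such deviations must be to be significant.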
