13,109 research outputs found
Unifying Amplitude and Phase Analysis: A Compositional Data Approach to Functional Multivariate Mixed-Effects Modeling of Mandarin Chinese
Mandarin Chinese is characterized by being a tonal language; the pitch (or
) of its utterances carries considerable linguistic information. However,
speech samples from different individuals are subject to changes in amplitude
and phase which must be accounted for in any analysis which attempts to provide
a linguistically meaningful description of the language. A joint model for
amplitude, phase and duration is presented which combines elements from
Functional Data Analysis, Compositional Data Analysis and Linear Mixed Effects
Models. By decomposing functions via a functional principal component analysis,
and connecting registration functions to compositional data analysis, a joint
multivariate mixed effect model can be formulated which gives insights into the
relationship between the different modes of variation as well as their
dependence on linguistic and non-linguistic covariates. The model is applied to
the COSPRO-1 data set, a comprehensive database of spoken Taiwanese Mandarin,
containing approximately 50 thousand phonetically diverse sample contours
(syllables), and reveals that phonetic information is jointly carried by both
amplitude and phase variation.Comment: 49 pages, 13 figures, small changes to discussio
Principal Nested Spheres for Time Warped Functional Data Analysis
There are often two important types of variation in functional data: the
horizontal (or phase) variation and the vertical (or amplitude) variation.
These two types of variation have been appropriately separated and modeled
through a domain warping method (or curve registration) based on the Fisher Rao
metric. This paper focuses on the analysis of the horizontal variation,
captured by the domain warping functions. The square-root velocity function
representation transforms the manifold of the warping functions to a Hilbert
sphere. Motivated by recent results on manifold analogs of principal component
analysis, we propose to analyze the horizontal variation via a Principal Nested
Spheres approach. Compared with earlier approaches, such as approximating
tangent plane principal component analysis, this is seen to be the most
efficient and interpretable approach to decompose the horizontal variation in
some examples
Warped Functional Analysis of Variance
This article presents an Analysis of Variance model for functional data that
explicitly incorporates phase variability through a time-warping component,
allowing for a unified approach to estimation and inference in presence of
amplitude and time variability. The focus is on single-random-factor models but
the approach can be easily generalized to more complex ANOVA models. The
behavior of the estimators is studied by simulation, and an application to the
analysis of growth curves of flour beetles is presented. Although the model
assumes a smooth latent process behind the observed trajectories, smoothness of
the observed data is not required; the method can be applied to the sparsely
observed data that is often encountered in longitudinal studies
Simultaneous inference for misaligned multivariate functional data
We consider inference for misaligned multivariate functional data that
represents the same underlying curve, but where the functional samples have
systematic differences in shape. In this paper we introduce a new class of
generally applicable models where warping effects are modeled through nonlinear
transformation of latent Gaussian variables and systematic shape differences
are modeled by Gaussian processes. To model cross-covariance between sample
coordinates we introduce a class of low-dimensional cross-covariance structures
suitable for modeling multivariate functional data. We present a method for
doing maximum-likelihood estimation in the models and apply the method to three
data sets. The first data set is from a motion tracking system where the
spatial positions of a large number of body-markers are tracked in
three-dimensions over time. The second data set consists of height and weight
measurements for Danish boys. The third data set consists of three-dimensional
spatial hand paths from a controlled obstacle-avoidance experiment. We use the
developed method to estimate the cross-covariance structure, and use a
classification setup to demonstrate that the method outperforms
state-of-the-art methods for handling misaligned curve data.Comment: 44 pages in total including tables and figures. Additional 9 pages of
supplementary material and reference
- …