661 research outputs found
A Geometric Approach to Pairwise Bayesian Alignment of Functional Data Using Importance Sampling
We present a Bayesian model for pairwise nonlinear registration of functional
data. We use the Riemannian geometry of the space of warping functions to
define appropriate prior distributions and sample from the posterior using
importance sampling. A simple square-root transformation is used to simplify
the geometry of the space of warping functions, which allows for computation of
sample statistics, such as the mean and median, and a fast implementation of a
-means clustering algorithm. These tools allow for efficient posterior
inference, where multiple modes of the posterior distribution corresponding to
multiple plausible alignments of the given functions are found. We also show
pointwise credible intervals to assess the uncertainty of the alignment
in different clusters. We validate this model using simulations and present
multiple examples on real data from different application domains including
biometrics and medicine
Landmark-Based Registration of Curves via the Continuous Wavelet Transform
This paper is concerned with the problem of the alignment of multiple sets of curves. We analyze two real examples arising from the biomedical area for which we need to test whether there are any statistically significant differences between two subsets of subjects. To synchronize a set of curves, we propose a new nonparametric landmark-based registration method based on the alignment of the structural intensity of the zero-crossings of a wavelet transform. The structural intensity is a multiscale technique recently proposed by Bigot (2003, 2005) which highlights the main features of a signal observed with noise. We conduct a simulation study to compare our landmark-based registration approach with some existing methods for curve alignment. For the two real examples, we compare the registered curves with FANOVA techniques, and a detailed analysis of the warping functions is provided
Spectral analysis for nonstationary audio
A new approach for the analysis of nonstationary signals is proposed, with a
focus on audio applications. Following earlier contributions, nonstationarity
is modeled via stationarity-breaking operators acting on Gaussian stationary
random signals. The focus is on time warping and amplitude modulation, and an
approximate maximum-likelihood approach based on suitable approximations in the
wavelet transform domain is developed. This paper provides theoretical analysis
of the approximations, and introduces JEFAS, a corresponding estimation
algorithm. The latter is tested and validated on synthetic as well as real
audio signal.Comment: IEEE/ACM Transactions on Audio, Speech and Language Processing,
Institute of Electrical and Electronics Engineers, In pres
Statistical modelling of conidial discharge of entomophthoralean fungi using a newly discovered Pandora species
Entomophthoralean fungi are insect pathogenic fungi and are characterized by
their active discharge of infective conidia that infect insects. Our aim was to
study the effects of temperature on the discharge and to characterize the
variation in the associated temporal pattern of a newly discovered Pandora
species with focus on peak location and shape of the discharge. Mycelia were
incubated at various temperatures in darkness, and conidial discharge was
measured over time. We used a novel modification of a statistical model
(pavpop), that simultaneously estimates phase and amplitude effects, into a
setting of generalized linear models. This model is used to test hypotheses of
peak location and discharge of conidia. The statistical analysis showed that
high temperature leads to an early and fast decreasing peak, whereas there were
no significant differences in total number of discharged conidia. Using the
proposed model we also quantified the biological variation in the timing of the
peak location at a fixed temperature.Comment: 23 pages including supplementary materia
Time Series Data Mining Algorithms for Identifying Short RNA in Arabidopsis thaliana
The class of molecules called short RNAs (sRNAs) are known to play a key role in gene regulation. Th are typically sequences of nucleotides between 21-25 nucleotides in length. They are known to play a key role in gene regulation. The identification, clustering and classification of sRNA has recently become the focus of much research activity. The basic problem involves detecting regions of interest on the chromosome where the pattern of candidate matches is somehow unusual. Currently, there are no published algorithms for detecting regions of interest, and the unpublished methods that we are aware of involve bespoke rule based systems designed for a specific organism. Work in this very new field has understandably focused on the outcomes rather than the methods used to obtain the results. In this paper we propose two generic approaches that place the specific biological problem in the wider context of time series data mining problems. Both methods are based on treating the occurrences on a chromosome, or “hit count” data, as a time series, then running a sliding window along a chromosome and measuring unusualness. This formulation means we can treat finding unusual areas of candidate RNA activity as a variety of time series anomaly detection problem. The first set of approaches is model based. We specify a null hypothesis distribution for not being a sRNA, then estimate the p-values along the chromosome. The second approach is instance based. We identify some typical shapes from known sRNA, then use dynamic time warping and fourier trans-form based distance to measure how closely the candidate series matches. We demonstrate that these methods can find known sRNA on Arabidopsis thaliana chromosomes and illustrate the benefits of the added information provided by these algorithms
Most Likely Separation of Intensity and Warping Effects in Image Registration
This paper introduces a class of mixed-effects models for joint modeling of
spatially correlated intensity variation and warping variation in 2D images.
Spatially correlated intensity variation and warp variation are modeled as
random effects, resulting in a nonlinear mixed-effects model that enables
simultaneous estimation of template and model parameters by optimization of the
likelihood function. We propose an algorithm for fitting the model which
alternates estimation of variance parameters and image registration. This
approach avoids the potential estimation bias in the template estimate that
arises when treating registration as a preprocessing step. We apply the model
to datasets of facial images and 2D brain magnetic resonance images to
illustrate the simultaneous estimation and prediction of intensity and warp
effects
Bayesian registration of functions and curves
Bayesian analysis of functions and curves is considered, where warping and other geometrical transformations are often required for meaningful comparisons. The functions and curves of interest are represented using the recently introduced square root velocity function, which enables a warping invariant elastic distance to be calculated in a straightforward manner. We distinguish between various spaces of interest: the original space, the ambient space after standardizing, and the quotient space after removing a group of transformations. Using Gaussian process models in the ambient space and Dirichlet priors for the warping functions, we explore Bayesian inference for curves and functions. Markov chain Monte Carlo algorithms are introduced for simulating from the posterior. We also compare ambient and quotient space estimators for mean shape, and explain their frequent similarity in many practical problems using a Laplace approximation. Simulation studies are carried out, as well as practical alignment of growth rate functions and shape classification of mouse vertebra outlines in evolutionary biology. We also compare the performance of our Bayesian method with some alternative approaches
- …