379 research outputs found
BAMBI: An R Package for Fitting Bivariate Angular Mixture Models
Statistical analyses of directional or angular data have applications in a variety of fields, such as geology, meteorology and bioinformatics. There is substantial literature on descriptive and inferential techniques for univariate angular data, with the bivariate (or more generally, multivariate) cases receiving more attention in recent years. More specifically, the bivariate wrapped normal, von Mises sine and von Mises cosine distributions, and mixtures thereof, have been proposed for practical use. However, there is a lack of software implementing these distributions and the associated inferential techniques. In this article, we introduce BAMBI, an R package for analyzing bivariate (and univariate) angular data. We implement random data generation, density evaluation, and computation of theoretical summary measures (variances and correlation coefficients) for the three aforementioned bivariate angular distributions, as well as two univariate angular distributions: the univariate wrapped normal and the univariate von Mises distribution. The major contribution of BAMBI to statistical computing is in providing Bayesian methods for modeling angular data using finite mixtures of these distributions. We also provide functions for visual and numerical diagnostics and Bayesian inference for the fitted models. In this article, we first provide a brief review of the distributions and techniques used in BAMBI, then describe the capabilities of the package, and finally conclude with demonstrations of mixture model fitting using BAMBI on the two real data sets included in the package, one univariate and one bivariate
Recent advances in directional statistics
Mainstream statistical methodology is generally applicable to data observed
in Euclidean space. There are, however, numerous contexts of considerable
scientific interest in which the natural supports for the data under
consideration are Riemannian manifolds like the unit circle, torus, sphere and
their extensions. Typically, such data can be represented using one or more
directions, and directional statistics is the branch of statistics that deals
with their analysis. In this paper we provide a review of the many recent
developments in the field since the publication of Mardia and Jupp (1999),
still the most comprehensive text on directional statistics. Many of those
developments have been stimulated by interesting applications in fields as
diverse as astronomy, medicine, genetics, neurology, aeronautics, acoustics,
image analysis, text mining, environmetrics, and machine learning. We begin by
considering developments for the exploratory analysis of directional data
before progressing to distributional models, general approaches to inference,
hypothesis testing, regression, nonparametric curve estimation, methods for
dimension reduction, classification and clustering, and the modelling of time
series, spatial and spatio-temporal data. An overview of currently available
software for analysing directional data is also provided, and potential future
developments discussed.Comment: 61 page
Toroidal PCA via density ridges
Principal Component Analysis (PCA) is a well-known linear dimension-reduction
technique designed for Euclidean data. In a wide spectrum of applied fields,
however, it is common to observe multivariate circular data (also known as
toroidal data), rendering spurious the use of PCA on it due to the periodicity
of its support. This paper introduces Toroidal Ridge PCA (TR-PCA), a novel
construction of PCA for bivariate circular data that leverages the concept of
density ridges as a flexible first principal component analog. Two reference
bivariate circular distributions, the bivariate sine von Mises and the
bivariate wrapped Cauchy, are employed as the parametric distributional basis
of TR-PCA. Efficient algorithms are presented to compute density ridges for
these two distribution models. A complete PCA methodology adapted to toroidal
data (including scores, variance decomposition, and resolution of edge cases)
is introduced and implemented in the companion R package ridgetorus. The
usefulness of TR-PCA is showcased with a novel case study involving the
analysis of ocean currents on the coast of Santa Barbara.Comment: 20 pages, 8 figures, 1 tabl
Exploring wind direction and SO2 concentration by circular-linear density estimation
The study of environmental problems usually requires the description of
variables with different nature and the assessment of relations between them.
In this work, an algorithm for flexible estimation of the joint density for a
circular-linear variable is proposed. The method is applied for exploring the
relation between wind direction and SO2 concentration in a monitoring station
close to a power plant located in Galicia (NW-Spain), in order to compare the
effectiveness of precautionary measures for pollutants reduction in two
different years.Comment: 17 pages, 7 figures, 2 table
- …