3,811 research outputs found
Recent advances in directional statistics
Mainstream statistical methodology is generally applicable to data observed
in Euclidean space. There are, however, numerous contexts of considerable
scientific interest in which the natural supports for the data under
consideration are Riemannian manifolds like the unit circle, torus, sphere and
their extensions. Typically, such data can be represented using one or more
directions, and directional statistics is the branch of statistics that deals
with their analysis. In this paper we provide a review of the many recent
developments in the field since the publication of Mardia and Jupp (1999),
still the most comprehensive text on directional statistics. Many of those
developments have been stimulated by interesting applications in fields as
diverse as astronomy, medicine, genetics, neurology, aeronautics, acoustics,
image analysis, text mining, environmetrics, and machine learning. We begin by
considering developments for the exploratory analysis of directional data
before progressing to distributional models, general approaches to inference,
hypothesis testing, regression, nonparametric curve estimation, methods for
dimension reduction, classification and clustering, and the modelling of time
series, spatial and spatio-temporal data. An overview of currently available
software for analysing directional data is also provided, and potential future
developments discussed.Comment: 61 page
Bayesian nonparametric dependent model for partially replicated data: the influence of fuel spills on species diversity
We introduce a dependent Bayesian nonparametric model for the probabilistic
modeling of membership of subgroups in a community based on partially
replicated data. The focus here is on species-by-site data, i.e. community data
where observations at different sites are classified in distinct species. Our
aim is to study the impact of additional covariates, for instance environmental
variables, on the data structure, and in particular on the community diversity.
To that purpose, we introduce dependence a priori across the covariates, and
show that it improves posterior inference. We use a dependent version of the
Griffiths-Engen-McCloskey distribution defined via the stick-breaking
construction. This distribution is obtained by transforming a Gaussian process
whose covariance function controls the desired dependence. The resulting
posterior distribution is sampled by Markov chain Monte Carlo. We illustrate
the application of our model to a soil microbial dataset acquired across a
hydrocarbon contamination gradient at the site of a fuel spill in Antarctica.
This method allows for inference on a number of quantities of interest in
ecotoxicology, such as diversity or effective concentrations, and is broadly
applicable to the general problem of communities response to environmental
variables.Comment: Main Paper: 22 pages, 6 figures. Supplementary Material: 11 pages, 1
figur
Modeling Human Understanding of Complex Intentional Action with a Bayesian Nonparametric Subgoal Model
Most human behaviors consist of multiple parts, steps, or subtasks. These
structures guide our action planning and execution, but when we observe others,
the latent structure of their actions is typically unobservable, and must be
inferred in order to learn new skills by demonstration, or to assist others in
completing their tasks. For example, an assistant who has learned the subgoal
structure of a colleague's task can more rapidly recognize and support their
actions as they unfold. Here we model how humans infer subgoals from
observations of complex action sequences using a nonparametric Bayesian model,
which assumes that observed actions are generated by approximately rational
planning over unknown subgoal sequences. We test this model with a behavioral
experiment in which humans observed different series of goal-directed actions,
and inferred both the number and composition of the subgoal sequences
associated with each goal. The Bayesian model predicts human subgoal inferences
with high accuracy, and significantly better than several alternative models
and straightforward heuristics. Motivated by this result, we simulate how
learning and inference of subgoals can improve performance in an artificial
user assistance task. The Bayesian model learns the correct subgoals from fewer
observations, and better assists users by more rapidly and accurately inferring
the goal of their actions than alternative approaches.Comment: Accepted at AAAI 1
- …