1,160 research outputs found
Toward efficient Bayesian approaches to inference in hierarchical hidden Markov models for inferring animal behavior
The study of animal behavioral states inferred through hidden Markov models and similar state switching models has seen a significant increase in popularity in recent years. The ability to account for varying levels of behavioral scale has become possible through hierarchical hidden Markov models, but additional levels lead to higher complexity and increased correlation between model components. Maximum likelihood approaches to inference using the EM algorithm and direct optimization of likelihoods are more frequently used, with Bayesian approaches being less favored due to computational demands. Given these demands, it is vital that efficient estimation algorithms are developed when Bayesian methods are preferred. We study the use of various approaches to improve convergence times and mixing in Markov chain Monte Carlo methods applied to hierarchical hidden Markov models, including parallel tempering as an inference facilitation mechanism. The method shows promise for analysing complex stochastic models with high levels of correlation between components, but our results show that it requires careful tuning in order to maximize that potential.Publisher PDFPeer reviewe
Data fusion in a two-stage spatio-temporal model using the INLA-SPDE approach
This paper proposes a two-stage estimation approach for a spatial misalignment scenario that is motivated by the epidemiological problem of linking pollutant exposures and health outcomes. We use integrated nested Laplace approximation method to estimate the parameters of a two-stage spatio-temporal model – the first stage models the exposures using data fusion while the second stage links the health outcomes to exposures. The first stage is based on the Bayesian melding model, which assumes a common latent field for the different data sources for the pollutants. The second stage fits a GLMM using the spatial averages of the estimated latent field, and additional spatial and temporal random effects. Uncertainty from the first stage is accounted for by simulating repeatedly from the posterior predictive distribution of the latent field. A simulation study was carried out to assess the impact of the sparsity of the data on the monitors, number of time points, and the specification of the priors in terms of the biases, RMSEs, and coverage probabilities of the parameters and the block-level exposure estimates. The results show that the parameters are generally estimated correctly but there is difficulty in estimating the Matèrn field parameters. The effect of exposures on the health outcomes is the primary parameter of interest for spatial epidemiologists and health policy makers, and our results show that the proposed method estimates these very well. The proposed method is applied to measurements of NO2 concentration and respiratory hospitalizations for year 2007 in England. The results show that an increase in NO2 levels is significantly associated with an increase in the relative risks of the health outcome. Also, there is a strong spatial structure in the risks, a strong temporal autocorrelation, and a significant spatio-temporal interaction effect.Publisher PDFPeer reviewe
Bayesian multi-species modelling of non-negative continuous ecological data with a discrete mass at zero
Severe declines in the number of some songbirds over the last 40 years
have caused heated debate amongst interested parties. Many factors
have been suggested as possible causes for these declines, including
an increase in the abundance and distribution of an avian predator,
the Eurasian sparrowhawk Accipiter nisus. To test for evidence for a
predator effect on the abundance of its prey, we analyse data on 10
species visiting garden bird feeding stations monitored by the British
Trust for Ornithology in relation to the abundance of sparrowhawks.
We apply Bayesian hierarchical models to data relating to averaged
maximum weekly counts from a garden bird monitoring survey. These
data are essentially continuous, bounded below by zero, but for many
species show a marked spike at zero that many standard distributions
would not be able to account for. We use the Tweedie distributions,
which for certain areas of parameter space relate to continuous nonnegative
distributions with a discrete probability mass at zero, and
are hence able to deal with the shape of the empirical distributions of
the data.
The methods developed in this thesis begin by modelling single prey
species independently with an avian predator as a covariate, using
MCMC methods to explore parameter and model spaces. This model
is then extended to a multiple-prey species model, testing for interactions
between species as well as synchrony in their response to environmental
factors and unobserved variation.
Finally we use a relatively new methodological framework, namely
the SPDE approach in the INLA framework, to fit a multi-species
spatio-temporal model to the ecological data.
The results from the analyses are consistent with the hypothesis that
sparrowhawks are suppressing the numbers of some species of birds
visiting garden feeding stations. Only the species most susceptible to
sparrowhawk predation seem to be affected
Toward efficient Bayesian approaches to inference in hierarchical hidden Markov models for inferring animal behavior
The study of animal behavioral states inferred through hidden Markov models and similar state switching models has seen a significant increase in popularity in recent years. The ability to account for varying levels of behavioral scale has become possible through hierarchical hidden Markov models, but additional levels lead to higher complexity and increased correlation between model components. Maximum likelihood approaches to inference using the EM algorithm and direct optimization of likelihoods are more frequently used, with Bayesian approaches being less favored due to computational demands. Given these demands, it is vital that efficient estimation algorithms are developed when Bayesian methods are preferred. We study the use of various approaches to improve convergence times and mixing in Markov chain Monte Carlo methods applied to hierarchical hidden Markov models, including parallel tempering as an inference facilitation mechanism. The method shows promise for analysing complex stochastic models with high levels of correlation between components, but our results show that it requires careful tuning in order to maximize that potential
The impact of cancer incidence on catastrophic health expenditure in Iran with a Bayesian spatio-temporal analysis
Background : We investigated the impact of cancer incidence on CHE in Iran by considering spatial variation across provinces as well as temporal trends. Methods : Data from Household Income-Expenditure Survey were merged with cancer incidence rates during 2011-2016. We developed a Bayesian hierarchical model to explore the spatial and temporal patterns of CHE and its associated factors at provincial level. We used a Besag-York-Mollie2 prior and a random walk prior for spatial and temporal random effects respectively. All statistical analysis was carried out in R software. Results : All-type cancer incidence (OR per SD (95% CrI) = 1.16 (1.02, 1.32)), unemployment rate (1.08 (1.01, 1.15)) and income equity (0.88 (0.81, 0.97)) have important association with CHE. Percentage of urbanization and percentage of poverty were not statistically significant. Conclusion : The results suggest the development of new policies to protect cancer patients against financial hardship, narrow the gap in income inequality and solve the problem of high unemployment rate to reduce the level of CHE at provincial level.Publisher PDFPeer reviewe
Challenges on the interaction of models and policy for pandemic control.
The COVID-19 pandemic has seen infectious disease modelling at the forefront of government decision-making. Models have been widely used throughout the pandemic to estimate pathogen spread and explore the potential impact of different intervention strategies. Infectious disease modellers and policymakers have worked effectively together, but there are many avenues for progress on this interface. In this paper, we identify and discuss seven broad challenges on the interaction of models and policy for pandemic control. We then conclude with suggestions and recommendations for the future
Identifying multi-species synchrony in response to environmental covariates
BTS was part funded by EPSRC/NERC grant EP/10009171/1.The importance of multi-species models for understanding complex ecological processes and interactions is beginning to be realised. Recent developments, such as those by Lahoz-Monfort et al. (2011), have enabled synchrony in demographic parameters across multiple species to be explored. Species in a similar environment would be expected to be subject to similar exogenous factors, although their response to each of these factors may be quite different. The ability to group species together according to how they respond to a particular measured covariate may be of particular interest to ecologists. We fit a multi-species model to two sets of similar species of garden bird monitored under the British Trust for Ornithology’s Garden Bird Feeding Survey. Posterior model probabilities were estimated using the reversible jump algorithm to compare posterior support for competing models with different species sharing different subsets of regression coefficients.There was frequently good agreement between species with small asynchronous random effect components and those with posterior support for models with shared regression coefficients; however, this was not always the case. When groups of species were less correlated, greater uncertainty was found in whether regression coefficients should be shared or not.The methods outlined in this paper can test additional hypotheses about the similarities or synchrony across multiple species that share the same environment. Through the use of posterior model probabilities, estimated using the reversible jump algorithm, we can detect multi-species responses in relation to measured covariates across any combination of species and covariates under consideration. The method can account for synchrony across species in relation to measured covariates, as well as unexplained variation accounted for using random effects. For more flexible, multi-parameter distributions, the support for species-specific parameters can also be measured.Publisher PDFPeer reviewe
Statins are underused in recent-onset Parkinson's disease with increased vascular risk : findings from the UK Tracking Parkinson's and Oxford Parkinson's Disease Centre (OPDC) discovery cohorts
Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/.Peer reviewedPublisher PD
The challenges of data in future pandemics
This work was supported by Engineering and Physical Sciences Research Council (EPSRC) grant no. EP/R014604/1. M.C. was funded by the Engineering and Physical Sciences Research Council (EPSRC) grant no. EP/V054236/1. G.M. is supported by the Scottish Government’s Rural and Environment Science and Analytical Services Division (RESAS). J.P-G’s work is supported by funding from the UK Health Security Agency and the UK Department of Health and Social Care. L.P. is funded by the Wellcome Trust, UK and the Royal Society, UK (grant no. 202562/Z/16/Z). L.P. is supported by the UK Research and Innovation (UKRI) through the JUNIPER modelling consortium (grant no. MR/V038613/1). L.P. is also supported by The Alan Turing Institute for Data Science and Artificial Intelligence, UK. R.R was supported by the Natural Environment Research Council (NERC) grant no. NE/T004193/1 and NE/T010355/1, the Engineering and Physical Sciences Research Council (EPSRC) grant no. EP/V054236/1 and the Science and Technology Facilities Council (STFC) grant no. ST/V006126/1.The use of data has been essential throughout the unfolding COVID-19 pandemic. We have needed it to populate our models, inform our understanding, and shape our responses to the disease. However, data has not always been easy to find and access, it has varied in quality and coverage, been difficult to reuse or repurpose. This paper reviews these and other challenges and recommends steps to develop a data ecosystem better able to deal with future pandemics by better supporting preparedness, prevention, detection and response.Publisher PDFPeer reviewe
A spatiotemporal multispecies model of a semicontinuous response
BS was part funded by Engineering and Physical Sciences Research Council–Natural Environment Research Council grant EP/10009171/1.As accessible and potentially vulnerable species high up in the food chain, birds are often used as indicator species to highlight changes in ecosystems. This study focuses on multiple spatially dependent relationships between a raptor (sparrowhawk), a potential prey species (house sparrow) and a sympatric species (collared doves) in space and time. We construct a complex spatiotemporal latent Gaussian model to incorporate both predator–prey and sympatric relationships, which is novel in two ways. First, different types of species interactions are represented by a shared spatiotemporal random effect, which extends existing approaches to multivariate spatial modelling through the use of a joint latent modelling approach. Second, we use a delta–gamma model to capture the semicontinuous nature of the data to model the binary and continuous sections of the response jointly. The results indicate that sparrowhawks have a localized effect on the presence of house sparrows, which could indicate that house sparrows avoid sites where sparrowhawks are present.PostprintPeer reviewe
- …