84 research outputs found
Statistical inference of the time-varying structure of gene-regulation networks
<p>Abstract</p> <p>Background</p> <p>Biological networks are highly dynamic in response to environmental and physiological cues. This variability is in contrast to conventional analyses of biological networks, which have overwhelmingly employed static graph models which stay constant over time to describe biological systems and their underlying molecular interactions.</p> <p>Methods</p> <p>To overcome these limitations, we propose here a new statistical modelling framework, the ARTIVA formalism (Auto Regressive TIme VArying models), and an associated inferential procedure that allows us to learn temporally varying gene-regulation networks from biological time-course expression data. ARTIVA simultaneously infers the topology of a regulatory network and how it changes over time. It allows us to recover the chronology of regulatory associations for individual genes involved in a specific biological process (development, stress response, etc.).</p> <p>Results</p> <p>We demonstrate that the ARTIVA approach generates detailed insights into the function and dynamics of complex biological systems and exploits efficiently time-course data in systems biology. In particular, two biological scenarios are analyzed: the developmental stages of <it>Drosophila melanogaster </it>and the response of <it>Saccharomyces cerevisiae </it>to benomyl poisoning.</p> <p>Conclusions</p> <p>ARTIVA does recover essential temporal dependencies in biological systems from transcriptional data, and provide a natural starting point to learn and investigate their dynamics in greater detail.</p
Bayesian detection of piecewise linear trends in replicated time-series with application to growth data modelling
We consider the situation where a temporal process is composed of contiguous
segments with differing slopes and replicated noise-corrupted time series
measurements are observed. The unknown mean of the data generating process is
modelled as a piecewise linear function of time with an unknown number of
change-points. We develop a Bayesian approach to infer the joint posterior
distribution of the number and position of change-points as well as the unknown
mean parameters. A-priori, the proposed model uses an overfitting number of
mean parameters but, conditionally on a set of change-points, only a subset of
them influences the likelihood. An exponentially decreasing prior distribution
on the number of change-points gives rise to a posterior distribution
concentrating on sparse representations of the underlying sequence. A
Metropolis-Hastings Markov chain Monte Carlo (MCMC) sampler is constructed for
approximating the posterior distribution. Our method is benchmarked using
simulated data and is applied to uncover differences in the dynamics of fungal
growth from imaging time course data collected from different strains. The
source code is available on CRAN.Comment: Accepted to International Journal of Biostatistic
Analysis of Heterogeneous Data Sources for Veterinary Syndromic Surveillance to Improve Public Health Response and Aid Decision Making
The standard technique of implementing veterinary syndromic surveillance (VSyS) is the detection of temporal or spatial anomalies in the occurrence of health incidents above a set threshold in an observed population using the Frequentist modelling approach. Most implementation of this technique also requires the removal of historical outbreaks from the datasets to construct baselines. Unfortunately, some challenges exist, such as data scarcity, delayed reporting of health incidents, and variable data availability from sources, which make the VSyS implementation and alarm interpretation difficult, particularly when quantifying surveillance risk with associated uncertainties. This problem indicates that alternate or improved techniques are required to interpret alarms when incorporating uncertainties and previous knowledge of health incidents into the model to inform decision-making. Such methods must be capable of retaining historical outbreaks to assess surveillance risk.
In this research work, the Stochastic Quantitative Risk Assessment (SQRA) model was proposed and developed for detecting and quantifying the risk of disease outbreaks with associated uncertainties using the Bayesian probabilistic approach in PyMC3. A systematic and comparative evaluation of the available techniques was used to select the most appropriate method and software packages based on flexibility, efficiency, usability, ability to retain historical outbreaks, and the ease of developing a model in Python. The social media datasets (Twitter) were first applied to infer a possible disease outbreak incident with associated uncertainties. Then, the inferences were subsequently updated using datasets from the clinical and other healthcare sources to reduce uncertainties in the model and validate the outbreak. Therefore, the proposed SQRA model demonstrates an approach that uses the successive refinement of analysis of different data streams to define a changepoint signalling a disease outbreak.
The SQRA model was tested and validated to show the method's effectiveness and reliability for differentiating and identifying risk regions with corresponding changepoints to interpret an ongoing disease outbreak incident. This demonstrates that a technique such as the SQRA method obtained through this research may aid in overcoming some of the difficulties identified in VSyS, such as data scarcity, delayed reporting, and variable availability of data from sources, ultimately contributing to science and practice
Selective review of offline change point detection methods
This article presents a selective survey of algorithms for the offline
detection of multiple change points in multivariate time series. A general yet
structuring methodological strategy is adopted to organize this vast body of
work. More precisely, detection algorithms considered in this review are
characterized by three elements: a cost function, a search method and a
constraint on the number of changes. Each of those elements is described,
reviewed and discussed separately. Implementations of the main algorithms
described in this article are provided within a Python package called ruptures
- …