2,218 research outputs found

    Improving process algebra model structure and parameters in infectious disease epidemiology through data mining

    Get PDF
    Computational models are increasingly used to assist decision-making in public health epidemiology, but achieving the best model is a complex task due to the interaction of many components and variability of parameter values causing radically different dynamics. The modelling process can be enhanced through the use of data mining techniques. Here, we demonstrate this by applying association rules and clustering techniques to two stages of mod- elling: identifying pertinent structures in the initial model creation stage, and choosing optimal parameters to match that model to observed data. This is illustrated through application to the study of the circulating mumps virus in Scotland, 2004-2015

    Decision Support Based on Bio-PEPA Modeling and Decision Tree Induction: A New Approach, Applied to a Tuberculosis Case Study

    Get PDF
    The problem of selecting determinant features generating appropriate model structure is a challenge in epidemiological modelling. Disease spread is highly complex, and experts develop their understanding of its dynamic over years. There is an increasing variety and volume of epidemiological data which adds to the potential confusion. We propose here to make use of that data to better understand disease systems. Decision tree techniques have been extensively used to extract pertinent information and improve decision making. In this paper, we propose an innovative structured approach combining decision tree induction with Bio-PEPA computational modelling, and illustrate the approach through application to tuberculosis. By using decision tree induction, the enhanced Bio-PEPA model shows considerable improvement over the initial model with regard to the simulated results matching observed data. The key finding is that the developer expresses a realistic predictive model using relevant features, thus considering this approach as decision support, empowers the epidemiologist in his policy decision making

    Effective use of evolutionary computation to parameterise an epidemiological model

    Get PDF
    Predictive epidemiological models are able to be used most effectively when they have first been shown to fit historical data. Finding the right parameters settings for a model is complex: the system is likely to be noisy, the data points may be sparse, and there may be many inter-related parameters. We apply computational intelligence and data mining techniques in novel ways to investigate this significant problem. We construct an original computational model of human papilloma virus and cervical intraepithelial neoplasia with the ultimate aim of predicting the outcomes of varying control techniques (e.g. vaccination, screening, treatment, quarantine). Two computational intelligence techniques (genetic algorithms and particle swarm optimisation) are used over one- stage and two-stage optimisations for eight real-valued model parameters. Rigorous comparison over a variety of quantitative measures demonstrates the explorative nature of the genetic algorithm (useful in this parameter space to support the modeller). Correlations between parameters are drawn out that might otherwise be missed. Clustering highlights the uniformity of the best genetic algorithm results. Prediction of gender-neutral vaccination with the tuned model suggests elimination of the virus across vaccinated and cross-protected strains, supporting recent Scottish government policy. This preliminary study lays the foundation for more widespread use of computational intelligence techniques in epidemiological modelling

    Stat Med

    Get PDF
    Creating statistical models that generate accurate predictions of infectious disease incidence is a challenging problem whose solution could benefit public health decision makers. We develop a new approach to this problem using kernel conditional density estimation (KCDE) and copulas. We obtain predictive distributions for incidence in individual weeks using KCDE and tie those distributions together into joint distributions using copulas. This strategy enables us to create predictions for the timing of and incidence in the peak week of the season. Our implementation of KCDE incorporates 2 novel kernel components: a periodic component that captures seasonality in disease incidence and a component that allows for a full parameterization of the bandwidth matrix with discrete variables. We demonstrate via simulation that a fully parameterized bandwidth matrix can be beneficial for estimating conditional densities. We apply the method to predicting dengue fever and influenza and compare to a seasonal autoregressive integrated moving average model and HHH4, a previously published extension to the generalized linear model framework developed for infectious disease incidence. The KCDE outperforms the baseline methods for predictions of dengue incidence in individual weeks. The KCDE also offers more consistent performance than the baseline models for predictions of incidence in the peak week and is comparable to the baseline models on the other prediction targets. Using the periodic kernel function led to better predictions of incidence. Our approach and extensions of it could yield improved predictions for public health decision makers, particularly in diseases with heterogeneous seasonal dynamics such as dengue fever.CC999999/Intramural CDC HHS/United StatesR01 AI102939/AI/NIAID NIH HHS/United StatesR21 AI115173/AI/NIAID NIH HHS/United States2018-12-30T00:00:00Z28905403PMC5771499vault:2585

    Quantifying Aggregated Uncertainty in Plasmodium falciparum Malaria Prevalence and Populations at Risk via Efficient Space-Time Geostatistical Joint Simulation

    Get PDF
    Risk maps estimating the spatial distribution of infectious diseases are required to guide public health policy from local to global scales. The advent of model-based geostatistics (MBG) has allowed these maps to be generated in a formal statistical framework, providing robust metrics of map uncertainty that enhances their utility for decision-makers. In many settings, decision-makers require spatially aggregated measures over large regions such as the mean prevalence within a country or administrative region, or national populations living under different levels of risk. Existing MBG mapping approaches provide suitable metrics of local uncertainty—the fidelity of predictions at each mapped pixel—but have not been adapted for measuring uncertainty over large areas, due largely to a series of fundamental computational constraints. Here the authors present a new efficient approximating algorithm that can generate for the first time the necessary joint simulation of prevalence values across the very large prediction spaces needed for global scale mapping. This new approach is implemented in conjunction with an established model for P. falciparum allowing robust estimates of mean prevalence at any specified level of spatial aggregation. The model is used to provide estimates of national populations at risk under three policy-relevant prevalence thresholds, along with accompanying model-based measures of uncertainty. By overcoming previously unchallenged computational barriers, this study illustrates how MBG approaches, already at the forefront of infectious disease mapping, can be extended to provide large-scale aggregate measures appropriate for decision-makers

    Measuring infection transmission in a stochastic SIV model with infection reintroduction and imperfect vaccine

    Get PDF
    An additional compartment of vaccinated individuals is considered in a SIS stochastic epidemic model with infection reintroduction. The quantification of the spread of the disease is modeled by a continuous time Markov chain. A well-known measure of the initial transmission potential is the basic reproduction number R0R_0, which determines the herd immunity threshold or the critical proportion of immune individuals required to stop the spread of a disease when a vaccine offers a complete protection. Due to repeated contacts between the typical infective and previously infected individuals, R0R_0 overestimates the average number of secondary infections and leads to, perhaps unnecessary, high immunization coverage. Assuming that the vaccine is imperfect, alternative measures to R0R_0 are defined in order to study the influence of the initial coverage and vaccine efficacy on the transmission of the epidemic

    Measuring infection transmission in a stochastic SIV model with infection reintroduction and imperfect vaccine

    Get PDF
    An additional compartment of vaccinated individuals is considered in a SIS stochastic epidemic model with infection reintroduction. The quantification of the spread of the disease is modeled by a continuous time Markov chain. A well-known measure of the initial transmission potential is the basic reproduction number R0R_0, which determines the herd immunity threshold or the critical proportion of immune individuals required to stop the spread of a disease when a vaccine offers a complete protection. Due to repeated contacts between the typical infective and previously infected individuals, R0R_0 overestimates the average number of secondary infections and leads to, perhaps unnecessary, high immunization coverage. Assuming that the vaccine is imperfect, alternative measures to R0R_0 are defined in order to study the influence of the initial coverage and vaccine efficacy on the transmission of the epidemic
    • …
    corecore