20 research outputs found

    Does Air Pollution Trigger Infant Mortality in Western Europe? A Case-Crossover Study

    Get PDF
    Background: Numerous studies show associations between fine particulate air pollutants [particulate matter with an aerodynamic diameter ā‰¤ 10 Ī¼m (PM10)] and mortality in adults

    Employing phylogenetic tree shape statistics to resolve the underlying host population structure

    Get PDF
    BACKGROUND: Host population structure is a key determinant of pathogen and infectious disease transmission patterns. Pathogen phylogenetic trees are useful tools to reveal the population structure underlying an epidemic. Determining whether a population is structured or not is useful in informing the type of phylogenetic methods to be used in a given study. We employ tree statistics derived from phylogenetic trees and machine learning classification techniques to reveal an underlying population structure. RESULTS: In this paper, we simulate phylogenetic trees from both structured and non-structured host populations. We compute eight statistics for the simulated trees, which are: the number of cherries; Sackin, Colless and total cophenetic indices; ladder length; maximum depth; maximum width, and width-to-depth ratio. Based on the estimated tree statistics, we classify the simulated trees as from either a non-structured or a structured population using the decision tree (DT), K-nearest neighbor (KNN) and support vector machine (SVM). We incorporate the basic reproductive number ([Formula: see text]) in our tree simulation procedure. Sensitivity analysis is done to investigate whether the classifiers are robust to different choice of model parameters and to size of trees. Cross-validated results for area under the curve (AUC) for receiver operating characteristic (ROC) curves yield mean values of over 0.9 for most of the classification models. CONCLUSIONS: Our classification procedure distinguishes well between trees from structured and non-structured populations using the classifiers, the two-sample Kolmogorov-Smirnov, Cucconi and Podgor-Gastwirth tests and the box plots. SVM models were more robust to changes in model parameters and tree size compared to KNN and DT classifiers. Our classification procedure was applied to real -world data and the structured population was revealed with high accuracy of [Formula: see text] using SVM-polynomial classifier

    The zero-inflated negative binomial regression model with correction for misclassification: an example in caries research

    No full text
    Zero-inflated models for count data are becoming quite popular nowadays and are found in many application areas, such as medicine, economics, biology, sociology and so on. However, in practice these counts are often prone to measurement error which in this case boils down to misclassification. Methods to deal with misclassification of counts have been suggested recently, but only for the binomial model and the Poisson model. Here we look at a more complex model, that is, the zero-inflated negative binomial, and illustrate how correction for misclassification can be achieved. Our approach is illustrated on the dmft-index which is a popular measure for caries experience in caries research. An extra problem was the fact that several dental examiners were involved in scoring caries experience. Using our example, we illustrate how a non-differential misclassification process for each examiner can lead to differential misclassification overall.status: publishe

    Modeling the Effects of Nonpharmaceutical Interventions on COVID-19 Spread in Kenya

    No full text
    Mathematical modeling of nonpharmaceutical interventions (NPIs) of coronavirus disease (COVID-19) in Kenya is presented. A susceptible-exposed-infected-recovered (SEIR) compartment model is considered with additional compartments of hospitalized population whose condition is severe or critical and the fatality compartment. The basic reproduction number (R0) is computed by the next-generation matrix approach and later expressed as a time-dependent function so as to incorporate the NPIs into the model. The resulting system of ordinary differential equations (ODEs) is solved using fourth-order and fifth-order Rungeā€“Kutta methods. Different intervention scenarios are considered, and the results show that implementation of closure of education institutions, curfew, and partial lockdown yields predicted delayed peaks of the overall infections, severe cases, and fatalities and subsequently containment of the pandemic in the country

    Flexible Bayesian semiparametric mixed-effects model for skewed longitudinal data

    No full text
    Abstract Background In clinical trials and epidemiological research, mixed-effects models are commonly used to examine population-level and subject-specific trajectories of biomarkers over time. Despite their increasing popularity and application, the specification of these models necessitates a great deal of care when analysing longitudinal data with non-linear patterns and asymmetry. Parametric (linear) mixed-effect models may not capture these complexities flexibly and adequately. Additionally, assuming a Gaussian distribution for random effects and/or model errors may be overly restrictive, as it lacks robustness against deviations from symmetry. Methods This paper presents a semiparametric mixed-effects model with flexible distributions for complex longitudinal data in the Bayesian paradigm. The non-linear time effect on the longitudinal response was modelled using a spline approach. The multivariate skew-t distribution, which is a more flexible distribution, is utilized to relax the normality assumptions associated with both random-effects and model errors. Results To assess the effectiveness of the proposed methods in various model settings, simulation studies were conducted. We then applied these models on chronic kidney disease (CKD) data and assessed the relationship between covariates and estimated glomerular filtration rate (eGFR). First, we compared the proposed semiparametric partially linear mixed-effect (SPPLM) model with the fully parametric one (FPLM), and the results indicated that the SPPLM model outperformed the FPLM model. We then further compared four different SPPLM models, each assuming different distributions for the random effects and model errors. The model with a skew-t distribution exhibited a superior fit to the CKD data compared to the Gaussian model. The findings from the application revealed that hypertension, diabetes, and follow-up time had a substantial association with kidney function, specifically leading to a decrease in GFR estimates. Conclusions The application and simulation studies have demonstrated that our work has made a significant contribution towards a more robust and adaptable methodology for modeling intricate longitudinal data. We achieved this by proposing a semiparametric Bayesian modeling approach with a spline smoothing function and a skew-t distribution

    Amoud Class for Hazard-Based and Odds-Based Regression Models: Application to Oncology Studies

    No full text
    The purpose of this study is to propose a novel, general, tractable, fully parametric class for hazard-based and odds-based models of survival regression for the analysis of censored lifetime data, named as the ā€œAmoud class (AM)ā€ of models. This generality was attained using a structure resembling the general class of hazard-based regression models, with the addition that the baseline odds function is multiplied by a link function. The class is broad enough to cover a number of widely used models, including the proportional hazard model, the general hazard model, the proportional odds model, the general odds model, the accelerated hazards model, the accelerated odds model, and the accelerated failure time model, as well as combinations of these. The proposed class incorporates the analysis of crossing survival curves. Based on a versatile parametric distribution (generalized log-logistic) for the baseline hazard, we introduced a technique for applying these various hazard-based and odds-based regression models. This distribution allows us to cover the most common hazard rate shapes in practice (decreasing, constant, increasing, unimodal, and reversible unimodal), and various common survival distributions (Weibull, Burr-XII, log-logistic, exponential) are its special cases. The proposed model has good inferential features, and it performs well when different information criteria and likelihood ratio tests are used to select hazard-based and odds-based regression models. The proposed modelā€™s utility is demonstrated by an application to a right-censored lifetime dataset with crossing survival curves

    Amoud Class for Hazard-Based and Odds-Based Regression Models: Application to Oncology Studies

    No full text
    The purpose of this study is to propose a novel, general, tractable, fully parametric class for hazard-based and odds-based models of survival regression for the analysis of censored lifetime data, named as the “Amoud class (AM)” of models. This generality was attained using a structure resembling the general class of hazard-based regression models, with the addition that the baseline odds function is multiplied by a link function. The class is broad enough to cover a number of widely used models, including the proportional hazard model, the general hazard model, the proportional odds model, the general odds model, the accelerated hazards model, the accelerated odds model, and the accelerated failure time model, as well as combinations of these. The proposed class incorporates the analysis of crossing survival curves. Based on a versatile parametric distribution (generalized log-logistic) for the baseline hazard, we introduced a technique for applying these various hazard-based and odds-based regression models. This distribution allows us to cover the most common hazard rate shapes in practice (decreasing, constant, increasing, unimodal, and reversible unimodal), and various common survival distributions (Weibull, Burr-XII, log-logistic, exponential) are its special cases. The proposed model has good inferential features, and it performs well when different information criteria and likelihood ratio tests are used to select hazard-based and odds-based regression models. The proposed model’s utility is demonstrated by an application to a right-censored lifetime dataset with crossing survival curves
    corecore