325,668 research outputs found

    Predicting the outcome of ankylosing spondylitis therapy

    Get PDF
    Objectives To create a model that provides a potential basis for candidate selection for anti-tumour necrosis factor (TNF) treatment by predicting future outcomes relative to the current disease profile of individual patients with ankylosing spondylitis (AS). Methods ASSERT and GO-RAISE trial data (n=635) were analysed to identify baseline predictors for various disease-state and disease-activity outcome instruments in AS. Univariate, multivariate, receiver operator characteristic and correlation analyses were performed to select final predictors. Their associations with outcomes were explored. Matrix and algorithm-based prediction models were created using logistic and linear regression, and their accuracies were compared. Numbers needed to treat were calculated to compare the effect size of anti-TNF therapy between the AS matrix subpopulations. Data from registry populations were applied to study how a daily practice AS population is distributed over the prediction model. Results Age, Bath ankylosing spondylitis functional index (BASFI) score, enthesitis, therapy, C-reactive protein (CRP) and HLA-B27 genotype were identified as predictors. Their associations with each outcome instrument varied. However, the combination of these factors enabled adequate prediction of each outcome studied. The matrix model predicted outcomes as well as algorithm-based models and enabled direct comparison of the effect size of anti-TNF treatment outcome in various subpopulations. The trial populations reflected the daily practice AS population. Conclusion Age, BASFI, enthesitis, therapy, CRP and HLA-B27 were associated with outcomes in AS. Their combined use enables adequate prediction of outcome resulting from anti-TNF and conventional therapy in various AS subpopulations. This may help guide clinicians in making treatment decisions in daily practice.Pathophysiology and treatment of rheumatic disease

    A study of pre-validation

    Full text link
    Given a predictor of outcome derived from a high-dimensional dataset, pre-validation is a useful technique for comparing it to competing predictors on the same dataset. For microarray data, it allows one to compare a newly derived predictor for disease outcome to standard clinical predictors on the same dataset. We study pre-validation analytically to determine if the inferences drawn from it are valid. We show that while pre-validation generally works well, the straightforward "one degree of freedom" analytical test from pre-validation can be biased and we propose a permutation test to remedy this problem. In simulation studies, we show that the permutation test has the nominal level and achieves roughly the same power as the analytical test.Comment: Published in at http://dx.doi.org/10.1214/07-AOAS152 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org

    Child-related characteristics predicting subsequent health-related quality of life in 8- to 14-year-old children with and without cerebellar tumors: a prospective longitudinal study

    No full text
    BackgroundWe identified child-related determinants of health-related quality of life (HRQoL) in children aged 8–14 years who were treated for 2 common types of pediatric brain tumors. MethodsQuestionnaire measures of HRQoL and psychometric assessments were completed by 110 children on 3 occasions over 24 months. Of these 110, 72 were within 3 years of diagnosis of a cerebellar tumor (37 standard-risk medulloblastoma, 35 low-grade cerebellar astrocytoma), and 38 were in a nontumor group. HRQoL, executive function, health status, and behavioral difficulties were also assessed by parents and teachers as appropriate. Regression modeling was used to relate HRQoL z scores to age, sex, socioeconomic status, and 5 domains of functioning: Cognition, Emotion, Social, Motor and Sensory, and Behavior. ResultsHRQoL z scores were significantly lower after astrocytoma than those in the nontumor group and significantly lower again in the medulloblastoma group, both by self-report and by parent-report. In regression modeling, significant child-related predictors of poorer HRQoL z scores by self-report were poorer cognitive and emotional function (both z scores) and greater age (years) at enrollment (B = 0.038, 0.098, 0.136, respectively). By parent-report, poorer cognitive, emotional and motor or sensory function (z score) were predictive of lower subsequent HRQoL of the child (B = 0.043, 0.112, 0.019, respectively), while age at enrollment was not. ConclusionsEarly screening of cognitive and emotional function in this age group, which are potentially amenable to change, could identify those at risk of poor HRQoL and provide a rational basis for interventions to improve HRQoL

    An investigation into machine learning approaches for forecasting spatio-temporal demand in ride-hailing service

    Full text link
    In this paper, we present machine learning approaches for characterizing and forecasting the short-term demand for on-demand ride-hailing services. We propose the spatio-temporal estimation of the demand that is a function of variable effects related to traffic, pricing and weather conditions. With respect to the methodology, a single decision tree, bootstrap-aggregated (bagged) decision trees, random forest, boosted decision trees, and artificial neural network for regression have been adapted and systematically compared using various statistics, e.g. R-square, Root Mean Square Error (RMSE), and slope. To better assess the quality of the models, they have been tested on a real case study using the data of DiDi Chuxing, the main on-demand ride hailing service provider in China. In the current study, 199,584 time-slots describing the spatio-temporal ride-hailing demand has been extracted with an aggregated-time interval of 10 mins. All the methods are trained and validated on the basis of two independent samples from this dataset. The results revealed that boosted decision trees provide the best prediction accuracy (RMSE=16.41), while avoiding the risk of over-fitting, followed by artificial neural network (20.09), random forest (23.50), bagged decision trees (24.29) and single decision tree (33.55).Comment: Currently under review for journal publicatio

    Scalable Bayesian model averaging through local information propagation

    Full text link
    We show that a probabilistic version of the classical forward-stepwise variable inclusion procedure can serve as a general data-augmentation scheme for model space distributions in (generalized) linear models. This latent variable representation takes the form of a Markov process, thereby allowing information propagation algorithms to be applied for sampling from model space posteriors. In particular, we propose a sequential Monte Carlo method for achieving effective unbiased Bayesian model averaging in high-dimensional problems, utilizing proposal distributions constructed using local information propagation. We illustrate our method---called LIPS for local information propagation based sampling---through real and simulated examples with dimensionality ranging from 15 to 1,000, and compare its performance in estimating posterior inclusion probabilities and in out-of-sample prediction to those of several other methods---namely, MCMC, BAS, iBMA, and LASSO. In addition, we show that the latent variable representation can also serve as a modeling tool for specifying model space priors that account for knowledge regarding model complexity and conditional inclusion relationships
    corecore