2,559 research outputs found

    Predicting the outcome of patients with subarachnoid hemorrhage using machine learning techniques

    Get PDF
    Background: Outcome prediction for subarachnoid hemorrhage (SAH) helps guide care and compare global management strategies. Logistic regression models for outcome prediction may be cumbersome to apply in clinical practice. Objective: To use machine learning techniques to build a model of outcome prediction that makes the knowledge discovered from the data explicit and communicable to domain experts. Material and methods: A derivation cohort (n = 441) of nonselected SAH cases was analyzed using different classification algorithms to generate decision trees and decision rules. Algorithms used were C4.5, fast decision tree learner, partial decision trees, repeated incremental pruning to produce error reduction, nearest neighbor with generalization, and ripple down rule learner. Outcome was dichotomized in favorable [Glasgow outcome scale (GOS) = I–II] and poor (GOS = III–V). An independent cohort (n = 193) was used for validation. An exploratory questionnaire was given to potential users (specialist doctors) to gather their opinion on the classifier and its usability in clinical routine. Results: The best classifier was obtained with the C4.5 algorithm. It uses only two attributes [World Federation of Neurological Surgeons (WFNS) and Fisher’s scale] and leads to a simple decision tree. The accuracy of the classifier [area under the ROC curve (AUC) = 0.84; confidence interval (CI) = 0.80–0.88] is similar to that obtained by a logistic regression model (AUC = 0.86; CI = 0.83–0.89) derived from the same data and is considered better fit for clinical use.This work was supported in part by the Spanish Ministries of Science under Grant TRA2007-67374-C02-02 and Health under Grant FIS PI 070152. The work of A. Lagares and J.F. Alen was supported by the Fundación Mutua Madrileña

    Information methods for predicting risk and outcome of stroke

    Get PDF
    Stroke is a major cause of disability and mortality in most economically developed countries. It is the second leading cause of death worldwide (after cancer and heart disease) [55.1, 2] and a major cause of disability in adults in developed countries [55.3]. Personalized modeling is an emerging effective computational approach, which has been applied to various disciplines, such as in personalized drug design, ecology, business, and crime prevention; it has recently become more prominent in biomedical applications. Biomedical data on stroke risk factors and prognostic data are available in a large volume, but the data are complex and often difficult to apply to a specific person. Individualizing stroke risk prediction and prognosis will allow patients to focus on risk factors specific to them, thereby reducing their stroke risk and managing stroke outcomes more effectively. This chapter reviews various methods–conventional statistical methods and computational intelligent modeling methods for predicting risk and outcome of stroke

    Outcome prediction in aneurysmal subarachnoid hemorrhage: a comparison of machine learning methods and established clinico-radiological scores

    Get PDF
    Reliable prediction of outcomes of aneurysmal subarachnoid hemorrhage (aSAH) based on factors available at patient admission may support responsible allocation of resources as well as treatment decisions. Radiographic and clinical scoring systems may help clinicians estimate disease severity, but their predictive value is limited, especially in devising treatment strategies. In this study, we aimed to examine whether a machine learning (ML) approach using variables available on admission may improve outcome prediction in aSAH compared to established scoring systems. Combined clinical and radiographic features as well as standard scores (Hunt & Hess, WFNS, BNI, Fisher, and VASOGRADE) available on patient admission were analyzed using a consecutive single-center database of patients that presented with aSAH (n = 388). Different ML models (seven algorithms including three types of traditional generalized linear models, as well as a tree bosting algorithm, a support vector machine classifier (SVMC), a Naive Bayes (NB) classifier, and a multilayer perceptron (MLP) artificial neural net) were trained for single features, scores, and combined features with a random split into training and test sets (4:1 ratio), ten-fold cross-validation, and 50 shuffles. For combined features, feature importance was calculated. There was no difference in performance between traditional and other ML applications using traditional clinico-radiographic features. Also, no relevant difference was identified between a combined set of clinico-radiological features available on admission (highest AUC 0.78, tree boosting) and the best performing clinical score GCS (highest AUC 0.76, tree boosting). GCS and age were the most important variables for the feature combination. In this cohort of patients with aSAH, the performance of functional outcome prediction by machine learning techniques was comparable to traditional methods and established clinical scores. Future work is necessary to examine input variables other than traditional clinico-radiographic features and to evaluate whether a higher performance for outcome prediction in aSAH can be achieved

    Rescue therapy for vasospasm following aneurysmal subarachnoid hemorrhage:a propensity score-matched analysis with machine learning

    Get PDF
    OBJECTIVE Rescue therapies have been recommended for patients with angiographic vasospasm (aVSP) and delayed cerebral ischemia (DCI) following subarachnoid hemorrhage (SAH). However, there is little evidence from randomized clinical trials that these therapies are safe and effective. The primary aim of this study was to apply game theory-based methods in explainable machine learning (ML) and propensity score matching to determine if rescue therapy was associated with better 3-month outcomes following post-SAH aVSP and DCI. The authors also sought to use these explainable ML methods to identify patient populations that were more likely to receive rescue therapy and factors associated with better outcomes after rescue therapy. METHODS Data for patients with aVSP or DCI after SAH were obtained from 8 clinical trials and 1 observational study in the Subarachnoid Hemorrhage International Trialists repository. Gradient boosting ML models were constructed for each patient to predict the probability of receiving rescue therapy and the 3-month Glasgow Outcome Scale (GOS) score. Favorable outcome was defined as a 3-month GOS score of 4 or 5. Shapley Additive Explanation (SNAP) values were calculated for each patient-derived model to quantify feature importance and interaction effects. Variables with high S HAP importance in predicting rescue therapy administration were used in a propensity score-matched analysis of rescue therapy and 3-month GOS scores. RESULTS The authors identified 1532 patients with aVSP or DCI. Predictive, explainable ML models revealed that aneurysm characteristics and neurological complications, but not admission neurological scores, carried the highest relative importance rankings in predicting whether rescue therapy was administered. Younger age and absence of cerebral ischemia/ infarction were invariably linked to better rescue outcomes, whereas the other important predictors of outcome varied by rescue type (interventional or noninterventional). In a propensity score-matched analysis guided by SHAP-based variable selection, rescue therapy was associated with higher odds of 3-month GOS scores of 4-5 (OR 1.63, 95% CI 1.22-2.17). CONCLUSIONS Rescue therapy may increase the odds of good outcome in patients with aVSP or DCI after SAH. Given the strong association between cerebral ischemia/infarction and poor outcome, trials focusing on preventative or therapeutic interventions in these patients may be most able to demonstrate improvements in clinical outcomes. Insights developed from these models may be helpful for improving patient selection and trial design

    Development of machine learning models to prognosticate chronic shunt-dependent hydrocephalus after aneurysmal subarachnoid hemorrhage

    Get PDF
    Background: Shunt-dependent hydrocephalus significantly complicates subarachnoid hemorrhage (SAH), and reliable prognosis methods have been sought in recent years to reduce morbidity and costs associated with delayed treatment or neglected onset. Machine learning (ML) defines modern data analysis techniques allowing accurate subject-based risk stratifications. We aimed at developing and testing different ML models to predict shunt-dependent hydrocephalus after aneurysmal SAH. Methods: We consulted electronic records of patients with aneurysmal SAH treated at our institution between January 2013 and March 2019. We selected variables for the models according to the results of the previous works on this topic. We trained and tested four ML algorithms on three datasets: one containing binary variables, one considering variables associated with shunt-dependency after an explorative analysis, and one including all variables. For each model, we calculated AUROC, specificity, sensitivity, accuracy, PPV, and also, on the validation set, the NPV and the Matthews correlation coefficient (ϕ). Results: Three hundred eighty-six patients were included. Fifty patients (12.9%) developed shunt-dependency after a mean follow-up of 19.7 (± 12.6) months. Complete information was retrieved for 32 variables, used to train the models. The best models were selected based on the performances on the validation set and were achieved with a distributed random forest model considering 21 variables, with a ϕ = 0.59, AUC = 0.88; sensitivity and specificity of 0.73 (C.I.: 0.39–0.94) and 0.92 (C.I.: 0.84–0.97), respectively; PPV = 0.59 (0.38–0.77); and NPV = 0.96 (0.90–0.98). Accuracy was 0.90 (0.82–0.95). Conclusions: Machine learning prognostic models allow accurate predictions with a large number of variables and a more subject-oriented prognosis. We identified a single best distributed random forest model, with an excellent prognostic capacity (ϕ = 0.58), which could be especially helpful in identifying low-risk patients for shunt-dependency

    DEVELOPING NOVEL COMPUTER-AIDED DETECTION AND DIAGNOSIS SYSTEMS OF MEDICAL IMAGES

    Get PDF
    Reading medical images to detect and diagnose diseases is often difficult and has large inter-reader variability. To address this issue, developing computer-aided detection and diagnosis (CAD) schemes or systems of medical images has attracted broad research interest in the last several decades. Despite great effort and significant progress in previous studies, only limited CAD schemes have been used in clinical practice. Thus, developing new CAD schemes is still a hot research topic in medical imaging informatics field. In this dissertation, I investigate the feasibility of developing several new innovative CAD schemes for different application purposes. First, to predict breast tumor response to neoadjuvant chemotherapy and reduce unnecessary aggressive surgery, I developed two CAD schemes of breast magnetic resonance imaging (MRI) to generate quantitative image markers based on quantitative analysis of global kinetic features. Using the image marker computed from breast MRI acquired pre-chemotherapy, CAD scheme enables to predict radiographic complete response (CR) of breast tumors to neoadjuvant chemotherapy, while using the imaging marker based on the fusion of kinetic and texture features extracted from breast MRI performed after neoadjuvant chemotherapy, CAD scheme can better predict the pathologic complete response (pCR) of the patients. Second, to more accurately predict prognosis of stroke patients, quantifying brain hemorrhage and ventricular cerebrospinal fluid depicting on brain CT images can play an important role. For this purpose, I developed a new interactive CAD tool to segment hemorrhage regions and extract radiological imaging marker to quantitatively determine the severity of aneurysmal subarachnoid hemorrhage at presentation and correlate the estimation with various homeostatic/metabolic derangements and predict clinical outcome. Third, to improve the efficiency of primary antibody screening processes in new cancer drug development, I developed a CAD scheme to automatically identify the non-negative tissue slides, which indicate reactive antibodies in digital pathology images. Last, to improve operation efficiency and reliability of storing digital pathology image data, I developed a CAD scheme using optical character recognition algorithm to automatically extract metadata from tissue slide label images and reduce manual entry for slide tracking and archiving in the tissue pathology laboratories. In summary, in these studies, we developed and tested several innovative approaches to identify quantitative imaging markers with high discriminatory power. In all CAD schemes, the graphic user interface-based visual aid tools were also developed and implemented. Study results demonstrated feasibility of applying CAD technology to several new application fields, which has potential to assist radiologists, oncologists and pathologists improving accuracy and consistency in disease diagnosis and prognosis assessment of using medical image

    Validation and Optimization of Barrow Neurological Institute Score in Prediction of Adverse Events and Functional Outcome After Subarachnoid Hemorrhage-Creation of the HATCH (Hemorrhage, Age, Treatment, Clinical State, Hydrocephalus) Score.

    Get PDF
    BACKGROUND: The Barrow Neurological Institute (BNI) score, measuring maximal thickness of aneurysmal subarachnoid hemorrhage (aSAH), has previously shown to predict symptomatic cerebral vasospasms (CVSs), delayed cerebral ischemia (DCI), and functional outcome. OBJECTIVE: To validate the BNI score for prediction of above-mentioned variables and cerebral infarct and evaluate its improvement by integrating further variables which are available within the first 24 h after hemorrhage. METHODS: We included patients from a single center. The BNI score for prediction of CVS, DCI, infarct, and functional outcome was validated in our cohort using measurements of calibration and discrimination (area under the curve [AUC]). We improved it by adding additional variables, creating a novel risk score (measure by the dichotomized Glasgow Outcome Scale) and validated it in a small independent cohort. RESULTS: Of 646 patients, 41.5% developed symptomatic CVS, 22.9% DCI, 23.5% cerebral infarct, and 29% had an unfavorable outcome. The BNI score was associated with all outcome measurements. We improved functional outcome prediction accuracy by including age, BNI score, World Federation of Neurologic Surgeons, rebleeding, clipping, and hydrocephalus (AUC 0.84, 95% CI 0.8-0.87). Based on this model we created a risk score (HATCH-Hemorrhage, Age, Treatment, Clinical State, Hydrocephalus), ranging 0 to 13 points. We validated it in a small independent cohort. The validated score demonstrated very good discriminative ability (AUC 0.84 [95% CI 0.72-0.96]). CONCLUSION: We developed the HATCH score, which is a moderate predictor of DCI, but excellent predictor of functional outcome at 1 yr after aSAH

    Secondary use of Structured Electronic Health Records Data: From Observational Studies to Deep Learning-based Predictive Modeling

    Get PDF
    With the wide adoption of electronic health records (EHRs), researchers, as well as large healthcare organizations, governmental institutions, insurance, and pharmaceutical companies have been interested in leveraging this rich clinical data source to extract clinical evidence and develop predictive algorithms. Large vendors have been able to compile structured EHR data from sites all over the United States, de-identify these data, and make them available to data science researchers in a more usable format. For this dissertation, we leveraged one of the earliest and largest secondary EHR data sources and conducted three studies of increasing scope. In the first study, which was of limited scope, we conducted a retrospective observational study to compare the effect of three drugs on a specific population of approximately 3,000 patients. Using a novel statistical method, we found evidence that the selection of phenylephrine as the primary vasopressor to induce hypertension for the management of nontraumatic subarachnoid hemorrhage is associated with better outcomes as compared to selecting norepinephrine or dopamine. In the second study, we widened our scope, using a cohort of more than 100,000 patients to train generalizable models for the risk prediction of specific clinical events, such as heart failure in diabetes patients or pancreatic cancer. In this study, we found that recurrent neural network-based predictive models trained on expressive terminologies, which preserve a high level of granularity, are associated with better prediction performance as compared with other baseline methods, such as logistic regression. Finally, we widened our scope again, to train Med-BERT, a foundation model, on more than 20 million patients’ diagnosis data. Med-BERT was found to improve the prediction performance of downstream tasks that have a small sample size, which otherwise would limit the ability of the model to learn good representation. In conclusion, we found that we can extract useful information and train helpful deep learning-based predictive models. Given the limitations of secondary EHR data and taking into consideration that the data were originally collected for administrative and not research purposes, however, the findings need clinical validation. Therefore, clinical trials are warranted to further validate any new evidence extracted from such data sources before updating clinical practice guidelines. The implementability of the developed predictive models, which are in an early development phase, also warrants further evaluation
    corecore