3,855 research outputs found

    Random Forest-Based Prediction of Stroke Outcome

    Get PDF
    [Abstract] We research into the clinical, biochemical and neuroimaging factors associated with the outcome of stroke patients to generate a predictive model using machine learning techniques for prediction of mortality and morbidity 3-months after admission. The dataset consisted of patients with ischemic stroke (IS) and non-traumatic intracerebral hemorrhage (ICH) admitted to Stroke Unit of a European Tertiary Hospital prospectively registered. We identified the main variables for machine learning Random Forest (RF), generating a predictive model that can estimate patient mortality/morbidity according to the following groups: (1) ISโ€‰+โ€‰ICH, (2) IS, and (3) ICH. A total of 6022 patients were included: 4922 (mean age 71.9โ€‰ยฑโ€‰13.8 years) with IS and 1100 (mean age 73.3โ€‰ยฑโ€‰13.1 years) with ICH. NIHSS at 24, 48 h and axillary temperature at admission were the most important variables to consider for evolution of patients at 3-months. ISโ€‰+โ€‰ICH group was the most stable for mortality prediction [0.904โ€‰ยฑโ€‰0.025 of area under the receiver operating characteristics curve (AUC)]. IS group presented similar results, although variability between experiments was slightly higher (0.909โ€‰ยฑโ€‰0.032 of AUC). ICH group was the one in which RF had more problems to make adequate predictions (0.9837 vs. 0.7104 of AUC). There were no major differences between IS and ISโ€‰+โ€‰ICH groups according to morbidity prediction (0.738 and 0.755 of AUC) but, after checking normality with a Shapiro Wilk test with the null hypothesis that the data follow a normal distribution, it was rejected with Wโ€‰=โ€‰0.93546 (p-valueโ€‰<โ€‰2.2eโˆ’16). Conditions required for a parametric test do not hold, and we performed a paired Wilcoxon Test assuming the null hypothesis that all the groups have the same performance. The null hypothesis was rejected with a valueโ€‰<โ€‰2.2eโˆ’16, so there are statistical differences between IS and ICH groups. In conclusion, machine learning algorithms RF can be effectively used in stroke patients for long-term outcome prediction of mortality and morbidity.This study was partially supported by grants from the Spanish Ministry of Science and Innovation (SAF2017-84267-R), Xunta de Galicia (Axencia Galega de Innovaciรณn (GAIN): IN607A2018/3), Instituto de Salud Carlos III (ISCIII) (PI17/00540, PI17/01103), Spanish Research Network on Cerebrovascular Diseases RETICS-INVICTUS PLUS (RD16/0019) and by the European Union FEDER program. T. Sobrino (CPII17/00027), F. Campos (CPII19/00020) are recipients of research contracts from the Miguel Servet Program (Instituto de Salud Carlos III). General Directorate of Culture, Education and University Management of Xunta de Galicia (ED431G/01,252 ED431D 2017/16), โ€œGalician Network for Colorectal Cancer Research" (Ref. ED431D 2017/23), Competitive Reference Groups (ED431C 2018/49), Spanish Ministry of Economy and Competitiveness via funding of the unique installation BIOCAI (UNLC08-1E-002, UNLC13-13โ€“3503), European Regional Development Funds (FEDER).Xunta de Galicia; IN607A2018/3Xunta de Galicia; ED431G/01,252Xunta de Galicia; ED431D 2017/1

    Prediction of atrial fibrillation and stroke using machine learning models in UK Biobank

    Get PDF
    Objective: Atrial fibrillation (AF) is the most common cardiac arrythmia, and it is associated with increased risk for ischemic stroke, which is underestimated, as AF can be asymptomatic. The aim of this study was to develop optimal ML models for prediction of AF in the population, and secondly for ischemic stroke in AF patients. Methods: To develop ML models for prediction of 1) AF in the general population and 2) ischemic stroke in patients with AF we constructed XGBoost, LightGBM, Random Forest, Deep Neural Network, Support Vector Machine and Lasso penalised logistic regression models using UK-Biobank's extensive real-world clinical data, questionnaires, as well as biochemical and genetic data, and their predictive performances were compared. Ranking and contribution of the different features was assessed by SHapley Additive exPlanations (SHAP) analysis. The clinical tool CHA2DS2-VASc for prediction of ischemic stroke among AF patients, was used for comparison to the best performing ML model. Findings: The best performing model for AF prediction was LightGBM, with an area-under-the-roc-curve (AUROC) of 0.729 (95% confidence intervals (CI): 0.719, 0.738). The best performing model for ischemic stroke prediction in AF patients was XGBoost with AUROC of 0.631 (95% CI: 0.604, 0.657). The improved AUROC in the XGBoost model compared to CHA2DS2-VASc was statistically significant based on DeLong's test (p-valueย =ย 2.20E-06). In addition, the SHAP analysis showed that several peripheral blood biomarkers (e.g. creatinine, glycated haemoglobin, monocytes) were associated with ischemic stroke, which are not considered by CHA2DS2-VASc. Implications: The best performing ML models presented have the potential for clinical use, but further validation in independent studies is required. Our results endorse the incorporation of some routinely measured blood biomarkers for ischemic stroke prediction in AF patients

    ๋”ฅ๋Ÿฌ๋‹ ๊ธฐ๋ฐ˜ ์ƒ์กด๋ถ„์„์ด ์ ์šฉ๋œ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์œ„ํ—˜ ํ‰๊ฐ€ ๋ชจ๋ธ ์„ฑ๋Šฅ ํ–ฅ์ƒ์„ ์œ„ํ•œ ์ฝ•์Šค ๋ชจํ˜•๊ณผ ๊ฒฐํ•ฉ๋œ ํ•˜์ด๋ธŒ๋ฆฌ๋“œ ์ ‘๊ทผ๋ฒ•: ํ—ฌ์Šค์ผ€์–ด-ํ™˜๊ฒฝ ์—ฐ๊ณ„ ๋ฐ์ดํ„ฐ ํ™œ์šฉ ์—ฐ๊ตฌ

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ (๋ฐ•์‚ฌ) -- ์„œ์šธ๋Œ€ํ•™๊ต ๋Œ€ํ•™์› : ์˜๊ณผ๋Œ€ํ•™ ์˜๊ณผํ•™๊ณผ, 2020. 8. ๋ฐ•์ƒ๋ฏผ .Background and aims: The contribution of different cardiovascular disease (CVD) risk factors for the risk evaluation and predictive modeling for incident CVD is often debated. Also, to what extent data on CVD risk factors from multiple data categories should be collected for comprehensive risk assessment and predictive modeling for CVD risk using survival analysis is uncertain despite the increasing availability of the relevant data sources. This study aimed to evaluate the contribution of different data categories derived from integrated data on healthcare and environmental exposure to the risk evaluation and prediction models for CVD risk using deep learning based survival analysis in combination with Cox proportional hazards regression and Cox proportional hazards regression. Methods: Information on the comprehensive list of CVD risk factors were collected from systematic reviews of variables included in the conventional CVD risk assessment tools and observational studies from medical literature database (PubMed and Embase). Each risk factor was screened for availability in the National Health Insurance Service-National Sample Cohort (NHIS-NSC) linked to environmental exposure data on cumulative particulate matter and urban green space using residential area code. Individual records of 137,249 patients more than 40 years of age who underwent the biennial national health screening between 2009 and 2010 without previous history of CVD were followed up for incident CVD event from January 1, 2011 to December 31, 2013 in the NHIS-NSC with data linkage to environmental exposure. Statistics-based variable selection methods were implemented as follows: statistical significance, subset with the minimum (best) Akaike Information Criteria (AIC), variables selected from the regularized Cox proportional hazards regression with elastic net penalty, and finally a variable set that commonly meets all the criteria from the abovementioned statistical methods. Prediction models using Cox proportional hazards deep neural network (DeepSurv) and Cox proportional hazards regression were constructed in the training set (80% of the total sample) using input feature sets selected from the abovementioned strategies and progressively adding input features by data categories to examine the relative contribution of each data type to the predictive performance for CVD risk. Performance evaluations of the DeepSurv and Cox proportional hazards regression models for CVD risk were conducted in the test set (20% of the total sample) with Unos concordance statistics (C-index), which is the most up-to-date evaluation metrics for the survival models with right censored data. Results: After the comprehensive review, data synthesis, and availability check, a total of 31 risk factors in the categories of sociodemographic, clinical laboratory test and measurement, lifestyle behavior, family history, underlying medical conditions, dental health, medication, and environmental exposure were identified in the NHIS-NSC linked to environmental exposure data. Among the models constructed with different variable selection methods, using statistically significant variables for DeepSurv (Unos C-index: 0.7069) and all of the variables for Cox proportional hazards regression (Unos C-index: 0.7052) showed improved predictive performance for CVD risk, which was a statistically significant increase (p-value for difference in Unos C-index: <0.0001 for both comparisons) compared to the models with basic clinical factors (age, sex, and body mass index), respectively. When all and statistically significant variables in each data category from sociodemographic to environmental exposure were progressively added as input features into DeepSurv and Cox proportional hazards regression for predictive modeling for CVD risk, the DeepSurv model with statistically significant variables pertaining to the sociodemographic factors, clinical laboratory test and measurement, and lifestyle behavior data showed the notable performance that outperformed Cox proportional hazards regression model with statistically significant variables added up to the medication category. Extensive data linkage to environmental exposure on cumulative particulate matter and urban green space offered only marginal improvement for the predictive performance of DeepSurv and Cox proportional hazards regression models for CVD risk. Conclusion: To obtain the best predictive performance of DeepSurv model for CVD risk with minimum number of input features, information on sociodemographic, clinical laboratory test and measurement, and lifestyle behavior should be primarily collected and used as input features in the NHIS-NSC. Also, the overall performance of DeepSurv for CVD risk assessment was improved with a hybrid approach using statistically significant variables from Cox proportional hazards regression as input features. When all the data categories in the NHIS-NSC linked to environmental exposure data are available, progressively adding variables in each data category could incrementally increase the predictive performance of DeepSurv model for CVD risk with the hybrid approach. Data linkage to the environmental exposure with residential area code in the NHIS-NSC offered marginally improved performance for CVD risk in both DeepSurv model with the hybrid approach and Cox proportional hazards regression model.๋ฐฐ๊ฒฝ ๋ฐ ๋ชฉ์ : ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์œ„ํ—˜ํ‰๊ฐ€ ๋ฐ ์˜ˆ์ธก๋ชจ๋ธ๋ง์—์„œ ๋‹ค์–‘ํ•œ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์œ„ํ—˜์ธ์ž๋“ค์˜ ๋ชจ๋ธ ์„ฑ๋Šฅํ–ฅ์ƒ์— ๋Œ€ํ•œ ๊ธฐ์—ฌ๋„๋Š” ๋…ผ๋ž€์˜ ์š”์ง€๋กœ ๋ณด๊ณ ๋˜์–ด์™”๋‹ค. ๋˜ํ•œ, ์ง€์†์ ์œผ๋กœ ์ฆ๊ฐ€ํ•˜๋Š” ํ™œ์šฉ ๊ฐ€๋Šฅํ•œ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ๊ด€๋ จ ๋ฐ์ดํ„ฐ์˜ ์ข…๋ฅ˜์™€ ์–‘์—๋„ ๋ถˆ๊ตฌํ•˜๊ณ  ํฌ๊ด„์ ์ธ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์œ„ํ—˜ํ‰๊ฐ€์™€ ์ตœ์ ์˜ ์˜ˆ์ธก ๋ชจํ˜• ๊ฐœ๋ฐœ์„ ์œ„ํ•ด ๋ฐ์ดํ„ฐ๋ฅผ ์–ด๋Š ๋ฒ”์œ„์™€ ์ˆ˜์ค€๊นŒ์ง€ ์ˆ˜์ง‘ํ•ด์•ผ ํ•˜๋Š”์ง€์— ๋Œ€ํ•œ ๊ทผ๊ฑฐ๋Š” ๋ถ€์กฑํ•œ ํ˜„ํ™ฉ์ด๋‹ค. ๋ณธ ์—ฐ๊ตฌ์—์„œ๋Š” ์ฝ•์Šค ๋ชจํ˜•๊ณผ ๊ฒฐํ•ฉ๋œ ๋”ฅ๋Ÿฌ๋‹ ๊ธฐ๋ฐ˜ ์ƒ์กด๋ถ„์„ ์ ‘๊ทผ๋ฒ• ๋ฐ ์ฝ•์Šค ๋ชจํ˜•์„ ํ™œ์šฉํ•œ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์œ„ํ—˜ํ‰๊ฐ€์™€ ์˜ˆ์ธก๋ชจ๋ธ๋ง์—์„œ ํ—ฌ์Šค์ผ€์–ด-ํ™˜๊ฒฝ ์—ฐ๊ณ„ ๋ฐ์ดํ„ฐ ํ™œ์šฉ๋ฐฉ๋ฒ• ๋ฐ ๋ฒ”์ฃผ์— ๋”ฐ๋ฅธ ๋ชจ๋ธ ์„ฑ๋Šฅํ–ฅ์ƒ์— ๋Œ€ํ•œ ๊ธฐ์—ฌ๋„๋ฅผ ํ‰๊ฐ€ํ•˜๊ณ ์ž ํ•˜์˜€๋‹ค. ์—ฐ๊ตฌ ๋ฐฉ๋ฒ•: ์ „ํ†ต์  ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์œ„ํ—˜ ํ‰๊ฐ€ ๋„๊ตฌ ๋ฐ ๊ด€์ฐฐ ์—ฐ๊ตฌ๋“ค์— ํฌํ•จ ๋œ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์œ„ํ—˜์š”์ธ ๊ด€๋ จ ๋ณ€์ˆ˜๋“ค์„ ์ฒด๊ณ„์  ๋ฌธํ—Œ๊ณ ์ฐฐ ๋ฐฉ๋ฒ•๋ก ์„ ํ™œ์šฉํ•˜์—ฌ ์˜ํ•™์—ฐ๊ตฌ ๋ฌธํ—Œ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค (PubMed and Embase)์—์„œ ํฌ๊ด„์ ์œผ๋กœ ์ •๋ณด๋ฅผ ์ˆ˜์ง‘ํ•˜์˜€๋‹ค. ๋ฏธ์„ธ๋จผ์ง€ ๋ˆ„์ ์žฅ๊ธฐ๋…ธ์ถœ ๋ฐ ๋„์‹œ๋…น์ง€๋ฉด์ ์— ๋Œ€ํ•œ ํ™˜๊ฒฝ ๋…ธ์ถœ ๋ฐ์ดํ„ฐ์™€ ์—ฐ๊ณ„ ๋œ ๊ตญ๋ฏผ๊ฑด๊ฐ•๋ณดํ—˜๊ณต๋‹จ ํ‘œ๋ณธ์ฝ”ํ˜ธํŠธ, (National Health Insurance Service-National Sample Cohort, NHIS-NSC)์—์„œ ๊ฐ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์œ„ํ—˜์ธ์ž๋“ค์˜ ๋ฐ์ดํ„ฐ ํ™•๋ณด ๊ฐ€๋Šฅ์„ฑ์„ ๊ฒ€ํ† ํ•˜์˜€๋‹ค. NHIS-NSC๋ฅผ ๊ธฐ์ค€์œผ๋กœ 2009๋…„์—์„œ 2010๋…„ ์‚ฌ์ด์— ๊ตญ๊ฐ€๊ฑด๊ฐ•๊ฒ€์ง„์„ ๋ฐ›์€ 40์„ธ ์ด์ƒ ๋Œ€์ƒ์ž ์ค‘ ๊ณผ๊ฑฐ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ๋ณ‘๋ ฅ์ด ์—†๋Š” ๋Œ€์ƒ์ž 137,249๋ช…์˜ ํ™˜์ž์— ๋Œ€ํ•œ ์ •๋ณด๋ฅผ ์ˆ˜์ง‘ํ•˜์—ฌ 2011 ๋…„ 1 ์›” 1 ์ผ๋ถ€ํ„ฐ 2013 ๋…„ 12 ์›” 31 ์ผ๊นŒ์ง€ ์‹ ๊ทœ ๋ฐœ์ƒํ•œ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜์— ๋Œ€ํ•ด ์‹œ๊ฐ„ ๊ฒฝ๊ณผ์— ๋”ฐ๋ผ ์ถ”์  ์กฐ์‚ฌํ•˜์˜€๋‹ค. ํ†ต๊ณ„ ๊ธฐ๋ฐ˜ ๋ณ€์ˆ˜์„ ํƒ ๋ฐฉ๋ฒ•์€ ์ฝ•์Šค๋น„๋ก€์œ„ํ—˜๋ชจํ˜•์—์„œ ํ†ต๊ณ„์  ์œ ์˜์„ฑ, ์ตœ์†Œ (์ตœ์ƒ์˜) Akaike Information Criteria (AIC)์˜ ํ•˜์œ„ ์ง‘ํ•ฉ, elastic net penalty๋กœ ์ •๊ทœํ™” ๋œ ์ฝ•์Šค๋น„๋ก€์œ„ํ—˜๋ชจํ˜•์—์„œ ์„ ํƒ๋œ ๋ณ€์ˆ˜ ๋ฐ ์œ„์— ์–ธ๊ธ‰๋œ ๋ชจ๋“  ๊ธฐ์ค€์„ ์ถฉ์กฑํ•˜๋Š” ๋ณ€์ˆ˜ ์„ธํŠธ๋กœ ์„ ์ •ํ•˜์˜€๋‹ค. ์œ„์— ๋ช…์‹œ๋œ ํ†ต๊ณ„์  ๋ฐฉ๋ฒ• ์™ธ ๋ชจ๋“  ๋ฐ์ดํ„ฐ ๋ฒ”์ฃผ์— ์†ํ•œ ๋ณ€์ˆ˜ ๋ฐ ์ฝ•์Šค๋น„๋ก€์œ„ํ—˜๋ชจํ˜•์—์„œ ํ†ต๊ณ„์ ์œผ๋กœ ์œ ์˜๋ฏธํ•œ ๋ณ€์ˆ˜ (ํ•˜์ด๋ธŒ๋ฆฌ๋“œ ์ ‘๊ทผ๋ฒ•)๋ฅผ ์ ์ง„์ ์œผ๋กœ ์ž…๋ ฅ ํ”ผ์ณ๋กœ ์ถ”๊ฐ€ํ•˜๋Š” ์ „๋žต์œผ๋กœ ๋”ฅ๋Ÿฌ๋‹ ๊ธฐ๋ฐ˜ ์ƒ์กด๋ถ„์„ (Cox proportional hazards deep neural network, DeepSurv) ๋ฐ ์ฝ•์Šค๋น„๋ก€์œ„ํ—˜๋ชจํ˜•์—์„œ ์˜ˆ์ธก ๋ชจ๋ธ๋“ค์„ ํ›ˆ๋ จ ์„ธํŠธ (์ „์ฒด ์ƒ˜ํ”Œ์˜ 80 %)๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ๊ฐœ๋ฐœํ•˜์˜€๋‹ค. DeepSurv ๋ฐ ์ฝ•์Šค๋น„๋ก€ ์œ„ํ—˜๋ชจํ˜•์„ ํ™œ์šฉํ•œ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์˜ˆ์ธก ๋ชจ๋ธ์˜ ์„ฑ๋Šฅํ‰๊ฐ€๋Š” ์ƒ์กด๋ถ„์„์„ ํ™œ์šฉํ•œ ์˜ˆ์ธก ๋ชจ๋ธ๋ง์— ๊ฐ€์žฅ ์ ํ•ฉํ•œ ํ‰๊ฐ€์ง€ํ‘œ๋กœ ์•Œ๋ ค์ง„ Unos concordance statistics (C-index)๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ํ…Œ์ŠคํŠธ ์„ธํŠธ (์ด ์ƒ˜ํ”Œ์˜ 20 %)์—์„œ ์ˆ˜ํ–‰ํ•˜์˜€๋‹ค. ๊ฒฐ๊ณผ: ์ฒด๊ณ„์  ๋ฌธํ—Œ๊ณ ์ฐฐ, ๋ฐ์ดํ„ฐ ์ทจํ•ฉ ๋ฐ ์ถ”์ถœ ๊ฐ€๋Šฅ์„ฑ ๊ฒ€ํ†  ํ›„, ์ธ๊ตฌ์‚ฌํšŒํ•™์  ์š”์ธ, ๊ฑด๊ฐ•๊ฒ€์ง„ ๋ฐ ์ธก์ • ๊ฒฐ๊ณผ, ์ƒํ™œ์Šต๊ด€, ๊ฐ€์กฑ๋ ฅ, ๊ฑด๊ฐ•์ƒํƒœ, ๊ตฌ๊ฐ•๊ฑด๊ฐ•, ์•ฝ๋ฌผ ๋ฐ ํ™˜๊ฒฝ ๋…ธ์ถœ ๋ฐ์ดํ„ฐ ๋ฒ”์ฃผ์—์„œ ์ด 31 ๊ฐœ์˜ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์œ„ํ—˜์ธ์ž๊ฐ€ ์ง€์—ญํ™˜๊ฒฝ ์ž๋ฃŒ์™€ ์—ฐ๊ณ„๋œ NHIS-NSC์—์„œ ํ™•์ธ๋˜์—ˆ๋‹ค. ํ†ต๊ณ„ ๊ธฐ๋ฐ˜ ๋ณ€์ˆ˜์„ ํƒ ๋ฐฉ๋ฒ•์œผ๋กœ ๊ฐœ๋ฐœํ•œ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์˜ˆ์ธก ๋ชจ๋ธ ์ค‘ ์ฝ•์Šค๋น„๋ก€์œ„ํ—˜๋ชจํ˜•์—์„œ ํ†ต๊ณ„์ ์œผ๋กœ ์œ ์˜๋ฏธํ•œ ๋ณ€์ˆ˜๋ฅผ DeepSurv์— ์ ์šฉํ•œ ํ•˜์ด๋ธŒ๋ฆฌ๋“œ ์ ‘๊ทผ๋ฒ•์ด Uno 's C-index ๊ฐ’ 0.7069, ๋ชจ๋“  ๋ณ€์ˆ˜๋ฅผ ์ฝ•์Šค๋น„๋ก€์œ„ํ—˜๋ชจํ˜•์— ์ ์šฉํ•œ ์ฝ•์Šค๋น„๋ก€์œ„ํ—˜๋ชจํ˜•์ด Uno 's C-index ๊ฐ’ 0.7052๋กœ ๋‚˜ํƒ€๋‚˜ ๊ธฐ๋ณธ ์ž„์ƒ ์š”์ธ (์—ฐ๋ น, ์„ฑ๋ณ„ ๋ฐ ์ฒด์งˆ๋Ÿ‰์ง€์ˆ˜)์ด ํฌํ•จ๋œ ์˜ˆ์ธก ๋ชจ๋ธ๊ณผ ๋น„๊ตํ•˜์—ฌ ํ†ต๊ณ„์ ์œผ๋กœ ์œ ์˜๋ฏธํ•œ ๋ชจ๋ธ ์˜ˆ์ธก๋ ฅ ์ฆ๊ฐ€๋ฅผ ๋ณด์˜€๋‹ค (๋‘ ๋ชจ๋ธ ๋ชจ๋‘ Unos C-index ์ฐจ์ด์— ๋Œ€ํ•œ p-value : <0.0001). ์ธ๊ตฌ์‚ฌํšŒํ•™์  ํŠน์„ฑ์—์„œ ํ™˜๊ฒฝ ๋…ธ์ถœ์— ์ด๋ฅด๊ธฐ๊นŒ์ง€ ๊ฐ ๋ฐ์ดํ„ฐ ๋ฒ”์ฃผ์—์„œ ๋ชจ๋‘ ํ†ต๊ณ„์ ์œผ๋กœ ์œ ์˜๋ฏธํ•œ ๋ณ€์ˆ˜๋“ค์ด ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์˜ˆ์ธก ๋ชจ๋ธ๋ง์„์œ„ํ•œ DeepSurv ๋ฐ Cox ๋น„๋ก€ ์œ„ํ—˜ ํšŒ๊ท€์— ์ž…๋ ฅ ํ”ผ์ณ๋กœ ์ ์ง„์ ์œผ๋กœ ์ถ”๊ฐ€ ๋œ ๊ฒฝ์šฐ, ์ธ๊ตฌ์‚ฌํšŒํ•™์  ์š”์ธ, ๊ฑด๊ฐ•๊ฒ€์ง„ ๋ฐ ์ธก์ • ๊ฒฐ๊ณผ, ์ƒํ™œ์Šต๊ด€ ์š”์ธ ์ค‘ ํ†ต๊ณ„์ ์œผ๋กœ ์œ ์˜๋ฏธํ•œ ๋ณ€์ˆ˜๋“ค๋กœ ๊ตฌ์„ฑ๋œ DeepSurv ๋ชจ๋ธ์ด ์˜์•ฝํ’ˆ ์‚ฌ์šฉ๊นŒ์ง€ ๊ณ ๋ คํ•œ Cox ๋น„๋ก€ ์œ„ํ—˜ ํšŒ๊ท€๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ํ•œ ๋ชจ๋ธ ๋ณด๋‹ค ๋›ฐ์–ด๋‚œ ์„ฑ๋Šฅ์„ ๋‚˜ํƒ€๋ƒˆ๋‹ค. ๋ฏธ์„ธ๋จผ์ง€ ๋ฐ ๋„์‹œ๋…น์ง€๋ฉด์ ์— ๋Œ€ํ•œ ํ™˜๊ฒฝ ๋…ธ์ถœ ๋ฐ์ดํ„ฐ๋ฅผ ๊ฑฐ์ฃผ์ง€๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ NHIS-NSC์™€ ์—ฐ๊ณ„ ํ›„ ์ ์ง„์ ์œผ๋กœ ์ž…๋ ฅ ํ”ผ์ณ๋กœ ์ถ”๊ฐ€ ์‹œ DeepSurv ๋ฐ ์ฝ•์Šค๋น„๋ก€์œ„ํ—˜๋ชจํ˜•์„ ํ™œ์šฉํ•œ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์˜ˆ์ธก ๋ชจ๋ธ๋ง ์„ฑ๋Šฅ์„ ํ†ต๊ณ„์ ์œผ๋กœ ์œ ์˜๋ฏธํ•œ ์ˆ˜์ค€์œผ๋กœ ๊ฐœ์„ ํ•˜์ง€ ๋ชปํ–ˆ๋‹ค. ๊ฒฐ๋ก : ์ตœ์†Œ ์ž…๋ ฅ ํ”ผ์ณ๋ฅผ ๊ฐ–์ถ˜ ์ƒ์กด ๋ถ„์„ ๊ธฐ๋ฐ˜ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์˜ˆ์ธก ๋ชจ๋ธ์—์„œ ์ตœ์ƒ์˜ ์„ฑ๋Šฅ์„ ์–ป์œผ๋ ค๋ฉด ์ธ๊ตฌ์‚ฌํšŒํ•™์ , ๊ฑด๊ฐ•๊ฒ€์ง„ ๋ฐ ์ธก์ • ๊ฒฐ๊ณผ, ๋ฐ ์ƒํ™œ์Šต๊ด€์— ๋Œ€ํ•œ ์ •๋ณด๋ฅผ NHIS-NSC์—์„œ ์ˆ˜์ง‘ํ•˜์—ฌ DeepSurv์˜ ์ž…๋ ฅ ํ”ผ์ณ๋กœ ํ™œ์šฉํ•ด์•ผํ•œ๋‹ค. ์ง€์—ญํ™˜๊ฒฝ ์ž๋ฃŒ์™€ ์—ฐ๊ณ„๋œ NHIS-NSC์—์„œ ๋ชจ๋“  ๋ฐ์ดํ„ฐ ๋ฒ”์ฃผ๋ฅผ ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ์„ ๋•Œ ์ ์ง„์ ์œผ๋กœ ๊ฐ ๋ฐ์ดํ„ฐ ๋ฒ”์ฃผ ์ค‘ ์ฝ•์Šค๋น„๋ก€์œ„ํ—˜๋ชจํ˜•์—์„œ ํ†ต๊ณ„์ ์œผ๋กœ ์œ ์˜๋ฏธํ•œ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์œ„ํ—˜์ธ์ž๋ฅผ ์ ์ง„์ ์œผ๋กœ ์ž…๋ ฅ ํ”ผ์ณ๋กœ DeepSurv ๋ชจ๋ธ์— ์ถ”๊ฐ€ํ•˜๋Š” ํ•˜์ด๋ธŒ๋ฆฌ๋“œ ์ ‘๊ทผ๋ฒ•์—์„œ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์˜ˆ์ธก ๋ชจ๋ธ๋ง ์„ฑ๋Šฅ์ด ์ ์ฐจ ํ–ฅ์ƒ ๋  ๊ฒƒ์œผ๋กœ ๊ธฐ๋Œ€ํ•  ์ˆ˜ ์žˆ๋‹ค. ์ฃผ๊ฑฐ ์ง€์—ญ ์ฝ”๋“œ๋ฅผ ์‚ฌ์šฉํ•œ NHIS-NSC์™€ ํ™˜๊ฒฝ ๋…ธ์ถœ ๋ฐ์ดํ„ฐ ์—ฐ๊ณ„๋Š” DeepSurv ๋ฐ ์ฝ•์Šค๋น„๋ก€์œ„ํ—˜๋ชจํ˜• ๋ชจ๋‘์—์„œ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์˜ˆ์ธก ๋ชจ๋ธ๋ง ์„ฑ๋Šฅ์ด ํ–ฅ์ƒ๋˜์—ˆ์ง€๋งŒ ํ†ต๊ณ„์ ์œผ๋กœ ์œ ์˜๋ฏธํ•œ ์ฆ๊ฐ€ ์ˆ˜์ค€์€ ์•„๋‹Œ ๊ฒƒ์œผ๋กœ ๋‚˜ํƒ€๋‚˜ ํ™˜๊ฒฝ ๋…ธ์ถœ ๋ฐ์ดํ„ฐ ์—ฐ๊ณ„ ๋ฐ ์ ์šฉ ์‹œ ๊ฒ€ํ† ๊ฐ€ ํ•„์š”ํ•  ๊ฒƒ์œผ๋กœ ์ถ”์ •๋œ๋‹ค.I. Introduction 1 1. Background 1 2. Research problem 4 3. Hypothesis and objective 6 3.1. Hypothesis 6 3.2. Objective 6 II. Materials and methods 8 1. Comprehensive review and identification of cardiovascular disease (CVD) risk factors 8 1.1. Systematic review on variables included in conventional CVD risk assessment tools 8 1.2. Systematic review on traditional and emerging CVD risk factors from observational studies 9 1.3. Integration of the comprehensive list of CVD risk factors 11 1.4. Screening for data availability 11 2. Cohort analysis for measuring strength of association between risk factors and incident cardiovascular disease 11 2.1 Study population and linkage to environmental exposure data 11 2.2. Variable selection and data processing 15 2.3. Population-based cohort analysis 17 3. Predictive modeling using survival analysis: DeepSurv and Cox proportional hazards regression 17 3.1. Model development 17 3.2. Evaluation of the predictive performance of the models 20 III. Results 21 1. Identification and categorization of cardiovascular disease risk factors 21 2. Magnitude of association between selected risk factors with cardiovascular disease 43 3. Model performance evaluation 56 VI. Discussion 68 1. Key findings and contributions 68 2. Comparison to other studies 69 3. Strengths and limitations 73 4. Implications 74 5. Future perspectives 75 V. Conclusion 77 Reference 78 ๊ตญ๋ฌธ์ดˆ๋ก 88Docto

    Retinal Artery Occlusion and Cardiovascular Disease:Risk Factors, Potential Pathophysiology, and Prognosis

    Get PDF

    Prediction and monitoring of in-hospital cardiac arrest

    Get PDF
    Background: In-hospital cardiac arrest (IHCA) is a global health concern of major importance, associated with a poor prognosis. IHCA is frequently heralded by a deterioration of vital signs, and many cases are considered preventable. Hence, prevention has become a key strategy. The overall aim of this thesis was to study the prevention of IHCA, by means of prediction and monitoring, with a view to improve patient safety. Methods: Study I and III are observational cohort studies, based on the Swedish Registry of Cardiopulmonary Resuscitation (SRCR). In study III, we also collected additional data from medical records in a small, hypothesis-generating group of patients. Study II and IV are prospective, observational cohort studies based on patients reviewed by Rapid Response Teams (RRTs) in 26 and 24 Swedish hospitals, respectively. In study IV, additional data on long-term survival was obtained from either medical records or the personal information directory, containing population registration data. Specific study aims and results: In study I, we investigated how 30-day survival after IHCA was influenced by ECG monitoring at the time of collapse, as well as clinical factors that determined whether patients were ECG monitored adjacent to cardiac arrest (CA). In all, 24,790 patients were enrolled in the SRCR between 2008 and 2017. After applying the exclusion criteria, 19,225 patients remained, of which 52% were monitored at the time of collapse. In all, 30-day survival was 30%. ECG monitoring at the time of CA was associated with a Hazard Ratio of 0.62 (95% Confidence Interval 0.60-0.64) for 30-day mortality. The strongest predictor of ECG monitoring adjacent to IHCA was location in hospital. There were tangible variations in the frequency of patients who were ECG monitored at the time of collapse between Swedish regions and across hospitals. In study II, we investigated the predictive power of NEWS 2, as compared to NEWS, in identifying patients at risk of Serious Adverse Events (SAEs) within 24 hours of an RRT-review. In all, 1,065 patients, reviewed by RRTs in general wards during the study period between October 2019 and January 2020, were included. After applying the exclusion criteria, 898 patients were eligible for complete case analyses. In all, 37% of the patients were admitted to the Intensive care unit (ICU) within 24 hours of RRT-review. In-hospital mortality and IHCA were uncommon (6% and 1% respectively). The Area Under the Receiver Operating Characteristic (AUROC) for both NEWS and NEWS 2 was 0.62 for the composite outcome, and 0.69/0.67 for mortality. Regarding the outcome unanticipated ICU admission, the AUROC was 0.59 and 0.60, respectively, while the AUROC for IHCA was 0.51 (NEWS) and 0.47 (NEWS 2), respectively. In study III, we investigated 30-day survival and ROSC in patients suffering from IHCA, who were reviewed by an RRT within 24 hours prior to the CA, as compared to those without such review. Furthermore, we studied patient centred factors prior to RRT activation, the timeliness of the RRT-review as well as the reason for the RRT-review. We also investigated the association between RRT interventions and outcome. During the study period between 2014 and 2021, 19,973 patents were enrolled in the SRCR. After applying the exclusion criteria, 12,915 patients remained. Among these IHCA patients, there was an RRT/ICU contact within 24 hours prior to the CA in 2,058 cases (19%). The adjusted 30-day survival was lower among patients reviewed by an RRT prior to IHCA (25% vs. 33%, p <0.001). Regarding ROSC, we did not observe any difference between the groups. The propensity score based Odds Ratio for 30- day survival was 0.92 for patients who were reviewed by an RRT (95% CI 0.90 to 0.94, p <0.001), as compared to those who were not RRT- reviewed within 24 hours prior to IHCA. A respiratory cause of CA was more common among IHCA patients who were reviewed by an RRT. In the small, explorative subgroup (n=82), 24% of the RRT activations were delayed, and respiratory distress was the most common RRT trigger. We observed a significantly lower 30-day survival among patients triaged to remain at ward compared to those triaged to a higher level of care (2% vs. 20%, p 0.016). In study IV, we explored the impact of age on the ability of NEWS 2 to predict IHCA, unanticipated ICU-admission, or death, and the composite of these three SAEs, within 24 hours of review by an RRT. Furthermore, we aimed to investigate 30-, 90- and 180-day mortality, and the discriminative ability of NEWS 2 in the prediction of long-term mortality among RRT-reviewed patients. In this multi-centre study based on data prospectively collected by RRTs, the NEWS 2 scores of all patients were retrospectively, digitally calculated by the study team. Age was analysed as a continuous variable, in a spline regression model, and categorized into five different models, subsequently explored as additive variables to NEWS 2. The discriminative ability of NEWS 2 in predicting 30-day mortality improved by adding age as a covariate (from AUROC 0.66, 0.62-0.70 to 0.70, 0.65-0.73, p=0.01). There were differences across age groups, with the best predictive performance identified among patients aged 45-54 years. The 30-, 90-and 180-day mortality was 31%, 33%, and 36%, respectively. Conclusion: ECG monitoring at the time of IHCA was associated with a 38% reduction of adjusted mortality. Despite this finding, only one in two IHCA patients were ECG monitored. The most important factor influencing ECG monitoring was which type of hospital ward the patient was admitted to. The tangible variations in the frequency of ECG monitoring adjacent to IHCA observed between Swedish regions and across hospitals need to be investigated in future studies. Guidelines for the monitoring of patients at risk of CA could contribute to an improved outcome. The prognostic accuracy of NEWS 2 in predicting mortality within 24 hours of an RRT-review was acceptable, whereas the discriminative ability in prediction of unanticipated ICU-admission and the composite outcome was rather weak. Regarding the prediction of IHCA, NEWS 2 performed poorly. There was no difference in the prognostic accuracy between NEWS and NEWS 2; however, the discriminative ability was not considered sufficient to serve as a triage tool in RRT-reviewed patients. In-hospital cardiac arrest among patients who were reviewed by an RRT prior to CA was associated with a poorer prognosis, and a more frequent respiratory aetiology of the CA. In the explorative sub-group of patients, RRT activation was frequently delayed, the most common trigger for RRT-review was respiratory distress, and escalation of the level of care was associated with an improved prognosis. Early identification of patients with abnormal respiratory vital signs, followed by a timely response, may have a potential to improve the prognosis for patients reviewed by an RRT and prevent IHCA. Adding age as a covariate improved the discriminative ability of NEWS 2 in the prediction of 30-day mortality among RRT-reviewed patients. The ability differed across age categories. Overall, the long-term prognosis of RRT-reviewed patients was poor. Our results indicate that age merits further validation as a covariate to improve the performance of NEWS 2

    Bleeding complications following acute myocardial infarction : time trends, risk assessment and associated prognosis

    Get PDF
    Background: In patients with acute myocardial infarction (MI), bleeding complications are common and associated with worse prognosis. This thesis aimed to investigate the epidemiology, risk assessment and associated outcomes of bleeding complications in patients with acute MI. Methods and results: Study I: Patients with acute MI enrolled in the SWEDEHEART registry from 1995โ€“2018 were included (n=371 431). The incidence of in-hospital and out-of-hospital bleeding at one-year was investigated parallel to treatment changes and ischemic outcomes. From 1995 to 2018, in-hospital bleeding increased from 0.5% to 1.3% and out-of-hospital bleeding increased from 2.5% to 4.8% along with increased use of invasive revascularisation and more efficient antithrombotic treatment. Meanwhile in-hospital and out-of-hospital ischemic outcomes decreased from 12.1% to 5.6% and 27.5% to 15.1%, respectively. Study II: Patients with acute MI enrolled in the SWEDEHEART registry from 2009โ€“ 2014 were included (n=97 597). A prediction model for in-hospital bleeding was created using logistic regression and the performance was compared to that of the CRUSADE and ACTION scores. Due to miscalibration, the CRUSADE and ACTION scores were recalibrated. The SWEDEHEART score, consisting of five baseline variables (haemoglobin, age, sex, creatinine, and C-reactive protein) plus one interaction term (haemoglobin and sex) had a C-index of 0.80 as compared with 0.72 and 0.73 for the recalibrated CRUSADE and ACTION scores, respectively. Study III: Patients with acute MI enrolled in the SWEDEHEART registry from 2007โ€“2016 and discharged alive on any antithrombotic treatment were included (n=149 447). The incidence, associated outcomes and predictors of upper gastrointestinal bleeding (UGIB) was investigated. The incidence of UGIB within one year after discharge was 1.5% and experiencing UGIB was associated with increased risk of mortality and stroke, but not significantly associated with MI. Using both logistic regression and machine-learning models, new potential predictors of UGIB were found, such as smoking status and blood glucose. Study IV: Patients with acute MI enrolled in the SWEDEHEART registry and discharged alive on any antithrombotic treatment from 2012โ€“2017 were included (n=86 736). The incidence and associated mortality risk of ischemic (MI or ischemic stroke) and bleeding events was investigated. Within one year after discharge, the incidence rate of ischemic and bleeding events was 5.7/100 person years and 4.8/100 person years, respectively. Both ischemic and bleeding events were associated with higher risk of mortality as compared with no event, with adjusted hazard ratios (HR)s of 4.16 (95% CI 3.91 to 4.43) and 3.43 (95% CI 3.17 to 3.71), respectively. In a direct comparison of ischemic vs bleeding event, the adjusted HR was 1.27 (95% CI 1.15 to 1.40.) Conclusion: In the past two decades, the incidence of both short- and long-term bleeding events has nearly doubled in patients with acute MI. The five-item SWEDEHEART score predicts inhospital bleeding in patients with acute MI more accurately than the recalibrated CRUSADE and ACTION scores. Among patients with a recent MI, upper gastrointestinal bleeding is common and associated with poorer prognosis. Beyond the known risk factors for bleeding, other predictors for upper gastrointestinal bleeding may be present. In patients discharged after an acute MI, ischemic events were more common and associated with higher risk of mortality than bleeding events

    Artificial Intelligence Model Predicts Sudden Cardiac Arrest Manifesting With Pulseless Electric Activity Versus Ventricular Fibrillation

    Get PDF
    BACKGROUND: There is no specific treatment for sudden cardiac arrest (SCA) manifesting as pulseless electric activity (PEA) and survival rates are low; unlike ventricular fibrillation (VF), which is treatable by defibrillation. Development of novel treatments requires fundamental clinical studies, but access to the true initial rhythm has been a limiting factor. METHODS: Using demographics and detailed clinical variables, we trained and tested an AI model (extreme gradient boosting) to differentiate PEA-SCA versus VF-SCA in a novel setting that provided the true initial rhythm. A subgroup of SCAs are witnessed by emergency medical services personnel, and because the response time is zero, the true SCA initial rhythm is recorded. The internal cohort consisted of 421 emergency medical services-witnessed out-of-hospital SCAs with PEA or VF as the initial rhythm in the Portland, Oregon metropolitan area. External validation was performed in 220 emergency medical services-witnessed SCAs from Ventura, CA. RESULTS: In the internal cohort, the artificial intelligence model achieved an area under the receiver operating characteristic curve of 0.68 (95% CI, 0.61-0.76). Model performance was similar in the external cohort, achieving an area under the receiver operating characteristic curve of 0.72 (95% CI, 0.59-0.84). Anemia, older age, increased weight, and dyspnea as a warning symptom were the most important features of PEA-SCA; younger age, chest pain as a warning symptom and established coronary artery disease were important features associated with VF. CONCLUSIONS: The artificial intelligence model identified novel features of PEA-SCA, differentiated from VF-SCA and was successfully replicated in an external cohort. These findings enhance the mechanistic understanding of PEA-SCA with potential implications for developing novel management strategies
    • โ€ฆ
    corecore