2,555 research outputs found

    ๋”ฅ๋Ÿฌ๋‹ ๊ธฐ๋ฐ˜ ์ƒ์กด๋ถ„์„์ด ์ ์šฉ๋œ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์œ„ํ—˜ ํ‰๊ฐ€ ๋ชจ๋ธ ์„ฑ๋Šฅ ํ–ฅ์ƒ์„ ์œ„ํ•œ ์ฝ•์Šค ๋ชจํ˜•๊ณผ ๊ฒฐํ•ฉ๋œ ํ•˜์ด๋ธŒ๋ฆฌ๋“œ ์ ‘๊ทผ๋ฒ•: ํ—ฌ์Šค์ผ€์–ด-ํ™˜๊ฒฝ ์—ฐ๊ณ„ ๋ฐ์ดํ„ฐ ํ™œ์šฉ ์—ฐ๊ตฌ

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ (๋ฐ•์‚ฌ) -- ์„œ์šธ๋Œ€ํ•™๊ต ๋Œ€ํ•™์› : ์˜๊ณผ๋Œ€ํ•™ ์˜๊ณผํ•™๊ณผ, 2020. 8. ๋ฐ•์ƒ๋ฏผ .Background and aims: The contribution of different cardiovascular disease (CVD) risk factors for the risk evaluation and predictive modeling for incident CVD is often debated. Also, to what extent data on CVD risk factors from multiple data categories should be collected for comprehensive risk assessment and predictive modeling for CVD risk using survival analysis is uncertain despite the increasing availability of the relevant data sources. This study aimed to evaluate the contribution of different data categories derived from integrated data on healthcare and environmental exposure to the risk evaluation and prediction models for CVD risk using deep learning based survival analysis in combination with Cox proportional hazards regression and Cox proportional hazards regression. Methods: Information on the comprehensive list of CVD risk factors were collected from systematic reviews of variables included in the conventional CVD risk assessment tools and observational studies from medical literature database (PubMed and Embase). Each risk factor was screened for availability in the National Health Insurance Service-National Sample Cohort (NHIS-NSC) linked to environmental exposure data on cumulative particulate matter and urban green space using residential area code. Individual records of 137,249 patients more than 40 years of age who underwent the biennial national health screening between 2009 and 2010 without previous history of CVD were followed up for incident CVD event from January 1, 2011 to December 31, 2013 in the NHIS-NSC with data linkage to environmental exposure. Statistics-based variable selection methods were implemented as follows: statistical significance, subset with the minimum (best) Akaike Information Criteria (AIC), variables selected from the regularized Cox proportional hazards regression with elastic net penalty, and finally a variable set that commonly meets all the criteria from the abovementioned statistical methods. Prediction models using Cox proportional hazards deep neural network (DeepSurv) and Cox proportional hazards regression were constructed in the training set (80% of the total sample) using input feature sets selected from the abovementioned strategies and progressively adding input features by data categories to examine the relative contribution of each data type to the predictive performance for CVD risk. Performance evaluations of the DeepSurv and Cox proportional hazards regression models for CVD risk were conducted in the test set (20% of the total sample) with Unos concordance statistics (C-index), which is the most up-to-date evaluation metrics for the survival models with right censored data. Results: After the comprehensive review, data synthesis, and availability check, a total of 31 risk factors in the categories of sociodemographic, clinical laboratory test and measurement, lifestyle behavior, family history, underlying medical conditions, dental health, medication, and environmental exposure were identified in the NHIS-NSC linked to environmental exposure data. Among the models constructed with different variable selection methods, using statistically significant variables for DeepSurv (Unos C-index: 0.7069) and all of the variables for Cox proportional hazards regression (Unos C-index: 0.7052) showed improved predictive performance for CVD risk, which was a statistically significant increase (p-value for difference in Unos C-index: <0.0001 for both comparisons) compared to the models with basic clinical factors (age, sex, and body mass index), respectively. When all and statistically significant variables in each data category from sociodemographic to environmental exposure were progressively added as input features into DeepSurv and Cox proportional hazards regression for predictive modeling for CVD risk, the DeepSurv model with statistically significant variables pertaining to the sociodemographic factors, clinical laboratory test and measurement, and lifestyle behavior data showed the notable performance that outperformed Cox proportional hazards regression model with statistically significant variables added up to the medication category. Extensive data linkage to environmental exposure on cumulative particulate matter and urban green space offered only marginal improvement for the predictive performance of DeepSurv and Cox proportional hazards regression models for CVD risk. Conclusion: To obtain the best predictive performance of DeepSurv model for CVD risk with minimum number of input features, information on sociodemographic, clinical laboratory test and measurement, and lifestyle behavior should be primarily collected and used as input features in the NHIS-NSC. Also, the overall performance of DeepSurv for CVD risk assessment was improved with a hybrid approach using statistically significant variables from Cox proportional hazards regression as input features. When all the data categories in the NHIS-NSC linked to environmental exposure data are available, progressively adding variables in each data category could incrementally increase the predictive performance of DeepSurv model for CVD risk with the hybrid approach. Data linkage to the environmental exposure with residential area code in the NHIS-NSC offered marginally improved performance for CVD risk in both DeepSurv model with the hybrid approach and Cox proportional hazards regression model.๋ฐฐ๊ฒฝ ๋ฐ ๋ชฉ์ : ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์œ„ํ—˜ํ‰๊ฐ€ ๋ฐ ์˜ˆ์ธก๋ชจ๋ธ๋ง์—์„œ ๋‹ค์–‘ํ•œ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์œ„ํ—˜์ธ์ž๋“ค์˜ ๋ชจ๋ธ ์„ฑ๋Šฅํ–ฅ์ƒ์— ๋Œ€ํ•œ ๊ธฐ์—ฌ๋„๋Š” ๋…ผ๋ž€์˜ ์š”์ง€๋กœ ๋ณด๊ณ ๋˜์–ด์™”๋‹ค. ๋˜ํ•œ, ์ง€์†์ ์œผ๋กœ ์ฆ๊ฐ€ํ•˜๋Š” ํ™œ์šฉ ๊ฐ€๋Šฅํ•œ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ๊ด€๋ จ ๋ฐ์ดํ„ฐ์˜ ์ข…๋ฅ˜์™€ ์–‘์—๋„ ๋ถˆ๊ตฌํ•˜๊ณ  ํฌ๊ด„์ ์ธ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์œ„ํ—˜ํ‰๊ฐ€์™€ ์ตœ์ ์˜ ์˜ˆ์ธก ๋ชจํ˜• ๊ฐœ๋ฐœ์„ ์œ„ํ•ด ๋ฐ์ดํ„ฐ๋ฅผ ์–ด๋Š ๋ฒ”์œ„์™€ ์ˆ˜์ค€๊นŒ์ง€ ์ˆ˜์ง‘ํ•ด์•ผ ํ•˜๋Š”์ง€์— ๋Œ€ํ•œ ๊ทผ๊ฑฐ๋Š” ๋ถ€์กฑํ•œ ํ˜„ํ™ฉ์ด๋‹ค. ๋ณธ ์—ฐ๊ตฌ์—์„œ๋Š” ์ฝ•์Šค ๋ชจํ˜•๊ณผ ๊ฒฐํ•ฉ๋œ ๋”ฅ๋Ÿฌ๋‹ ๊ธฐ๋ฐ˜ ์ƒ์กด๋ถ„์„ ์ ‘๊ทผ๋ฒ• ๋ฐ ์ฝ•์Šค ๋ชจํ˜•์„ ํ™œ์šฉํ•œ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์œ„ํ—˜ํ‰๊ฐ€์™€ ์˜ˆ์ธก๋ชจ๋ธ๋ง์—์„œ ํ—ฌ์Šค์ผ€์–ด-ํ™˜๊ฒฝ ์—ฐ๊ณ„ ๋ฐ์ดํ„ฐ ํ™œ์šฉ๋ฐฉ๋ฒ• ๋ฐ ๋ฒ”์ฃผ์— ๋”ฐ๋ฅธ ๋ชจ๋ธ ์„ฑ๋Šฅํ–ฅ์ƒ์— ๋Œ€ํ•œ ๊ธฐ์—ฌ๋„๋ฅผ ํ‰๊ฐ€ํ•˜๊ณ ์ž ํ•˜์˜€๋‹ค. ์—ฐ๊ตฌ ๋ฐฉ๋ฒ•: ์ „ํ†ต์  ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์œ„ํ—˜ ํ‰๊ฐ€ ๋„๊ตฌ ๋ฐ ๊ด€์ฐฐ ์—ฐ๊ตฌ๋“ค์— ํฌํ•จ ๋œ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์œ„ํ—˜์š”์ธ ๊ด€๋ จ ๋ณ€์ˆ˜๋“ค์„ ์ฒด๊ณ„์  ๋ฌธํ—Œ๊ณ ์ฐฐ ๋ฐฉ๋ฒ•๋ก ์„ ํ™œ์šฉํ•˜์—ฌ ์˜ํ•™์—ฐ๊ตฌ ๋ฌธํ—Œ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค (PubMed and Embase)์—์„œ ํฌ๊ด„์ ์œผ๋กœ ์ •๋ณด๋ฅผ ์ˆ˜์ง‘ํ•˜์˜€๋‹ค. ๋ฏธ์„ธ๋จผ์ง€ ๋ˆ„์ ์žฅ๊ธฐ๋…ธ์ถœ ๋ฐ ๋„์‹œ๋…น์ง€๋ฉด์ ์— ๋Œ€ํ•œ ํ™˜๊ฒฝ ๋…ธ์ถœ ๋ฐ์ดํ„ฐ์™€ ์—ฐ๊ณ„ ๋œ ๊ตญ๋ฏผ๊ฑด๊ฐ•๋ณดํ—˜๊ณต๋‹จ ํ‘œ๋ณธ์ฝ”ํ˜ธํŠธ, (National Health Insurance Service-National Sample Cohort, NHIS-NSC)์—์„œ ๊ฐ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์œ„ํ—˜์ธ์ž๋“ค์˜ ๋ฐ์ดํ„ฐ ํ™•๋ณด ๊ฐ€๋Šฅ์„ฑ์„ ๊ฒ€ํ† ํ•˜์˜€๋‹ค. NHIS-NSC๋ฅผ ๊ธฐ์ค€์œผ๋กœ 2009๋…„์—์„œ 2010๋…„ ์‚ฌ์ด์— ๊ตญ๊ฐ€๊ฑด๊ฐ•๊ฒ€์ง„์„ ๋ฐ›์€ 40์„ธ ์ด์ƒ ๋Œ€์ƒ์ž ์ค‘ ๊ณผ๊ฑฐ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ๋ณ‘๋ ฅ์ด ์—†๋Š” ๋Œ€์ƒ์ž 137,249๋ช…์˜ ํ™˜์ž์— ๋Œ€ํ•œ ์ •๋ณด๋ฅผ ์ˆ˜์ง‘ํ•˜์—ฌ 2011 ๋…„ 1 ์›” 1 ์ผ๋ถ€ํ„ฐ 2013 ๋…„ 12 ์›” 31 ์ผ๊นŒ์ง€ ์‹ ๊ทœ ๋ฐœ์ƒํ•œ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜์— ๋Œ€ํ•ด ์‹œ๊ฐ„ ๊ฒฝ๊ณผ์— ๋”ฐ๋ผ ์ถ”์  ์กฐ์‚ฌํ•˜์˜€๋‹ค. ํ†ต๊ณ„ ๊ธฐ๋ฐ˜ ๋ณ€์ˆ˜์„ ํƒ ๋ฐฉ๋ฒ•์€ ์ฝ•์Šค๋น„๋ก€์œ„ํ—˜๋ชจํ˜•์—์„œ ํ†ต๊ณ„์  ์œ ์˜์„ฑ, ์ตœ์†Œ (์ตœ์ƒ์˜) Akaike Information Criteria (AIC)์˜ ํ•˜์œ„ ์ง‘ํ•ฉ, elastic net penalty๋กœ ์ •๊ทœํ™” ๋œ ์ฝ•์Šค๋น„๋ก€์œ„ํ—˜๋ชจํ˜•์—์„œ ์„ ํƒ๋œ ๋ณ€์ˆ˜ ๋ฐ ์œ„์— ์–ธ๊ธ‰๋œ ๋ชจ๋“  ๊ธฐ์ค€์„ ์ถฉ์กฑํ•˜๋Š” ๋ณ€์ˆ˜ ์„ธํŠธ๋กœ ์„ ์ •ํ•˜์˜€๋‹ค. ์œ„์— ๋ช…์‹œ๋œ ํ†ต๊ณ„์  ๋ฐฉ๋ฒ• ์™ธ ๋ชจ๋“  ๋ฐ์ดํ„ฐ ๋ฒ”์ฃผ์— ์†ํ•œ ๋ณ€์ˆ˜ ๋ฐ ์ฝ•์Šค๋น„๋ก€์œ„ํ—˜๋ชจํ˜•์—์„œ ํ†ต๊ณ„์ ์œผ๋กœ ์œ ์˜๋ฏธํ•œ ๋ณ€์ˆ˜ (ํ•˜์ด๋ธŒ๋ฆฌ๋“œ ์ ‘๊ทผ๋ฒ•)๋ฅผ ์ ์ง„์ ์œผ๋กœ ์ž…๋ ฅ ํ”ผ์ณ๋กœ ์ถ”๊ฐ€ํ•˜๋Š” ์ „๋žต์œผ๋กœ ๋”ฅ๋Ÿฌ๋‹ ๊ธฐ๋ฐ˜ ์ƒ์กด๋ถ„์„ (Cox proportional hazards deep neural network, DeepSurv) ๋ฐ ์ฝ•์Šค๋น„๋ก€์œ„ํ—˜๋ชจํ˜•์—์„œ ์˜ˆ์ธก ๋ชจ๋ธ๋“ค์„ ํ›ˆ๋ จ ์„ธํŠธ (์ „์ฒด ์ƒ˜ํ”Œ์˜ 80 %)๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ๊ฐœ๋ฐœํ•˜์˜€๋‹ค. DeepSurv ๋ฐ ์ฝ•์Šค๋น„๋ก€ ์œ„ํ—˜๋ชจํ˜•์„ ํ™œ์šฉํ•œ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์˜ˆ์ธก ๋ชจ๋ธ์˜ ์„ฑ๋Šฅํ‰๊ฐ€๋Š” ์ƒ์กด๋ถ„์„์„ ํ™œ์šฉํ•œ ์˜ˆ์ธก ๋ชจ๋ธ๋ง์— ๊ฐ€์žฅ ์ ํ•ฉํ•œ ํ‰๊ฐ€์ง€ํ‘œ๋กœ ์•Œ๋ ค์ง„ Unos concordance statistics (C-index)๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ํ…Œ์ŠคํŠธ ์„ธํŠธ (์ด ์ƒ˜ํ”Œ์˜ 20 %)์—์„œ ์ˆ˜ํ–‰ํ•˜์˜€๋‹ค. ๊ฒฐ๊ณผ: ์ฒด๊ณ„์  ๋ฌธํ—Œ๊ณ ์ฐฐ, ๋ฐ์ดํ„ฐ ์ทจํ•ฉ ๋ฐ ์ถ”์ถœ ๊ฐ€๋Šฅ์„ฑ ๊ฒ€ํ†  ํ›„, ์ธ๊ตฌ์‚ฌํšŒํ•™์  ์š”์ธ, ๊ฑด๊ฐ•๊ฒ€์ง„ ๋ฐ ์ธก์ • ๊ฒฐ๊ณผ, ์ƒํ™œ์Šต๊ด€, ๊ฐ€์กฑ๋ ฅ, ๊ฑด๊ฐ•์ƒํƒœ, ๊ตฌ๊ฐ•๊ฑด๊ฐ•, ์•ฝ๋ฌผ ๋ฐ ํ™˜๊ฒฝ ๋…ธ์ถœ ๋ฐ์ดํ„ฐ ๋ฒ”์ฃผ์—์„œ ์ด 31 ๊ฐœ์˜ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์œ„ํ—˜์ธ์ž๊ฐ€ ์ง€์—ญํ™˜๊ฒฝ ์ž๋ฃŒ์™€ ์—ฐ๊ณ„๋œ NHIS-NSC์—์„œ ํ™•์ธ๋˜์—ˆ๋‹ค. ํ†ต๊ณ„ ๊ธฐ๋ฐ˜ ๋ณ€์ˆ˜์„ ํƒ ๋ฐฉ๋ฒ•์œผ๋กœ ๊ฐœ๋ฐœํ•œ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์˜ˆ์ธก ๋ชจ๋ธ ์ค‘ ์ฝ•์Šค๋น„๋ก€์œ„ํ—˜๋ชจํ˜•์—์„œ ํ†ต๊ณ„์ ์œผ๋กœ ์œ ์˜๋ฏธํ•œ ๋ณ€์ˆ˜๋ฅผ DeepSurv์— ์ ์šฉํ•œ ํ•˜์ด๋ธŒ๋ฆฌ๋“œ ์ ‘๊ทผ๋ฒ•์ด Uno 's C-index ๊ฐ’ 0.7069, ๋ชจ๋“  ๋ณ€์ˆ˜๋ฅผ ์ฝ•์Šค๋น„๋ก€์œ„ํ—˜๋ชจํ˜•์— ์ ์šฉํ•œ ์ฝ•์Šค๋น„๋ก€์œ„ํ—˜๋ชจํ˜•์ด Uno 's C-index ๊ฐ’ 0.7052๋กœ ๋‚˜ํƒ€๋‚˜ ๊ธฐ๋ณธ ์ž„์ƒ ์š”์ธ (์—ฐ๋ น, ์„ฑ๋ณ„ ๋ฐ ์ฒด์งˆ๋Ÿ‰์ง€์ˆ˜)์ด ํฌํ•จ๋œ ์˜ˆ์ธก ๋ชจ๋ธ๊ณผ ๋น„๊ตํ•˜์—ฌ ํ†ต๊ณ„์ ์œผ๋กœ ์œ ์˜๋ฏธํ•œ ๋ชจ๋ธ ์˜ˆ์ธก๋ ฅ ์ฆ๊ฐ€๋ฅผ ๋ณด์˜€๋‹ค (๋‘ ๋ชจ๋ธ ๋ชจ๋‘ Unos C-index ์ฐจ์ด์— ๋Œ€ํ•œ p-value : <0.0001). ์ธ๊ตฌ์‚ฌํšŒํ•™์  ํŠน์„ฑ์—์„œ ํ™˜๊ฒฝ ๋…ธ์ถœ์— ์ด๋ฅด๊ธฐ๊นŒ์ง€ ๊ฐ ๋ฐ์ดํ„ฐ ๋ฒ”์ฃผ์—์„œ ๋ชจ๋‘ ํ†ต๊ณ„์ ์œผ๋กœ ์œ ์˜๋ฏธํ•œ ๋ณ€์ˆ˜๋“ค์ด ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์˜ˆ์ธก ๋ชจ๋ธ๋ง์„์œ„ํ•œ DeepSurv ๋ฐ Cox ๋น„๋ก€ ์œ„ํ—˜ ํšŒ๊ท€์— ์ž…๋ ฅ ํ”ผ์ณ๋กœ ์ ์ง„์ ์œผ๋กœ ์ถ”๊ฐ€ ๋œ ๊ฒฝ์šฐ, ์ธ๊ตฌ์‚ฌํšŒํ•™์  ์š”์ธ, ๊ฑด๊ฐ•๊ฒ€์ง„ ๋ฐ ์ธก์ • ๊ฒฐ๊ณผ, ์ƒํ™œ์Šต๊ด€ ์š”์ธ ์ค‘ ํ†ต๊ณ„์ ์œผ๋กœ ์œ ์˜๋ฏธํ•œ ๋ณ€์ˆ˜๋“ค๋กœ ๊ตฌ์„ฑ๋œ DeepSurv ๋ชจ๋ธ์ด ์˜์•ฝํ’ˆ ์‚ฌ์šฉ๊นŒ์ง€ ๊ณ ๋ คํ•œ Cox ๋น„๋ก€ ์œ„ํ—˜ ํšŒ๊ท€๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ํ•œ ๋ชจ๋ธ ๋ณด๋‹ค ๋›ฐ์–ด๋‚œ ์„ฑ๋Šฅ์„ ๋‚˜ํƒ€๋ƒˆ๋‹ค. ๋ฏธ์„ธ๋จผ์ง€ ๋ฐ ๋„์‹œ๋…น์ง€๋ฉด์ ์— ๋Œ€ํ•œ ํ™˜๊ฒฝ ๋…ธ์ถœ ๋ฐ์ดํ„ฐ๋ฅผ ๊ฑฐ์ฃผ์ง€๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ NHIS-NSC์™€ ์—ฐ๊ณ„ ํ›„ ์ ์ง„์ ์œผ๋กœ ์ž…๋ ฅ ํ”ผ์ณ๋กœ ์ถ”๊ฐ€ ์‹œ DeepSurv ๋ฐ ์ฝ•์Šค๋น„๋ก€์œ„ํ—˜๋ชจํ˜•์„ ํ™œ์šฉํ•œ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์˜ˆ์ธก ๋ชจ๋ธ๋ง ์„ฑ๋Šฅ์„ ํ†ต๊ณ„์ ์œผ๋กœ ์œ ์˜๋ฏธํ•œ ์ˆ˜์ค€์œผ๋กœ ๊ฐœ์„ ํ•˜์ง€ ๋ชปํ–ˆ๋‹ค. ๊ฒฐ๋ก : ์ตœ์†Œ ์ž…๋ ฅ ํ”ผ์ณ๋ฅผ ๊ฐ–์ถ˜ ์ƒ์กด ๋ถ„์„ ๊ธฐ๋ฐ˜ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์˜ˆ์ธก ๋ชจ๋ธ์—์„œ ์ตœ์ƒ์˜ ์„ฑ๋Šฅ์„ ์–ป์œผ๋ ค๋ฉด ์ธ๊ตฌ์‚ฌํšŒํ•™์ , ๊ฑด๊ฐ•๊ฒ€์ง„ ๋ฐ ์ธก์ • ๊ฒฐ๊ณผ, ๋ฐ ์ƒํ™œ์Šต๊ด€์— ๋Œ€ํ•œ ์ •๋ณด๋ฅผ NHIS-NSC์—์„œ ์ˆ˜์ง‘ํ•˜์—ฌ DeepSurv์˜ ์ž…๋ ฅ ํ”ผ์ณ๋กœ ํ™œ์šฉํ•ด์•ผํ•œ๋‹ค. ์ง€์—ญํ™˜๊ฒฝ ์ž๋ฃŒ์™€ ์—ฐ๊ณ„๋œ NHIS-NSC์—์„œ ๋ชจ๋“  ๋ฐ์ดํ„ฐ ๋ฒ”์ฃผ๋ฅผ ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ์„ ๋•Œ ์ ์ง„์ ์œผ๋กœ ๊ฐ ๋ฐ์ดํ„ฐ ๋ฒ”์ฃผ ์ค‘ ์ฝ•์Šค๋น„๋ก€์œ„ํ—˜๋ชจํ˜•์—์„œ ํ†ต๊ณ„์ ์œผ๋กœ ์œ ์˜๋ฏธํ•œ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์œ„ํ—˜์ธ์ž๋ฅผ ์ ์ง„์ ์œผ๋กœ ์ž…๋ ฅ ํ”ผ์ณ๋กœ DeepSurv ๋ชจ๋ธ์— ์ถ”๊ฐ€ํ•˜๋Š” ํ•˜์ด๋ธŒ๋ฆฌ๋“œ ์ ‘๊ทผ๋ฒ•์—์„œ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์˜ˆ์ธก ๋ชจ๋ธ๋ง ์„ฑ๋Šฅ์ด ์ ์ฐจ ํ–ฅ์ƒ ๋  ๊ฒƒ์œผ๋กœ ๊ธฐ๋Œ€ํ•  ์ˆ˜ ์žˆ๋‹ค. ์ฃผ๊ฑฐ ์ง€์—ญ ์ฝ”๋“œ๋ฅผ ์‚ฌ์šฉํ•œ NHIS-NSC์™€ ํ™˜๊ฒฝ ๋…ธ์ถœ ๋ฐ์ดํ„ฐ ์—ฐ๊ณ„๋Š” DeepSurv ๋ฐ ์ฝ•์Šค๋น„๋ก€์œ„ํ—˜๋ชจํ˜• ๋ชจ๋‘์—์„œ ์‹ฌํ˜ˆ๊ด€์งˆํ™˜ ์˜ˆ์ธก ๋ชจ๋ธ๋ง ์„ฑ๋Šฅ์ด ํ–ฅ์ƒ๋˜์—ˆ์ง€๋งŒ ํ†ต๊ณ„์ ์œผ๋กœ ์œ ์˜๋ฏธํ•œ ์ฆ๊ฐ€ ์ˆ˜์ค€์€ ์•„๋‹Œ ๊ฒƒ์œผ๋กœ ๋‚˜ํƒ€๋‚˜ ํ™˜๊ฒฝ ๋…ธ์ถœ ๋ฐ์ดํ„ฐ ์—ฐ๊ณ„ ๋ฐ ์ ์šฉ ์‹œ ๊ฒ€ํ† ๊ฐ€ ํ•„์š”ํ•  ๊ฒƒ์œผ๋กœ ์ถ”์ •๋œ๋‹ค.I. Introduction 1 1. Background 1 2. Research problem 4 3. Hypothesis and objective 6 3.1. Hypothesis 6 3.2. Objective 6 II. Materials and methods 8 1. Comprehensive review and identification of cardiovascular disease (CVD) risk factors 8 1.1. Systematic review on variables included in conventional CVD risk assessment tools 8 1.2. Systematic review on traditional and emerging CVD risk factors from observational studies 9 1.3. Integration of the comprehensive list of CVD risk factors 11 1.4. Screening for data availability 11 2. Cohort analysis for measuring strength of association between risk factors and incident cardiovascular disease 11 2.1 Study population and linkage to environmental exposure data 11 2.2. Variable selection and data processing 15 2.3. Population-based cohort analysis 17 3. Predictive modeling using survival analysis: DeepSurv and Cox proportional hazards regression 17 3.1. Model development 17 3.2. Evaluation of the predictive performance of the models 20 III. Results 21 1. Identification and categorization of cardiovascular disease risk factors 21 2. Magnitude of association between selected risk factors with cardiovascular disease 43 3. Model performance evaluation 56 VI. Discussion 68 1. Key findings and contributions 68 2. Comparison to other studies 69 3. Strengths and limitations 73 4. Implications 74 5. Future perspectives 75 V. Conclusion 77 Reference 78 ๊ตญ๋ฌธ์ดˆ๋ก 88Docto

    Finance, growth, and public policy

    Get PDF
    Development economists have long argued that modern financial markets are important to growth and that financial repression is a serious obstacle to progress in many developing countries. The authors consider the relationship between finance and growth and the appropriate role of government policy. Many economists have stressed how problems of asymmetric information and contract enforcement impede the functioning of financial markets in developing countries. In addition, they try to elaborate on these theories to make them relevant to policymakers. Information gaps and enforcement frictions introduce a premium in the cost of external funds. Factors such as the borrower's financial health, the efficiency of financial intermediation, and the ease of enforcing private financial contracts govern the size of this premium. How financial factors contribute to development may be understood along these lines. Financial contracts and institutions should be designed to minimize this premium.Banks&Banking Reform,Financial Intermediation,Environmental Economics&Policies,Economic Theory&Research,Health Economics&Finance

    ๋‹ค์ˆ˜์ค€ ์ž์› ๋ถ„ํฌ์™€ ๊ฑด๊ฐ• ๊ด€๋ จ ์‚ถ์˜ ์งˆ

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ(๋ฐ•์‚ฌ) -- ์„œ์šธ๋Œ€ํ•™๊ต๋Œ€ํ•™์› : ๋ณด๊ฑด๋Œ€ํ•™์› ๋ณด๊ฑดํ•™๊ณผ(๋ณด๊ฑดํ•™์ „๊ณต), 2022. 8. ์กฐ์„ฑ์ผ.์—ฐ๊ตฌ ๋ฐฐ๊ฒฝ ์ตœ๊ทผ ์ธ๊ตฌ๊ตฌ์กฐ์˜ ๊ธ‰๊ฒฉํ•œ ๊ณ ๋ นํ™”์™€ ๊ฒฝ์ œ ์ˆ˜์ค€์˜ ๋ฐœ์ „์— ๋”ฐ๋ผ ๊ฑด๊ฐ•ํŒจ๋Ÿฌ๋‹ค์ž„์€ ์ƒ๋ช… ์—ฐ์žฅ๊ณผ ๊ฐ™์€ ๋‹จ์ˆœํ•œ ์–‘์  ์ง€ํ‘œ๋ฟ๋งŒ ์•„๋‹ˆ๋ผ ๊ฑด๊ฐ•๊ด€๋ จ ์‚ถ์˜ ์งˆ(health-related quality of life, HRQOL)๊ณผ ๊ฐ™์€ ์งˆ์  ์ง€ํ‘œ์— ์ฃผ๋ชฉํ•˜๊ณ  ์žˆ๋‹ค. ๊ทธ๋Ÿฌ๋‚˜ ๊ฒฝ์ œ ์ž์› ๋ฐ ์ธ์  ์ž์› ๋งŒ์œผ๋กœ๋Š” ํฌ๊ด„์ ์ด๊ณ  ๋‹ค์ธต์ ์ธ ์‚ถ์˜ ์งˆ์„ ์„ค๋ช…ํ•˜๋Š” ๋ฐ์— ํ•œ๊ณ„๊ฐ€ ์žˆ๋‹ค. ๋‹จ์ˆœํžˆ ๊ฐœ์ธ์ด ๋ณด์œ ํ•œ ์ž์›์˜ ์–‘ ๊ทธ ์ž์ฒด ๋ณด๋‹ค๋„, ์œ ์˜๋ฏธํ•œ ์ž์›์ด ์ƒ์„ฑ๋˜๋Š” ๊ธฐ์ „๊ณผ, ์‚ฌํšŒ์  ๊ด€๊ณ„๋ฅผ ํ†ตํ•œ ์ž์›์˜ ๋ถ„ํฌ ๋ฐ ์ž์›์˜ ํ™œ์šฉ๊ฐ€๋Šฅ์„ฑ์— ๋Œ€ํ•œ ํญ๋„“์€ ๊ณ ๋ ค๊ฐ€ ํ•„์š”ํ•˜๋‹ค. ์ด์— ๋ณธ ์—ฐ๊ตฌ์—์„œ๋Š” ์‚ฌํšŒ์  ์ž์›๊ณผ ์ฃผ๊ด€์  ์‚ฌํšŒ๊ณ„์ธต ์ธ์‹ ์ง€ํ‘œ๋ฅผ ํ™œ์šฉํ•˜์—ฌ ์ž์›์˜ ๊ฐœ๋…์„ ๋ณด๋‹ค ํญ ๋„“๊ฒŒ ์ดํ•ดํ•˜๋ฉฐ, ๊ฐ€๊ตฌ ๋ฐ ์ง€์—ญ์‚ฌํšŒ ์ˆ˜์ค€์—์„œ ๋‹ค์ธต์ ์ธ ์ž์›์˜ ๋ถ„ํฌ ๊ตฌ์กฐ๋ฅผ ํฌ์ฐฉํ•œ๋‹ค. ๋˜ํ•œ ๋‹ค์ธต์  ์ž์›์ด ์‹œ๊ฐ„๊ณผ ๊ณต๊ฐ„์— ๋”ฐ๋ผ ์–ด๋–ป๊ฒŒ ๋ถ„ํฌํ•˜๋Š”์ง€ ํ™•์ธํ•˜๋ฉฐ, ์ด์— ๋Œ€ํ•œ ๊ฑด๊ฐ• ์˜ํ–ฅ์„ ํŒŒ์•…ํ•˜๊ณ ์ž ํ•œ๋‹ค. ๊ฒฐ๊ณผ์ ์œผ๋กœ, ์‚ถ์˜ ์งˆ์— ๋Œ€ํ•œ ์‚ฌํšŒ์  ๊ฒฐ์ •์š”์ธ์˜ ์ดํ•ด์˜ ํ‹€์„ ํ™•์žฅ์‹œํ‚ค๊ณ , ์šฐ๋ฆฌ ์‚ฌํšŒ์— ๋‚ด์žฌ๋œ ๊ณ„์ธต ๊ธฐ๋ฐ˜์˜ ๋ถˆํ‰๋“ฑ ํ˜„ํ™ฉ์„ ํŒŒ์•…ํ•œ๋‹ค. ์ฃผ์š” ์—ฐ๊ตฌ๋ชฉ์ ์€ ์ฒซ์งธ, ๊ฐ€๊ตฌ ํ™˜๊ฒฝ ๋ฐ ๊ฐœ์ธ์˜ ์ž์›์ด ์ฃผ๊ด€์  ์‚ฌํšŒ๊ณ„์ธต ์ธ์‹์˜ ํ˜•์„ฑ์— ๋ฏธ์น˜๋Š” ์˜ํ–ฅ์„ ํŒŒ์•…ํ•˜๋ฉฐ, ๊ฐ€๊ตฌ์›๊ฐ„ ์ธ์‹ ์ฐจ์ด๋ฅผ ํŒŒ์•…ํ•จ์œผ๋กœ์จ ๊ฐ€๊ตฌ๋‚ด ์ž์›์˜ ๊ณต์œ ์˜ ๊ธฐ์ „์„ ํŒŒ์•…ํ•œ๋‹ค. ๋‘˜์งธ, ๊ฑด๊ฐ•๊ด€๋ จ ์‚ถ์˜ ์งˆ ๊ถค์ ์„ ํŒŒ์•…ํ•˜๊ณ , ์ฃผ๊ด€์  ์‚ฌํšŒ๊ณ„์ธต ์ธ์‹๊ณผ ๊ฐ๊ด€์  ์‚ฌํšŒ๊ฒฝ์ œ์  ์ˆ˜์ค€์— ๋”ฐ๋ฅธ ์‚ถ์˜ ์งˆ ๊ถค์ ์„ ํŒŒ์•…ํ•œ ํ›„, ๋‘ ๊ถค์ ์˜ ์—ฐ๊ด€์„ฑ์„ ๋ถ„์„ํ•จ์œผ๋กœ์จ ์‚ฌํšŒ๊ฒฝ์ œ์  ์ง€์œ„์˜ ์ข…๋‹จ์ ์ธ ๊ฑด๊ฐ•์˜ํ–ฅ์„ ํŒŒ์•…ํ•œ๋‹ค. ์…‹์งธ, ์ง€์—ญ์‚ฌํšŒ์˜ ๋‹ค์ฐจ์›์ ์ธ ์‚ฌํšŒ ์ž์›์˜ ๊ตฌ์„ฑ์ฒด๋ฅผ ์ •์˜ํ•˜๊ณ , ์ธ๊ตฌ์ง‘๋‹จ์˜ ๊ฑด๊ฐ•์— ์˜ํ–ฅ์„ ๋ฏธ์น˜๋Š” ์ฃผ์š”ํ•œ ์‚ฌํšŒ ์ž์›์„ ํŒŒ์•…ํ•œ๋‹ค. ๋„ท์งธ, ๊ฑด๊ฐ•๊ด€๋ จ ์‚ถ์˜ ์งˆ์˜ ๊ณต๊ฐ„ ์ƒ๊ด€์„ ํŒŒ์•…ํ•˜๊ณ , ์‚ฌํšŒ ์ž์›์ด ๊ฑด๊ฐ•์— ๋ฏธ์น˜๋Š” ํšจ๊ณผ๋ฅผ ๊ณต๊ฐ„์ ์ธ ๋น„์ •ํ˜•์„ฑ์„ ๊ธฐ๋ฐ˜์œผ๋กœ ํŒŒ์•…ํ•œ๋‹ค. ์—ฐ๊ตฌ ๋ฐฉ๋ฒ• ์ฒซ๋ฒˆ์งธ ์—ฐ๊ตฌ์—์„œ๋Š” ์ œ 8์ฐจ ํ•œ๊ตญ์˜๋ฃŒํŒจ๋„ ์ž๋ฃŒ(2013๋…„)๋ฅผ ํ™œ์šฉํ•ด 3,984 ๊ฐ€๊ตฌ์—์„œ 18์„ธ ์ด์ƒ ์„ฑ์ธ 8330๋ช…์„ ์—ฐ๊ตฌ๋Œ€์ƒ์œผ๋กœ ํ•˜์˜€๊ณ , ๋‘๋ฒˆ์งธ ์—ฐ๊ตฌ์—์„œ๋Š” 2009๋…„๋ถ€ํ„ฐ 2018๋…„๋„๊นŒ์ง€์˜ ํ•œ๊ตญ์˜๋ฃŒํŒจ๋„ (์ด10์ฐจ ์กฐ์‚ฌ) ์ž๋ฃŒ์˜ ๊ท ํ˜•ํŒจ๋„ ๋Œ€์ƒ์ž ์ž๋ฃŒ๋ฅผ ํ™œ์šฉํ•˜์˜€๋‹ค. ์„ธ๋ฒˆ์งธ์™€ ๋„ค๋ฒˆ์งธ ์—ฐ๊ตฌ๋Š” ์ง€์—ญ์‚ฌํšŒ ์ˆ˜์ค€์˜ ์—ฐ๊ตฌ๋กœ์„œ, ํ†ต๊ณ„์ฒญ(KOSIS)์˜ ๊ณต๊ฐœ์ž๋ฃŒ ๋ฐ ์ง€์—ญ์‚ฌํšŒ๊ฑด๊ฐ•์กฐ์‚ฌ ์ž๋ฃŒ๋ฅผ ํ™œ์šฉํ•˜์—ฌ 250๊ฐœ ์ง€์—ญ์‚ฌํšŒ ์ˆ˜์ค€์œผ๋กœ ๋‹ค์–‘ํ•œ ์‚ฌํšŒ ์ž์› ๋ณ€์ˆ˜๋ฅผ ๋ณ‘ํ•ฉํ•˜์˜€๋‹ค. ์ข…์†๋ณ€์ˆ˜์ธ ๊ฑด๊ฐ•๊ด€๋ จ ์‚ถ์˜ ์งˆ ์ง€์ˆ˜(HRQoL)๋Š” EQ-5D ์ง€ํ‘œ๋ฅผ ํ™œ์šฉํ•˜์—ฌ ํ•œ๊ตญ์ธ ๊ณ ์œ ์˜ ๊ฐ€์ค‘์น˜๋ฅผ ์ ์šฉ ํ›„ ์‚ฐ์ถœํ•˜์˜€๋‹ค. ์ฃผ๊ด€์  ์‚ฌํšŒ๊ณ„์ธต ์ธ์‹์˜ ์ง€ํ‘œ๋Š” MacArthur scale ์„ ์‚ฌ์šฉํ•˜์˜€๋‹ค. ์‚ฌํšŒ์ž๋ณธ์€ ์‚ฌํšŒ์  ์—ฐ๊ฒฐ๋ง, ์‹ ๋ขฐ, ์‚ฌํšŒ ์ฐธ์—ฌ๋กœ ๊ตฌ๋ถ„ํ•˜๊ณ , ๊ทธ ์™ธ์—๋„ ๋ฌธํ™”, ์ฒด์œก์‹œ์„ค, ๊ณต์›์˜ ์ˆ˜์™€ ๊ฐ™์€ ๋ฌธํ™” ์ž์›๊ณผ, ์˜์‚ฌ ์ˆ˜, ํ•„์ˆ˜์ง„๋ฃŒ๊ณผ ์˜์›, ๋ณ‘์›, ์š”์–‘๋ณ‘์› ์ˆ˜์™€ ๊ฐ™์€ ์˜๋ฃŒ ์ž์› ๋ฐ ์ง€์—ญ์‚ฌํšŒ์˜ ์‚ฌํšŒ๊ฒฝ์ œ์  ํ™˜๊ฒฝ ๋“ฑ์„ ๋ฐ˜์˜ํ•˜์˜€๋‹ค. ๋ถ„์„ ๋ฐฉ๋ฒ•์€ ์ฒซ๋ฒˆ์งธ ์—ฐ๊ตฌ์—์„œ ์ฃผ๊ด€์  ์‚ฌํšŒ๊ณ„์ธต ์ธ์‹์— ๋Œ€ํ•œ ๊ฐ€๊ตฌ์›๊ฐ„ ์‘๋‹ต์ผ์น˜๋„๋ฅผ ํ‰๊ฐ€ํ•˜๊ธฐ ์œ„ํ•˜์—ฌ ์ง‘๋‹จ ๋‚ด ์ƒ๊ด€๊ณ„์ˆ˜๋ฅผ ๊ตฌํ•˜์˜€๊ณ , ๋ถ„์‚ฐ ๋ถ„ํ•ด๋ฅผ ํ†ตํ•ด ๋ณ€์ˆ˜๋ณ„ ์ƒ๋Œ€์  ์ค‘์š”๋„๋ฅผ ๋น„๊ตํ•˜์˜€๋‹ค. ๋‘๋ฒˆ์งธ ์—ฐ๊ตฌ์—์„œ๋Š” ๊ทธ๋ฃน ๊ธฐ๋ฐ˜์˜ ๊ถค์  ๋ชจํ˜•(Group-based trajectory modeling, GBTM)์„ ์ ์šฉํ•˜์˜€์œผ๋ฉฐ, ํŠนํžˆ ๊ฐ๊ด€์ , ์ฃผ๊ด€์  ์ง€์œ„์˜ ๋ณ€ํ™” ํŒจํ„ด์„ ํ•œ๋ฒˆ์— ํฌ์ฐฉํ•˜๊ธฐ ์œ„ํ•˜์—ฌ 2๊ฐœ ๋ณ€์ˆ˜์˜ ๋ณ€ํ™” ํŒจํ„ด์„ ๋™์‹œ์— ํฌ์ง‘ํ•˜๋Š” ๋‹ค์ค‘ ๊ทธ๋ฃน ๊ธฐ๋ฐ˜ ๊ถค์  ๋ชจํ˜•(multi-GBTM)์„ ์ ์šฉํ•˜์˜€๋‹ค. ์„ธ๋ฒˆ์งธ ์—ฐ๊ตฌ์—์„œ๋Š” ์ฃผ์„ฑ๋ถ„ ๋ถ„์„ ๋ฐ ์ฃผ์„ฑ๋ถ„ ํšŒ๊ท€๋ถ„์„์„ ์‚ฌ์šฉํ•˜์˜€๋‹ค. ๋„ค๋ฒˆ์งธ ์—ฐ๊ตฌ์—์„œ๋Š” ์ง€๋ฆฌ์  ๊ฐ€์ค‘ํšŒ๊ท€๋ถ„์„(Geographically weighted regression, GWR)์„ ์ ์šฉํ•˜๊ณ  ๊ทธ ํšŒ๊ท€๊ณ„์ˆ˜์— ๋Œ€ํ•ด K-means ๊ตฐ์ง‘ ๋ถ„์„์„ ์‚ฌ์šฉํ•˜์˜€๋‹ค. ํ†ต๊ณ„ ํ”„๋กœ๊ทธ๋žจ์€ STATA 16, SAS ์†Œํ”„ํŠธ์›จ์–ด 9.4 ๋ฒ„์ „, R 4.1.3๋ฒ„์ „์„ ์ด์šฉํ•˜์˜€์œผ๋ฉฐ, ์ง€๋ฆฌ ๋ถ„์„ ์‹œ QGIS 3.24 ๋ฐ GeoDa 1.18.0 ํ”„๋กœ๊ทธ๋žจ์„ ๋ณด์กฐ์ ์œผ๋กœ ์ด์šฉํ•˜์˜€๋‹ค. ์—ฐ๊ตฌ ๊ฒฐ๊ณผ ๋ฒˆ์งธ ์—ฐ๊ตฌ์—์„œ, ์ฃผ๊ฑฐ์•ˆ์ •์„ฑ๊ณผ ๊ฐ™์€ ๊ฐ€๊ตฌ์˜ ๋ถ€์˜ ์ˆ˜์ค€์€ ์ฃผ๊ด€์  ๊ณ„์ธต ์ธ์‹ ํ•˜๋ฝ์— ๋Œ€ํ•˜์—ฌ ์ƒ๋‹นํ•œ ์™„์ถฉํšจ๊ณผ๋ฅผ ๊ฐ€์ง€๊ณ  ์žˆ์œผ๋‚˜, ๊ฐ€๊ตฌ ๋‚ด์—์„œ ์„œ๋กœ ์ž์›์„ ๊ณต์œ ํ•˜๋Š” ๊ธฐ์ „์— ๋”ฐ๋ผ ๊ฐ€๊ตฌ์› ๊ฐ„ ์ธ์‹์— ์ฐจ์ด๊ฐ€ ์žˆ๋Š” ๊ฒƒ์„ ํ™•์ธํ•˜์˜€๋‹ค. ํŠนํžˆ ๋ฏธ์„ฑ๋…„ ์ž๋…€์˜ ์ˆ˜๊ฐ€ ๋งŽ์•„์งˆ์ˆ˜๋ก ๋ถ€๋ถ€๊ฐ„ ๊ณ„์ธต ์ธ์‹์˜ ๊ฒฉ์ฐจ๊ฐ€ ๋ฒŒ์–ด์กŒ๊ณ , ์ž๋…€ ์„ธ๋Œ€, ๊ฐ€๊ตฌ์ฃผ ์„ธ๋Œ€, ๊ฐ€๊ตฌ์ฃผ์˜ ๋ถ€๋ชจ ์„ธ๋Œ€๋ณ„๋กœ ์„ธ๋Œ€๊ฐ„ ์ธ์‹ ์ฐจ์ด๊ฐ€ ์žˆ์—ˆ๋‹ค. ์ฆ‰, ์ด๋Ÿฌํ•œ ์ธ์‹ ๊ฒฉ์ฐจ๋Š” ๊ฐ€๊ตฌ์›์œผ๋กœ์„œ ์ •์ฒด์„ฑ ๋ฐ ๋ถ€์–‘์˜๋ฌด, ํ˜น์€ ๊ฐ€๊ตฌ ๋‚ด์—์„œ ์ Š์€ ์„ธ๋Œ€์—๊ฒŒ ์ž์›์ด ์ง‘์ค‘๋˜๋Š” ์–‘์ƒ์— ๊ธฐ์ธํ•œ๋‹ค. ๋‘๋ฒˆ์งธ ์—ฐ๊ตฌ์˜ ๊ถค์  ๋ถ„์„ ๊ฒฐ๊ณผ, HRQoL์€ ์‹œ๊ฐ„ ๊ฒฝ๊ณผ์— ๋”ฐ๋ผ ์ง€์†์ ์œผ๋กœ ์ตœ๊ณ ์ ์ธ 1์  ์ˆ˜์ค€์„ ์œ ์ง€ํ•˜๊ฑฐ๋‚˜, ๋‚ฎ์€ ์ˆ˜์ค€์—์„œ ์‹œ์ž‘ํ•˜์—ฌ ํ•˜ํ–ฅ ๊ณก์„ ์„ ๊ทธ๋ฆฌ๋ฉฐ ๊ฑด๊ฐ•์ด ์•…ํ™”๋˜๋Š” ํ˜•ํƒœ๋งŒ ํ™•์ธ๋˜์—ˆ๋‹ค. ๋˜ํ•œ ํ•œ๊ตญ ์‚ฌํšŒ๋Š” ๋ถ€์œ ํ•œ ๊ฐ€๊ตฌ๊ฐ€ ์‹œ๊ฐ„์ด ์ง€๋‚จ์— ๋”ฐ๋ผ ๋”์šฑ ๋น ๋ฅด๊ฒŒ ์†Œ๋“์ฆ๊ฐ€๋ฅผ ์ด๋ฃจ๋ฉฐ ์ด๋Ÿฌํ•œ ๊ฒฝ์ œ์  ๋ถˆํ‰๋“ฑ์ด ์‚ฌํšŒ์  ์ง‘๋‹จํ™”์— ๊ธฐ์—ฌํ•˜๊ณ  ์žˆ์—ˆ๋‹ค. ๋˜ํ•œ ์ด๋Ÿฌํ•œ ์‚ฌํšŒ๊ฒฝ์ œ์  ์ง€ํ‘œ๋Š” ์ค‘์žฅ๊ธฐ์ ์œผ๋กœ ๊ฑด๊ฐ• ๊ถค์  ํ™•๋ฅ ์— ์˜ํ–ฅ์„ ๋ฏธ์น˜๋ฉฐ ๊ฑด๊ฐ• ๊ฒฉ์ฐจ๋ฅผ ์•…ํ™”์‹œํ‚ค๋Š” ๊ฒƒ์„ ํ™•์ธํ•˜์˜€๋‹ค. ํ•œํŽธ, ์ค‘์žฅ๊ธฐ์ ์ธ ์ฃผ๊ด€์  ๊ณ„์ธต ์ธ์‹ ์ˆ˜์ค€์€ ๊ฐ€๊ตฌ ์†Œ๋“์˜ ๋ณ€ํ™”๋งŒ์œผ๋กœ๋Š” ์„ค๋ช…๋˜์ง€ ์•Š์•˜๋Š”๋ฐ, ์ด๋Š” ๊ฐ€๊ตฌ์˜ ๋ฒ”์ฃผ๋ฅผ ๋„˜์–ด์„œ๋Š” ์‚ฌํšŒ์  ์ž์› ๋ฐ ํ™˜๊ฒฝ์˜ ์ค‘์š”์„ฑ์„ ์‹œ์‚ฌํ•œ๋‹ค. ์„ธ๋ฒˆ์งธ ์—ฐ๊ตฌ์—์„œ, ์ง€์—ญ์‚ฌํšŒ๋Š” ๋ฌผ๋ฆฌ์  ์‹œ์„ค ํ™˜๊ฒฝ ๋ฐ ๊ฒฝ์ œ์  ์ˆ˜์ค€ ์ด์™ธ์—๋„ ์—ฐ๊ฒฐํ˜•, ๊ฒฐ์†ํ˜•, ์ธ์ง€์  ์‚ฌํšŒ ์ž๋ณธ๊ณผ, ์˜๋ฃŒ์„œ๋น„์Šค์˜ ๊ณต๊ธ‰ ๋ฐ ์ˆ˜์š” ํ™˜๊ฒฝ์œผ๋กœ ์œ ํ˜•ํ™” ๋˜๋Š” ํŠน์ง•์ด ์žˆ์—ˆ๋‹ค. ์ด๋Ÿฌํ•œ ์ง€์—ญ์‚ฌํšŒ์˜ ์ž์› ๋ถ„ํฌ์˜ ํŠน์„ฑ์€ ๊ทผ๋ฆฐํšจ๊ณผ๋กœ์„œ ์ธ๊ตฌ์ง‘๋‹จ์˜ ๊ฑด๊ฐ•๊ด€๋ จ ์‚ถ์— ์งˆ์— ํฌ๊ฒŒ ์˜ํ–ฅ์„ ๋ฏธ์ณค๋‹ค. ํŠนํžˆ ๋‹จ์ˆœ ์‹œ์„ค์˜ ์ˆ˜๊ฐ€ ์•„๋‹Œ, ๋ฏธ์ถฉ์กฑ ์˜๋ฃŒํ•„์š”๋„์™€ ๊ฐ™์€ ์‹ค์งˆ์ ์ธ ์ž์›์˜ ์ด์šฉ๊ฐ€๋Šฅ์„ฑ์ด ์ธ๊ตฌ ๊ฑด๊ฐ•์— ์˜ํ–ฅ์„ ๋ฏธ์ณค๋‹ค. ๋„ค๋ฒˆ์งธ ์—ฐ๊ตฌ์—์„œ, ์ง€๋ฆฌ์ ์ธ ๊ฑฐ๋ฆฌ๋ฅผ ๋ฐ˜์˜ํ•œ ๊ณต๊ฐ„ ๋ถ„์„ ๊ฒฐ๊ณผ, ๊ฑด๊ฐ•๊ด€๋ จ ์‚ถ์ด ์งˆ์€ ๋†’์€ ์ˆ˜์ค€์˜ ์ง€๋ฆฌ์  ์ž๊ธฐ์ƒ๊ด€์„ ๊ฐ€์กŒ๋‹ค. ์ฆ‰ ๊ฑด๊ฐ•ํ•œ ์ง€์—ญ์‚ฌํšŒ๋Š” ๊ฑด๊ฐ•ํ•œ ์ง€์—ญ๋ผ๋ฆฌ ์„œ๋กœ ์ง€๋ฆฌ์ ์œผ๋กœ ๋ฐ€์ ‘ํ•œ ๊ณต๊ฐ„์  ์ƒ๊ด€์„ฑ์ด ์žˆ์—ˆ๋‹ค. ๋˜ํ•œ ๊ฐ ์ง€์—ญ์‚ฌํšŒ ์ž์›์ด ๊ฑด๊ฐ•์— ๋ฏธ์น˜๋Š” ํšจ๊ณผ์„ฑ์€ ์ง€์—ญ๋งˆ๋‹ค ์ƒ์ดํ•˜๋ฉฐ, ํ•ด๋‹น ํšจ๊ณผ์„ฑ์„ ๊ตฐ์ง‘ํ™” ํ•˜์˜€์„ ๋•Œ ๊ถŒ์—ญ ๋‹จ์œ„์—์„œ ์ง‘ํ•ฉ์ ์œผ๋กœ ์ž‘๋™ํ•˜๋Š” ๊ฒƒ์„ ํ™•์ธํ•˜์˜€๋‹ค. ์„œ์šธ ๋ฐ ๊ฒฝ๊ธฐ๋„์—์„œ๋Š” ์‚ฌํšŒ ์‹ ๋ขฐ๊ฐ€ ์œ ์˜ํ•œ ๊ฑด๊ฐ• ๋ณดํ˜ธ ํšจ๊ณผ๊ฐ€ ์žˆ์—ˆ๋‹ค. ๊ฒฝ์ƒ๋„๊ถŒ์—์„œ๋Š” ๊ฑด๊ฐ•๊ด€๋ จ ์‚ถ์˜ ์งˆ ์ˆ˜์ค€์ด ๋‚ฎ์€ ์‚ฌ๋žŒ์ด ์ข…๊ตํ™œ๋™์„ ๋ณด๋‹ค ๋นˆ๋ฒˆํ•˜๊ฒŒ ์ฐธ์—ฌํ•˜๋Š” ๊ฒฝํ–ฅ์ด ์žˆ์—ˆ๋‹ค. ์ „๋ผ๋„๊ถŒ์—์„œ๋Š” ๋ฒฝ์ง€ ์ง€์—ญ์˜ 1์ธ ๊ฐ€๊ตฌ๊ฐ€ ๊ฑด๊ฐ• ์œ„ํ—˜ ์š”์†Œ์˜€์œผ๋ฉฐ, ๊ฐ•์› ๋ฐ ์ถฉ์ฒญ๋„๊ถŒ์—์„œ๋Š” ๋ฏธ์ถฉ์กฑ ์˜๋ฃŒ ํ•„์š”๋„๊ฐ€ ๊ฑด๊ฐ•๊ด€๋ จ ์‚ถ์˜ ์งˆ๊ณผ ์œ ์˜ํ•œ ๋ถ€์  ์—ฐ๊ด€์„ฑ์„ ๋ณด์˜€๋‹ค. ๊ฒฐ๋ก  ๋‹ค์ˆ˜์ค€์— ๊ฑธ์นœ ์ž์›์˜ ๋ถ„ํฌ ๋ฐ ํ™œ์šฉ๊ฐ€๋Šฅ์„ฑ์˜ ๊ฒฉ์ฐจ๊ฐ€ ๊ฑด๊ฐ•๊ด€๋ จ ์‚ถ์˜ ์งˆ์— ๋ฏธ์น˜๋Š” ์˜ํ–ฅ์„ ํ™•์ธํ•˜์˜€๋‹ค. ํ•œ๊ตญ ์‚ฌํšŒ๋Š” ์‹œ๊ฐ„ ๊ฒฝ๊ณผ์— ๋”ฐ๋ผ ๊ฐ€๊ตฌ ์†Œ๋“์— ๊ธฐ๋ฐ˜ํ•œ ๊ณ„์ธตํ™”๊ฐ€ ๊ฒฌ๊ณ ํ•ด์ง€๊ณ  ์žˆ์œผ๋ฉฐ ๊ฐ๊ด€์ , ์ฃผ๊ด€์  ์‚ฌํšŒ ์ด๋™์˜ ๊ฐ€๋Šฅ์„ฑ์ด ์ œ์•ฝ๋˜๋Š” ๊ฒฝ์ง๋œ ์‚ฌํšŒ์ด๋‹ค. ์ฃผ๊ฑฐ์•ˆ์ •์„ฑ์€ ๊ฐ๊ด€์  ์†Œ๋“ ๋Œ€๋น„ ์ฃผ๊ด€์  ์ˆ˜์ค€์ด ๋‚ฎ์•„์ง€๋Š”๋ฐ ๋Œ€ํ•œ ๋ณดํ˜ธํšจ๊ณผ๊ฐ€ ์žˆ๋‹ค. ๊ทธ๋Ÿฌ๋‚˜ ๋ฌผ๋ฆฌ์  ์ฃผ๊ฑฐ๊ณต๊ฐ„๊ณผ ๊ฒฝ์ œ์  ์ž์›์„ ๊ณต์œ ํ•˜๋Š” ํ•œ ๊ฐ€์ •์•ˆ์—์„œ๋„ ๋ถ€์–‘์˜๋ฌด์™€ ๊ฐ™์€ ๊ฐ€๊ตฌ์› ์ •์ฒด์„ฑ์ด ์ž์› ํ™œ์šฉ์— ์˜ํ–ฅ์„ ๋ฏธ์ณ ์ฃผ๊ด€์ ์ธ ์‚ฌํšŒ๊ณ„์ธต ์ธ์‹์— ๊ฒฉ์ฐจ๋ฅผ ๋ฐœ์ƒ์‹œํ‚ค๊ณ  ์žˆ๋‹ค. ํ•œํŽธ, ๊ฑด๊ฐ•๊ด€๋ จ ์‚ถ์˜ ์งˆ์€ ์ข…๋‹จ์ ์œผ๋กœ ์ƒํ–ฅ๊ณก์„ ์„ ๋„์ง€ ์•Š์œผ๋ฏ€๋กœ ์•…ํ™”๋ฅผ ๋ฐฉ์ง€ ๋ฐ ๋ณดํ˜ธํ•˜๋Š” ๊ฒƒ์ด ์ค‘์š”ํ•˜๋‹ค. ์ด๋Š” ๊ฐœ์ธ๊ณผ ๊ฐ€๊ตฌ ์ˆ˜์ค€์„ ๋„˜์–ด ๋‹ค์–‘ํ•œ ์‚ฌํšŒ ์ž์›์œผ๋กœ์˜ ์ ‘๊ทผ ๊ฐ€๋Šฅ์„ฑ ๋ฐ ํ™œ์šฉ ๊ฐ€๋Šฅ์„ฑ์ด ๋ณด์žฅ๋˜์–ด์•ผ ๊ฐ€๋Šฅํ•œ ์ผ์ด๋‹ค. ์ง€์—ญ ์‚ฌํšŒ์˜ ์ž์›์„ ์œ ํ˜•ํ™”ํ•ด๋ณด๋ฉด, ๋ฌผ๋ฆฌ์  ์‹œ์„ค์ด์™ธ์—๋„ ๋ณด๊ฑด์˜๋ฃŒ์„œ๋น„์Šค์˜ ์ˆ˜์š”์™€ ๊ณต๊ธ‰ ๊ท ํ˜•, ๊ทธ๋ฆฌ๊ณ  ์‚ฌํšŒ์ž๋ณธ ํ™˜๊ฒฝ์œผ๋กœ ํŠน์„ฑํ™” ๋˜๋Š”๋ฐ, ์ด๋Ÿฌํ•œ ์ง€์—ญ์‚ฌํšŒ์˜ ์ž์› ์œ ํ˜•์€ ์ธ๊ตฌ์ง‘๋‹จ ๊ฑด๊ฐ•์— ์˜ํ–ฅ์„ ๋ฏธ์นœ๋‹ค. ๋˜ํ•œ ๊ฑด๊ฐ•ํ•œ ์ง€์—ญ์‚ฌํšŒ๋Š” ๊ฑด๊ฐ•ํ•œ ์ง€์—ญ์‚ฌํšŒ๋ผ๋ฆฌ, ๊ฑด๊ฐ• ๋ฐ•ํƒˆ์ง€์—ญ์€ ๋ฐ•ํƒˆ์ง€์—ญ๋ผ๋ฆฌ ๋†’์€ ๊ณต๊ฐ„์  ์ž๊ธฐ์ƒ๊ด€์„ ๊ฐ€์ง€๋ฉฐ, ์ž์›์˜ ํšจ๊ณผ์„ฑ์ด ๊ถŒ์—ญ๋ณ„๋กœ ๊ตฐ์ง‘ํ™” ๋˜๋Š” ์ง€์—ญ์„ฑ์„ ๋ˆ๋‹ค๋Š” ์ ์— ์œ ์˜ํ•˜์—ฌ ์ž์› ์žฌ๋ถ„๋ฐฐ ์ •์ฑ…์„ ์ˆ˜๋ฆฝํ•˜์—ฌ์•ผ ํ•œ๋‹ค. ์ฆ‰, ํ–ฅํ›„ ์ง€์—ญ ๋ณด๊ฑด ์ •์ฑ… ์ˆ˜๋ฆฝ ์‹œ์—๋Š” ๋‹จ์ˆœํžˆ ์ž์›์˜ ์–‘์„ ๊ท ๋“ฑํ™”ํ•˜๋Š” ์ •์ฑ…๋ณด๋‹ค, ํ•œ์ •๋œ ์ž์›์˜ ์–‘์„ ๊ฐ€๊ตฌ ์œ ํ˜•๋ณ„๋กœ, ์ง€์—ญ๋ณ„๋กœ, ์–ด๋Š ์ˆ˜์ค€์œผ๋กœ ์ง‘์ค‘ ๋ถ„๋ฐฐํ•˜๋Š” ๊ฒƒ์ด ๊ฐ€์žฅ ๋น„์šฉํšจ๊ณผ์ ์ผ์ง€์— ๊ธฐ๋ฐ˜ํ•˜์—ฌ ์ง€์—ญ ํŠนํ™” ์ „๋žต์„ ์ˆ˜๋ฆฝํ•˜์—ฌ์•ผ ํ•œ๋‹ค. ์ด์ƒ์„ ์ข…ํ•ฉํ•˜๋ฉด, ํ•œ ๊ฐœ์ธ์ด ์–ด๋– ํ•œ ๊ฐ€์กฑ ์—ญํ• ์˜ ์˜๋ฌด๋ฅผ ๊ฐ€์ง€๋Š”์ง€, ์–ด๋Š ์ง€์—ญ์‚ฌํšŒ์— ๊ฑฐ์ฃผํ•˜๋Š”์ง€์— ๋”ฐ๋ผ ์ž์›์˜ ํ™œ์šฉ๊ฐ€๋Šฅ์„ฑ๊ณผ ํšจ๊ณผ์„ฑ์€ ์ƒ์ดํ•จ์„ ๋ณด์—ฌ์ค€๋‹ค. ์ด๋Ÿฌํ•œ ๋‚ด์žฌ์  ์†์„ฑ์€ ์ค‘์žฅ๊ธฐ์ ์œผ๋กœ, ๊ทธ๋ฆฌ๊ณ  ๊ณต๊ฐ„์ ์œผ๋กœ ๋”์šฑ ํฐ ๊ฑด๊ฐ• ๊ฒฉ์ฐจ๋ฅผ ๋ถˆ๋Ÿฌ์ผ์œผํ‚จ๋‹ค๋Š” ์ ์—์„œ ๊ฑด๊ฐ• ํ˜•ํ‰์„ฑ ๋ฐ ์ž์›์˜ ์žฌ๋ถ„๋ฐฐ ์ •์ฑ…์— ์‹œ์‚ฌํ•˜๋Š” ๋ฐ”๊ฐ€ ํฌ๋‹ค.Background The recent dynamics of population aging and economic development have drawn renewed interest to the health paradigm. Rather than a quantitative indicator, such as a prolonged life, qualitative indicators, such as health-related quality of life (HRQoL), have become of interest. However, concepts of economic or human capital cannot fully explain the quality of life. Moreover, it is not only the amount of resources owned per se but also mechanisms of generation, distribution, and availability of valuable resources that are important for understanding the social determinants of HRQoL. In general, social determinants of health cumulatively operate over long periods of time and are more effectively investigated by longitudinal perspectives. These resources can be multidimensional, ranging from the material environment to social relationships, and can be distributed within a family or among communities. Ecological differentiation stems from community characteristics and is very much a spatial affair. Here, this thesis aims to evaluate the broader concept of resources using a subjective measure of social status and social resource indicators. Then, it aims to capture the structure of multilevel resource distribution as it is dispersed over time and space. Finally, this aims to expand the framework of social determinants of HRQoL and reveal the health inequalities embedded in our society. The study objectives are as follows. First, the determinants of subjective social status (SSS) were investigated among household members, focusing on the household environment. Then, differences in SSS among members and gaps between objective income and SSS levels were assessed. Second, changing patterns of socioeconomic status were investigated over time and longitudinal effects of socioeconomic status on HRQoL trajectories were assessed. Then, combined changes in patterns of objective and subjective status (i.e., multiple socioeconomic status trajectories) and the HRQoL trajectories were derived, with time gaps. The prospective effects of socioeconomic transition on HRQoL trajectories were analyzed. Third, the resource composite was defined at the community level by combining healthcare resources, cultural infrastructure, and social capital, such as social networks, as well as the neighborhood environment. Then, types of outdoor resources that are crucial to population health were investigated. Finally, spatial correlations in HRQoL were determined and effects of social resources on HRQoL were investigated, considering geographical variations. Methods The study population was adults over 18 years old in the eighth wave (2013) of the Korea Health Panel Survey for the first study, composed of 3,984 households and 8,330 individuals. As the second was a longitudinal study, we made the dataset a balanced panel that respondents answered in all ten waves of the Korea Health Panel (2009โ€“2018). As the third and fourth were ecological studies, we collected community variables via two types of data librariesโ€”OSIS and the Community Health Survey website. We then aggregated overall data at the 250 community level. The dependent variable of HRQoL was calculated using the EQ-5D index with the weights for Koreans. We used the MacArthur scale to measure household SSS. The other explanatory variables consisted of social resources (trust, social network, and social participation), cultural resources (cultural and sports infrastructures and parks), healthcare resources (doctors, essential medical clinics, tertiary hospitals, and nursing hospitals), and communityโ€™s socioeconomic status. Regarding methodologies, we applied the intra-class correlation coefficient to investigate the response reliabilities on household SSS among household members for the first study. In addition, we assessed the importance of determinants on SSS using variance decomposition. For the second study, we used group-based trajectory modeling to identify health trajectories and group-based multi-trajectory modeling to draw multi-SES trajectories. The third study was analyzed using principal component analysis and principal component regression modeling. For the spatial analysis, the fourth study used the geographically weighted regression (GWR) and k-means clustering of the GWR coefficients. We used the STATA 16, SAS software 9.4 version, R version of 4.1.3., QGIS 3.24 and GeoDa 1.18.0 in the adequate analysis. Results For the first study, Housing safety and household wealth, which contributed to 65.7% of the variance in SSS, act as a buffer to downgrade one's SSS. However, there were significant differences between household members according to the dynamics of relational resource sharing. In particular, the perceptions of married couples were consistent, although this decreased as they nurtured more underage children. There are SSS gaps across generations between the ages of the head of household's parents, head of household, and children. For the second study of trajectory modeling, four types of multi-SES trajectories were derived from 2009 to 2013. In the multi-SES trajectories, the richer in 2009 had steeper income growth during the period, while the shapes of the SSS were kept unchangeable over time. The following HRQoL trajectories from 2013 to 2018 showed three distinctive patternsโ€”the 4.3% of individuals showed a low and declining pattern while the other two trajectories remained high and stable. The objective and subjective socioeconomic status, respectively, at baseline were strongly associated with the following health trajectories. For the third study, the communities can be categorized into several principal components (PC). The seven PCs explicitly represent the community characteristics such as (1) structural environments regarding facilities and physical structure; (2)-(3) the set of demand and supply in healthcare; (4) bridging; (5) cognitive; (6) bonding social capital; and (7) economic affluence of the community. These first to seventh PCs explain 46.4% of the HRQoL variance at the community level and are distinctively associated with the HRQoL level. In particular, the structural environment significantly influences population health, implying the neighborhood effect on health. The fourth spatial analysis study showed that HRQoL at the community level has spatial autocorrelation, which means healthy regions are geographically clustered with healthy ones. Moreover, resources do or do not exert effectiveness depending on the regions. Social trust effectively increases HRQoL only in the Seoul and Gyeonggi-do regions. Meanwhile, the religious activities in the Busan and Gyunsang-do regions unexpectedly showed a negative association with health. Unmet medical needs have become a critical health agenda, specifically in the eastern and interior regions of South Korea. Urbanization of the city was positively associated with health on the west side. The aging index is negatively associated with the north and interior regions. The single-person household has become a risk factor in Jeollanam-do and Gangwon-do regions. This differential effectiveness can be spatially clustered and distinguished into five clusters based on the GWR coefficients. That is, the effectiveness of the resources works collectively with some degree of administrative spatial range. Conclusion This study investigated the distribution of multiple levels of resources across households and communities and their health impacts. Taken together, the results indicate that South Korea is a risk-bearing society. The HRQoL patterns were either stable or decreased, but not increased. In addition, HRQoL was spatially clustered at high and low levels of HRQoL. These health patterns suggest longitudinal deterioration and geographical disparities in health. The availability of resources differed according to household environment and family roles. Furthermore, the effectiveness of social resources in the community, such as social capital, differed according to region. This geographical pattern of resource effects on health indicates a spatially shaped social process that gives rise to social inequality. In sum, these findings suggest that the originating family, and where a person lives, determines their health status, highlighting the importance of resource redistribution in enhancing population health. Considering that the administrative district boundary is an effective policy target, the regional-specific healthcare policy for communities should allocate limited resources to areas and households in need, and not focus on equalizing the resources.Chapter 1. Overall introduction๏ผ‘ 1.1. Study Background๏ผ’ 1.2. Study design and objectives๏ผ’๏ผ” Chapter 2. Resource sharing model for subjective social status at the household level๏ผ’๏ผ— 2.1. Introduction๏ผ’๏ผ˜ 2.2. Methods๏ผ“๏ผ“ 2.3. Results๏ผ“๏ผ˜ 2.4. Discussion ๏ผ•๏ผ 2.5. Supplementary data ๏ผ•๏ผ– Appendix A. Distribution of household income and subjective social status among the study population ๏ผ•๏ผ– Appendix B. Detailed information on the intra-class correlation coefficients of subjective social status ๏ผ•๏ผ˜ Appendix C. Post hoc analyses ๏ผ•๏ผ™ Chapter 3. Trajectories of health-related quality of life by change pattern of objective and subjective social status ๏ผ–๏ผ’ 3.1. Introduction ๏ผ–๏ผ“ 3.2. Methods ๏ผ–๏ผ— 3.3. Results ๏ผ—๏ผ‘ 3.4. Discussion ๏ผ˜๏ผ’ 3.5. Supplementary data ๏ผ˜๏ผ™ Appendix A. Sensitivity analyses for missing using MCAR and MAR test ๏ผ˜๏ผ™ Appendix B. Selection criteria for the optimal trajectory model ๏ผ™๏ผ• Chapter 4. Diverse social resources for health-related quality of life in the communities๏ผ‘๏ผ๏ผ 4.1. Introduction ๏ผ‘๏ผ๏ผ‘ 4.2. Methods ๏ผ‘๏ผ๏ผ• 4.3. Results ๏ผ‘๏ผ‘๏ผ 4.4. Discussion ๏ผ‘๏ผ’๏ผ‘ 4.5. Supplementary data ๏ผ‘๏ผ’๏ผ• Appendix A. Correlation of variables and OLS model ๏ผ‘๏ผ’๏ผ• Appendix B. Detailed information on PCA and PCR results ๏ผ‘๏ผ’๏ผ— Appendix C. Sensitivity analyses: Comparison to other dimension reduction methods๏ผ‘๏ผ“๏ผ Chapter 5 Spatial dependences of social resources on health-related quality of life๏ผ‘๏ผ“๏ผ’ 5.1. Introduction ๏ผ‘๏ผ“๏ผ“ 5.2. Methods ๏ผ‘๏ผ“๏ผ– 5.3. Results ๏ผ‘๏ผ”๏ผ‘ 5.4. Discussion ๏ผ‘๏ผ•๏ผ” 5.5. Supplementary data ๏ผ‘๏ผ•๏ผ™ Appendix A. Spatial distributions of HRQoL in 2011, 2015, and 2019๏ผ‘๏ผ•๏ผ™ Appendix B. Detailed information on the 2019 GWR modeling ๏ผ‘๏ผ–๏ผ Appendix C. Sensitivity analyses: Validation of the โ€˜religious activitiesโ€™ coefficient ๏ผ‘๏ผ–๏ผ” Chapter 6 Overall discussion ๏ผ‘๏ผ–๏ผ— 6.1. Summary of the studies (Chapter 2-Chapter5) ๏ผ‘๏ผ–๏ผ˜ 6.2. Framework of Multilevel Resource Distribution ๏ผ‘๏ผ—๏ผ 6.3. Overall discussion ๏ผ‘๏ผ—๏ผ˜ 6.4. Overall conclusions ๏ผ‘๏ผ˜๏ผ– References ๏ผ‘๏ผ˜๏ผ™ ๊ตญ๋ฌธ ์ดˆ๋ก ๏ผ’๏ผ‘๏ผ‘๋ฐ•

    Economic Crises: Evidence and Insights from East Asia

    Get PDF
    macroeconomics, Economic Crises, East Asia

    Data Science for Finance: Targeted Learning from (Big) Data to Economic Stability and Financial Risk Management

    Get PDF
    A thesis submitted in partial fulfillment of the requirements for the degree of Doctor in Information Management, specialization in Statistics and EconometricsThe modelling, measurement, and management of systemic financial stability remains a critical issue in most countries. Policymakers, regulators, and managers depend on complex models for financial stability and risk management. The models are compelled to be robust, realistic, and consistent with all relevant available data. This requires great data disclosure, which is deemed to have the highest quality standards. However, stressed situations, financial crises, and pandemics are the source of many new risks with new requirements such as new data sources and different models. This dissertation aims to show the data quality challenges of high-risk situations such as pandemics or economic crisis and it try to theorize the new machine learning models for predictive and longitudes time series models. In the first study (Chapter Two) we analyzed and compared the quality of official datasets available for COVID-19 as a best practice for a recent high-risk situation with dramatic effects on financial stability. We used comparative statistical analysis to evaluate the accuracy of data collection by a national (Chinese Center for Disease Control and Prevention) and two international (World Health Organization; European Centre for Disease Prevention and Control) organizations based on the value of systematic measurement errors. We combined excel files, text mining techniques, and manual data entries to extract the COVID-19 data from official reports and to generate an accurate profile for comparisons. The findings show noticeable and increasing measurement errors in the three datasets as the pandemic outbreak expanded and more countries contributed data for the official repositories, raising data comparability concerns and pointing to the need for better coordination and harmonized statistical methods. The study offers a COVID-19 combined dataset and dashboard with minimum systematic measurement errors and valuable insights into the potential problems in using databanks without carefully examining the metadata and additional documentation that describe the overall context of data. In the second study (Chapter Three) we discussed credit risk as the most significant source of risk in banking as one of the most important sectors of financial institutions. We proposed a new machine learning approach for online credit scoring which is enough conservative and robust for unstable and high-risk situations. This Chapter is aimed at the case of credit scoring in risk management and presents a novel method to be used for the default prediction of high-risk branches or customers. This study uses the Kruskal-Wallis non-parametric statistic to form a conservative credit-scoring model and to study its impact on modeling performance on the benefit of the credit provider. The findings show that the new credit scoring methodology represents a reasonable coefficient of determination and a very low false-negative rate. It is computationally less expensive with high accuracy with around 18% improvement in Recall/Sensitivity. Because of the recent perspective of continued credit/behavior scoring, our study suggests using this credit score for non-traditional data sources for online loan providers to allow them to study and reveal changes in client behavior over time and choose the reliable unbanked customers, based on their application data. This is the first study that develops an online non-parametric credit scoring system, which can reselect effective features automatically for continued credit evaluation and weigh them out by their level of contribution with a good diagnostic ability. In the third study (Chapter Four) we focus on the financial stability challenges faced by insurance companies and pension schemes when managing systematic (undiversifiable) mortality and longevity risk. For this purpose, we first developed a new ensemble learning strategy for panel time-series forecasting and studied its applications to tracking respiratory disease excess mortality during the COVID-19 pandemic. The layered learning approach is a solution related to ensemble learning to address a given predictive task by different predictive models when direct mapping from inputs to outputs is not accurate. We adopt a layered learning approach to an ensemble learning strategy to solve the predictive tasks with improved predictive performance and take advantage of multiple learning processes into an ensemble model. In this proposed strategy, the appropriate holdout for each model is specified individually. Additionally, the models in the ensemble are selected by a proposed selection approach to be combined dynamically based on their predictive performance. It provides a high-performance ensemble model to automatically cope with the different kinds of time series for each panel member. For the experimental section, we studied more than twelve thousand observations in a portfolio of 61-time series (countries) of reported respiratory disease deaths with monthly sampling frequency to show the amount of improvement in predictive performance. We then compare each countryโ€™s forecasts of respiratory disease deaths generated by our model with the corresponding COVID-19 deaths in 2020. The results of this large set of experiments show that the accuracy of the ensemble model is improved noticeably by using different holdouts for different contributed time series methods based on the proposed model selection method. These improved time series models provide us proper forecasting of respiratory disease deaths for each country, exhibiting high correlation (0.94) with Covid-19 deaths in 2020. In the fourth study (Chapter Five) we used the new ensemble learning approach for time series modeling, discussed in the previous Chapter, accompany by K-means clustering for forecasting life tables in COVID-19 times. Stochastic mortality modeling plays a critical role in public pension design, population and public health projections, and in the design, pricing, and risk management of life insurance contracts and longevity-linked securities. There is no general method to forecast the mortality rate applicable to all situations especially for unusual years such as the COVID-19 pandemic. In this Chapter, we investigate the feasibility of using an ensemble of traditional and machine learning time series methods to empower forecasts of age-specific mortality rates for groups of countries that share common longevity trends. We use Generalized Age-Period-Cohort stochastic mortality models to capture age and period effects, apply K-means clustering to time series to group countries following common longevity trends, and use ensemble learning to forecast life expectancy and annuity prices by age and sex. To calibrate models, we use data for 14 European countries from 1960 to 2018. The results show that the ensemble method presents the best robust results overall with minimum RMSE in the presence of structural changes in the shape of time series at the time of COVID-19. In this dissertationโ€™s conclusions (Chapter Six), we provide more detailed insights about the overall contributions of this dissertation on the financial stability and risk management by data science, opportunities, limitations, and avenues for future research about the application of data science in finance and economy
    • โ€ฆ
    corecore