7 research outputs found

    Linking big models to big data: efficient ecosystem model calibration through Bayesian model emulation

    Get PDF
    Data-model integration plays a critical role in assessing and improving our capacity to predict ecosystem dynamics. Similarly, the ability to attach quantitative statements of uncertainty around model forecasts is crucial for model assessment and interpretation and for setting field research priorities. Bayesian methods provide a rigorous data assimilation framework for these applications, especially for problems with multiple data constraints. However, the Markov chain Monte Carlo (MCMC) techniques underlying most Bayesian calibration can be prohibitive for computationally demanding models and large datasets. We employ an alternative method, Bayesian model emulation of sufficient statistics, that can approximate the full joint posterior density, is more amenable to parallelization, and provides an estimate of parameter sensitivity. Analysis involved informative priors constructed from a meta-analysis of the primary literature and specification of both model and data uncertainties, and it introduced novel approaches to autocorrelation corrections on multiple data streams and emulating the sufficient statistics surface. We report the integration of this method within an ecological workflow management software, Predictive Ecosystem Analyzer (PEcAn), and its application and validation with two process-based terrestrial ecosystem models: SIPNET and ED2. In a test against a synthetic dataset, the emulator was able to retrieve the true parameter values. A comparison of the emulator approach to standard brute-force MCMC involving multiple data constraints showed that the emulator method was able to constrain the faster and simpler SIPNET model's parameters with comparable performance to the brute-force approach but reduced computation time by more than 2 orders of magnitude. The emulator was then applied to calibration of the ED2 model, whose complexity precludes standard (brute-force) Bayesian data assimilation techniques. Both models are constrained after assimilation of the observational data with the emulator method, reducing the uncertainty around their predictions. Performance metrics showed increased agreement between model predictions and data. Our study furthers efforts toward reducing model uncertainties, showing that the emulator method makes it possible to efficiently calibrate complex models.</p

    Markov chain Monte Carlo with Gaussian processes for fast parameter estimation and uncertainty quantification in a 1D fluidโ€dynamics model of the pulmonary circulation

    Get PDF
    The past few decades have witnessed an explosive synergy between physics and the life sciences. In particular, physical modelling in medicine and physiology is a topical research area. The present work focuses on parameter inference and uncertainty quantification in a 1D fluidโ€dynamics model for quantitative physiology: the pulmonary blood circulation. The practical challenge is the estimation of the patientโ€specific biophysical model parameters, which cannot be measured directly. In principle this can be achieved based on a comparison between measured and predicted data. However, predicting data requires solving a system of partial differential equations (PDEs), which usually have no closedโ€form solution, and repeated numerical integrations as part of an adaptive estimation procedure are computationally expensive. In the present article, we demonstrate how fast parameter estimation combined with sound uncertainty quantification can be achieved by a combination of statistical emulation and Markov chain Monte Carlo (MCMC) sampling. We compare a range of stateโ€ofโ€theโ€art MCMC algorithms and emulation strategies, and assess their performance in terms of their accuracy and computational efficiency. The longโ€term goal is to develop a method for reliable disease prognostication in real time, and our work is an important step towards an automatic clinical decision support system

    ๋ณต์žกํ•œ ๊ณตํ•™ ์‹œ์Šคํ…œ์— ๋Œ€ํ•œ ์˜ค๊ฒฝ๋ณด๋ฅผ ๊ณ ๋ คํ•œ ๋ฆฌ์งˆ๋ฆฌ์–ธ์Šค ํ•ด์„ ๋ฐ ์„ค๊ณ„ ๋ฐฉ๋ฒ•๋ก  ์—ฐ๊ตฌ

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ (๋ฐ•์‚ฌ)-- ์„œ์šธ๋Œ€ํ•™๊ต ๋Œ€ํ•™์› : ๊ณต๊ณผ๋Œ€ํ•™ ๊ธฐ๊ณ„ํ•ญ๊ณต๊ณตํ•™๋ถ€, 2018. 2. ์œค๋ณ‘๋™.it estimates a healthy engineered to be faulty, resulting unnecessary system shutdown, inspection, and โ€“ in the case of incorrect inspection โ€“ unnecessary system repair or replacement. Although false alarms make a system unavailable with capital loss, it has not been considered in resilience engineering. To cope with false alarm problems, this research is elaborated to advance the resilience engineering considering false alarms. Specifically, this consists of three research thrusts: 1) resilience analysis considering false alarms, 2) resilience-driven system design considering false alarms (RDSD-FA), and 3) resilience-driven system design considering time-dependent false alarms (RDSD-TFA). In the first research thrust, a resilience measure is newly formulated considering false alarms. This enables the evaluation of resilience decrease due to false alarms, resulting in accurate analysis of system resilience. Based upon the new resilience measure, RDSD-FA is proposed in the second research thrust. This aims at designing a resilient system to satisfy a target resilience level while minimizing life-cycle cost. This is composed of three hierarchical tasks: resilience allocation problem, reliability-based design optimization (RBDO), and PHM design. The third research thrust presents RDSD-TFA that considers time-dependent variability of an engineered system. This makes one to estimate life-cycle cost in an accurate and rigorous manner, and to design an engineered system more precisely while minimizing its life-cycle cost. The framework of RDSD-TFA consists of four tasks: system analysis, PHM analysis, life-cycle simulation, and design optimization. Through theoretical analysis and case studies, the significance of false alarms in engineering resilience and the effectiveness of the proposed ideas are demonstrated.๊ณตํ•™ ์‹œ์Šคํ…œ์€ ์ƒ์• ์ฃผ๊ธฐ์— ๊ฑธ์ณ ๋‹ค์–‘ํ•œ ๋ถˆํ™•์‹ค์„ฑ์— ๋…ธ์ถœ๋˜๋ฉฐ, ์ด๋กœ ์ธํ•ด ๋ชฉํ‘œ ์„ฑ๋Šฅ์„ ์ถฉ์กฑ์‹œํ‚ค์ง€ ๋ชปํ•  ๊ฒฝ์šฐ ์‚ฌํšŒ์ , ๊ฒฝ๊ณ„์ , ์ธ์  ์†Œ์‹ค์„ ์•ผ๊ธฐํ•˜๊ฒŒ ๋œ๋‹ค. ์ด์— ๋Œ€ํ•œ ํ•ด๊ฒฐ ๋ฐฉ์•ˆ ์ค‘ ํ•˜๋‚˜๋กœ ๋ฆฌ์งˆ๋ฆฌ์–ธ์Šค ์ฃผ๋„ ์„ค๊ณ„ ๊ธฐ์ˆ  (resilience-driven system design์ดํ•˜ RDSD)์ด ๊ฐœ๋ฐœ๋˜์—ˆ๋‹ค. RDSD๋Š” ๊ฑด์ „์„ฑ ์˜ˆ์ธก ๋ฐ ๊ด€๋ฆฌ ๊ธฐ์ˆ  (prognostics & health management์ดํ•˜ PHM)์„ ์„ค๊ณ„์— ๋„์ž…ํ•จ์œผ๋กœ์จ ๋น„์šฉ ํšจ์œจ์ ์ธ ๊ณ ์žฅ ์˜ˆ๋ฐฉ์„ ๊ฐ€๋Šฅ์ผ€ ํ•˜์˜€๋‹ค. ํ•˜์ง€๋งŒ, RDSD๋Š” PHM์˜ ๊ณ ์žฅ ์˜ค๊ฒฝ๋ณด ํ˜„์ƒ์„ ๊ณ ๋ คํ•˜์ง€ ์•Š๋Š” ํ•œ๊ณ„์ ์„ ๊ฐ–๋Š”๋‹ค. ๊ณ ์žฅ ์˜ค๊ฒฝ๋ณด๋Š” ๊ฑด์ „ํ•œ ์‹œ์Šคํ…œ์„ ๊ณ ์žฅ์ด๋ผ ์ถ”์ •ํ•˜๋Š” ํ˜„์ƒ์œผ๋กœ, ๋ถˆํ•„์š”ํ•œ ์‹œ์Šคํ…œ ์ •์ง€ ๋ฐ ๊ฒ€์‚ฌ ๋น„์šฉ์„ ์•ผ๊ธฐํ•˜์—ฌ, PHM๊ณผ RDSD์˜ ๊ธฐ์ˆ ์  ํšจ์šฉ์„ฑ์„ ๋–จ์–ดํŠธ๋ฆฌ๊ฒŒ ๋œ๋‹ค. ๋”ฐ๋ผ์„œ, RDSD์˜ ๊ธฐ์ˆ ์  ์•ฝ์ง„๊ณผ ์‹ค์ ์šฉ์„ ๋„๋ชจํ•˜๊ธฐ ์œ„ํ•ด์„œ๋Š” ๊ณ ์žฅ ์˜ค๊ฒฝ๋ณด ํ˜„์ƒ์„ ํ•ด๊ฒฐํ•ด์•ผ ํ•œ๋‹ค. ๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š” ๊ณ ์žฅ ์˜ค๊ฒฝ๋ณด์˜ ๊ณ ๋ ค๋ฅผ ํ†ตํ•ด ๋ฆฌ์งˆ๋ฆฌ์–ธ์Šค ํ•ด์„ ๋ฐ ์„ค๊ณ„ ๋ฐฉ๋ฒ•๋ก ์„ ๊ฐœ์„ ํ•˜๊ณ ์ž ํ•˜๋ฉฐ, ์ด๋ฅผ ์œ„ํ•ด ์„ธ ๊ฐ€์ง€ ์—ฐ๊ตฌ ์ฃผ์ œ๋ฅผ ์ œ์•ˆํ•œ๋‹ค. ์ฒซ ๋ฒˆ์งธ ์ฃผ์ œ๋Š” ์˜ค๊ฒฝ๋ณด๋ฅผ ๊ณ ๋ คํ•œ ๋ฆฌ์งˆ๋ฆฌ์–ธ์Šค ๋ถ„์„์œผ๋กœ, ๊ณตํ•™ ์‹œ์Šคํ…œ์˜ ๋ฆฌ์งˆ๋ฆฌ์–ธ์Šค ์‹œ๋‚˜๋ฆฌ์˜ค ๋ถ„์„์— ๊ธฐ๋ฐ˜ํ•ด ๋ฆฌ์งˆ๋ฆฌ์–ธ์Šค ์ง€์ˆ˜๋ฅผ ์ƒˆ๋กญ๊ฒŒ ์ •์‹ํ™” ํ•œ๋‹ค. ์ด ์ง€์ˆ˜๋Š” ๊ณ ์žฅ ์˜ค๊ฒฝ๋ณด๋กœ ์ธํ•œ ๋ฆฌ์งˆ๋ฆฌ์–ธ์Šค์˜ ์ €ํ•˜๋ฅผ ๋ถ„์„ํ•จ์œผ๋กœ์จ, ์ •ํ™•ํ•œ ๋ฆฌ์งˆ๋ฆฌ์–ธ์Šค ์ถ”์ •์„ ๊ฐ€๋Šฅ์ผ€ ํ•œ๋‹ค. ๋‘ ๋ฒˆ์งธ ์ฃผ์ œ๋Š” ๊ณ ์žฅ ์˜ค๊ฒฝ๋ณด๋ฅผ ๊ณ ๋ คํ•œ ๋ฆฌ์งˆ๋ฆฌ์–ธ์Šค ์ฃผ๋„ ์„ค๊ณ„ ๋ฐฉ๋ฒ•๋ก ์ด๋‹ค. ์ด๋Š” 3๋‹จ๊ณ„์˜ ๊ณ„์ธต์  ์š”์†Œ๋กœ ๊ตฌ์„ฑ๋œ๋‹ค. ๋จผ์ € ๋ชฉํ‘œ ๋ฆฌ์งˆ๋ฆฌ์–ธ์Šค ์ง€์ˆ˜๋ฅผ ๋งŒ์กฑํ•˜๋ฉด์„œ ์ƒ์• ์ฃผ๊ธฐ๋น„์šฉ์„ ์ตœ์†Œํ™”ํ•˜๊ธฐ ์œ„ํ•ด, ๋ชฉํ‘œ ์‹ ๋ขฐ๋„์™€ ๋ชฉํ‘œ ์˜ค๊ฒฝ๋ณด ๋ฐ ์œ ์‹ค๊ฒฝ๋ณด์œจ์„ ์ตœ์ ํ™”ํ•œ๋‹ค. ์ดํ›„ ์‹ ๋ขฐ์„ฑ ๊ธฐ๋ฐ˜ ์ตœ์  ์„ค๊ณ„ (reliability-based design optimization)๋ฅผ ํ†ตํ•ด ๋ชฉํ‘œ ์‹ ๋ขฐ๋„๋ฅผ ํ™•๋ณดํ•˜๊ณ , PHM ์„ค๊ณ„๋ฅผ ํ†ตํ•ด ํ• ๋‹น๋œ ๋ชฉํ‘œ ์˜ค๊ฒฝ๋ณด ๋ฐ ์œ ์‹ค๊ฒฝ๋ณด์œจ์„ ์ถฉ์กฑ์‹œํ‚จ๋‹ค. ์„ธ ๋ฒˆ์งธ ์ฃผ์ œ๋Š” ์‹œ๋ณ€(ๆ™‚่ฎŠ) ์˜ค๊ฒฝ๋ณด๋ฅผ ๊ณ ๋ คํ•œ ๋ฆฌ์งˆ๋ฆฌ์–ธ์Šค ์ฃผ๋„ ์„ค๊ณ„ ๋ฐฉ๋ฒ•๋ก ์ด๋‹ค. ๊ธฐ์กด์˜ ์„ค๊ณ„ ๋ฐฉ๋ฒ•๋ก ๋“ค์€ ์‹œ์Šคํ…œ์˜ ๊ฑด์ „์„ฑ ์ƒํƒœ๋ฅผ ์‹œ๋ถˆ๋ณ€(ๆ™‚๏ฅง่ฎŠ)ํ•˜๋‹ค ๊ฐ„์ฃผํ•˜์˜€์œผ๋‚˜, ์‹ค์ œ ์‹œ์Šคํ…œ์€ ์šดํ–‰์— ๋”ฐ๋ผ ์ ์ง„์ ์œผ๋กœ ๊ฑด์ „์„ฑ์ด ์ €ํ•˜๋œ๋‹ค. ๋ณธ ์—ฐ๊ตฌ์—์„œ๋Š” ์‹œ๋ณ€์„ฑ์„ ๋ถ„์„ํ•˜๊ธฐ ์œ„ํ•ด ์‹œ๋ณ€ ์˜ค๊ฒฝ๋ณด์œจ ๋ฐ ์œ ์‹ค๊ฒฝ๋ณด์œจ์— ๋Œ€ํ•œ ๊ฐœ๋…์„ ์ƒˆ๋กญ๊ฒŒ ์ œ์•ˆํ•˜์˜€์œผ๋ฉฐ, ์ƒ์• ์ฃผ๊ธฐ ์‹œ๋ฎฌ๋ ˆ์ด์…˜์„ ํ†ตํ•œ ์ด ์œ ์ง€๋ณด์ˆ˜ ๋น„์šฉ ๋ถ„์„ ๋ฐฉ๋ฒ•๋ก ์„ ๊ฐœ๋ฐœํ•˜์˜€๋‹ค. ์ด๋ฅผ ํ†ตํ•ด ์ƒ์• ์ฃผ๊ธฐ๋น„์šฉ์„ ๋ณด๋‹ค ์—„๋ฐ€ํ•˜๊ณ  ์ •ํ™•ํ•˜๊ฒŒ ์ถ”์ •ํ•  ์ˆ˜ ์žˆ๊ฒŒ ๋˜์—ˆ์œผ๋ฉฐ, ์ด๋ฅผ ์ตœ์†Œํ™”ํ•˜๋Š” ๋ฐฉํ–ฅ์œผ๋กœ ์‹œ์Šคํ…œ๊ณผ PHM์˜ ์„ค๊ณ„๋ฅผ ์ตœ์ ํ™”์˜€๋‹ค. ๋ณธ ์—ฐ๊ตฌ์—์„œ ์ œ์•ˆํ•œ ๋ฐฉ๋ฒ•๋ก ๋“ค์€ ์ด๋ก ์  ๋ถ„์„๊ณผ ์‚ฌ๋ก€ ์—ฐ๊ตฌ๋ฅผ ํ†ตํ•ด ๊ทธ ํšจ์šฉ์„ฑ์„ ์ž…์ฆํ•˜์˜€๋‹ค.Most engineered systems are designed with a passive and fixed design capacity and, therefore, may become unreliable in the presence of adverse events. In order to handle this issue, the resilience-driven system design (RDSD) has been proposed to make engineered systems adaptively reliable by incorporating the prognostics and health management (PHM) method. PHM tracks the health degradation of an engineered system, and provides health state information supporting decisions on condition-based maintenance. Meanwhile, one of the issues awaiting solution in the field of PHM, as well as in RDSD, is to address false alarms. A false alarm is an erroneous report on the health state of an engineered systemChapter 1. Introduction 1 1.1 Motivation 1 1.2 Research Scope and Overview 3 1.3 Dissertation Layout 6 Chapter 2. Literature Review 7 2.1 Resilience Engineering (Analysis and Design) 7 2.1.1 Resilience Analysis for Mechanical Systems 8 2.1.2 Resilience-Driven System Design (RDSD) for Mechanical Systems 15 2.2 False and Missed Alarms in Prognostics and Health Management 27 2.2.1 Definition of False and Missed Alarms 27 2.2.2 Quantification of False and Missed Alarms 32 2.3 Summary and Discussion 35 Chapter 3. Resilience Analysis Considering False Alarms 37 3.1 Resilience Measure Considering False Alarms 37 3.2 Case Studies 42 3.2.1 Numerical ample 42 3.2.2 Electro-Hydrtatic Actuator (EHA) 44 3.3 Summary and Discussion 53 Chapter 4. Resilience-Driven System Design Considering False Alarms (RDSD-FA) 55 4.1 Overview of RDSD-FA Framework 55 4.2 Resilience Allocation Problem Considering False Alarms 56 4.3 Prognostics and Health Management (PHM) Design Considering False Alarms 60 4.4 Case study: Electro-Hydrostatic Actuator (EHA) 61 4.4.1 Step 1: Resilience Allocation Considering False Alarms 61 4.4.2 Step 2: Reliability-Based Design Optimization 64 4.4.3 Step 3: PHM Design Considering False Alarms 68 4.4.4 Comparison of Design Results from RDSD and RDSD-FA 73 4.5 Summary and Discussion 75 Chapter 5. Resilience-Driven System Design Considering Time-Dependent False Alarms (RDSD-TFA) 77 5.1 Time-Dependent False and Missed Alarms in PHM 79 5.2 Resilience-Driven System Design Considering Time-Dependent False Alarms (RDSD-TFA) 83 5.2.1 Overview of RDSD-TFA Framework 83 5.2.2 Task 1: System Analysis 86 5.2.3 Task 2: PHM Analysis 89 5.2.4 Task 3: Life-Cycle Simulation 91 5.2.5 Task 4: Design Optimization 97 5.3 Case studies 98 5.3.1 Numerical Example of Life-Cycle Simulation 98 5.3.2 Electro-Hydrostatic Actuator (EHA) 107 5.4 Summary and Discussion 123 Chapter 6. Conclusions 126 6.1 Summary and Contributions 126 6.2 Suggestions for Future Research 129 References 132 Appendix 154 Abstract(Korean) 157Docto

    Bayesian parameter estimation and uncertainty quantification in fluid-dynamics models of the pulmonary circulation system

    Get PDF
    The past few decades have witnessed an explosive synergy between physics and the life sciences. In particular, physical modelling in medicine and physiology is a topical research area. The present work focuses on the inverse problem, more specifically on the parameter inference and uncertainty quantification in a 1D fluid-dynamics model for quantitative physiology: the pulmonary blood circulation. The particular application is pulmonary hypertension, requiring an analysis of the blood pressure, whose measurement in the pulmonary system can only be obtained invasively for patients. The ultimate goal is to develop a non-invasive disease diagnostication method. This could be accomplished by combining non-invasively obtained haemodynamic data (blood flow measured with MRI) with imaging data (CT scans of the lung structure), to be used in conjunction with mathematical and statistical modelling. This will provide a decision-making support mechanism in the clinic, ultimately aiding in personalised medicine. This thesis adopts a Bayesian approach to uncertainty quantification in physiological models, allowing to assess the credibility of these models. The danger with using overly confident models is that they could produce biased predictions, ultimately leading to the wrong disease diagnosis and treatment. Inference of unknown and immeasurable parameters of several 1D fluid-dynamics models, expressed through partial differential equations, is performed with Markov Chain Monte Carlo. These parameters act as bio-indicators for the disease, e.g. vessel wall stiffness, which is high in pulmonary hypertension patients. In addition, the uncertainty in the model form and the data measurement process (jointly called model mismatch) is captured, and the model mismatch is represented with Gaussian Processes. Given that the mathematical model is not a perfect representation of the reality, and that the data measurement process is prone to errors, this introduces an extra layer of uncertainty. If unaccounted for, the result is biased and overly confident parameter estimates and model predictions. Yet another source of uncertainty modelled in this study is the variability in the vessel network geometry, connectivity and size, which is shown to introduce variability in the model predictions, and must be accounted for. The uncertainty in the model parameters, model form, data measurement process and vessel network propagates through to the model predictions, which is also quantified. Lastly, this thesis is concerned with accelerating the computational efficiency of the statistical inference procedure, aiming to make the methods suitable for use in the clinic. Statistical emulation is used in conjunction with a series of efficient Hamiltonian Monte Carlo algorithms, particularly adapted to computationally expensive models. A comparative evaluation study is carried out to identify the algorithm giving the best trade-off between accuracy and efficiency on a set of representative benchmark differential equation models

    The Application of Contemporary Numerical Methods to the Modeling, Analysis, and Uncertainty Quantification of Glacier Dynamics

    Get PDF
    Warming temperatures have led to accelerating ice loss from the Greenland ice sheet, contributing to global sea level rise. Understanding the stability of the Greenland ice sheet to further warming is crucial to estimating rates of sea level rise over the next century. Estimating sea level rise is complicated by uncertainties in the physical mechanisms governing ice motion as well as uncertainties in the broader Arctic climate system of which the ice sheet is an integral part. In chapter 2, we focus on how surface melt water input to the ice sheet bed influences the rate of basal sliding, which is thought to depend on the seasonal evolution of the subglacial drainage system. Models of subglacial drainage have developed considerably in recent years. However, the recent sublglacial hydrology model intercomparison project (SHMIP), presented in Appendix A, shows that a wide gamut of models underpredict subglacial water pressure in winter when compared to borehole water pressure observations from Greenland. We investigate possible causes for this unphysical model behavior, ranging from poorly constrained model parameters to uncertainties in the physical equations for subglacial drainage. We conclude that the mismatch between modeled and observed winter water pressure can be remedied by dynamically adjusting the hydraulic conductivity parameter, which accounts for missing physics in the models describing seasonal changes in drainage system connectivity. Chapter 3 focuses on contextualizing modern climate change in the Arctic by investigating past changes in temperature and precipitation. In particular, we exploit a new chronology of ice sheet retreat in west Central Greenland, along with a novel data assimilation method based on the unscented transform (UT), to estimate changes in precipitation during the Holocene Thermal Maximum (HTM) -- a period of higher than modern temperatures that occurred some eight-thousand years ago. We demonstrate the effectiveness of the UT as a method for data assimilation and uncertainty quantification and show new evidence that the HTM was associated with greater than modern snowfall, which helped mitigate ice sheet retreat
    corecore