619 research outputs found

    Robust Algorithms for Detecting Hidden Structure in Biological Data

    Get PDF
    Biological data, such as molecular abundance measurements and protein sequences, harbor complex hidden structure that reflects its underlying biological mechanisms. For example, high-throughput abundance measurements provide a snapshot the global state of a living cell, while homologous protein sequences encode the residue-level logic of the proteins\u27 function and provide a snapshot of the evolutionary trajectory of the protein family. In this work I describe algorithmic approaches and analysis software I developed for uncovering hidden structure in both kinds of data. Clustering is an unsurpervised machine learning technique commonly used to map the structure of data collected in high-throughput experiments, such as quantification of gene expression by DNA microarrays or short-read sequencing. Clustering algorithms always yield a partitioning of the data, but relying on a single partitioning solution can lead to spurious conclusions. In particular, noise in the data can cause objects to fall into the same cluster by chance rather than due to meaningful association. In the first part of this thesis I demonstrate approaches to clustering data robustly in the presence of noise and apply robust clustering to analyze the transcriptional response to injury in a neuron cell. In the second part of this thesis I describe identifying hidden specificity determining residues (SDPs) from alignments of protein sequences descended through gene duplication from a common ancestor (paralogs) and apply the approach to identify numerous putative SDPs in bacterial transcription factors in the LacI family. Finally, I describe and demonstrate a new algorithm for reconstructing the history of duplications by which paralogs descended from their common ancestor. This algorithm addresses the complexity of such reconstruction due to indeterminate or erroneous homology assignments made by sequence alignment algorithms and to the vast prevalence of divergence through speciation over divergence through gene duplication in protein evolution

    Computational Intelligence in Healthcare

    Get PDF
    This book is a printed edition of the Special Issue Computational Intelligence in Healthcare that was published in Electronic

    Computational Intelligence in Healthcare

    Get PDF
    The number of patient health data has been estimated to have reached 2314 exabytes by 2020. Traditional data analysis techniques are unsuitable to extract useful information from such a vast quantity of data. Thus, intelligent data analysis methods combining human expertise and computational models for accurate and in-depth data analysis are necessary. The technological revolution and medical advances made by combining vast quantities of available data, cloud computing services, and AI-based solutions can provide expert insight and analysis on a mass scale and at a relatively low cost. Computational intelligence (CI) methods, such as fuzzy models, artificial neural networks, evolutionary algorithms, and probabilistic methods, have recently emerged as promising tools for the development and application of intelligent systems in healthcare practice. CI-based systems can learn from data and evolve according to changes in the environments by taking into account the uncertainty characterizing health data, including omics data, clinical data, sensor, and imaging data. The use of CI in healthcare can improve the processing of such data to develop intelligent solutions for prevention, diagnosis, treatment, and follow-up, as well as for the analysis of administrative processes. The present Special Issue on computational intelligence for healthcare is intended to show the potential and the practical impacts of CI techniques in challenging healthcare applications

    1992 NASA/ASEE Summer Faculty Fellowship Program

    Get PDF
    For the 28th consecutive year, a NASA/ASEE Summer Faculty Fellowship Program was conducted at the Marshall Space Flight Center (MSFC). The program was conducted by the University of Alabama and MSFC during the period June 1, 1992 through August 7, 1992. Operated under the auspices of the American Society for Engineering Education, the MSFC program, was well as those at other centers, was sponsored by the Office of Educational Affairs, NASA Headquarters, Washington, DC. The basic objectives of the programs, which are the 29th year of operation nationally, are (1) to further the professional knowledge of qualified engineering and science faculty members; (2) to stimulate and exchange ideas between participants and NASA; (3) to enrich and refresh the research and teaching activities of the participants' institutions; and (4) to contribute to the research objectives of the NASA centers

    Working Notes from the 1992 AAAI Spring Symposium on Practical Approaches to Scheduling and Planning

    Get PDF
    The symposium presented issues involved in the development of scheduling systems that can deal with resource and time limitations. To qualify, a system must be implemented and tested to some degree on non-trivial problems (ideally, on real-world problems). However, a system need not be fully deployed to qualify. Systems that schedule actions in terms of metric time constraints typically represent and reason about an external numeric clock or calendar and can be contrasted with those systems that represent time purely symbolically. The following topics are discussed: integrating planning and scheduling; integrating symbolic goals and numerical utilities; managing uncertainty; incremental rescheduling; managing limited computation time; anytime scheduling and planning algorithms, systems; dependency analysis and schedule reuse; management of schedule and plan execution; and incorporation of discrete event techniques

    Acta Cybernetica : Volume 19. Number 1.

    Get PDF

    High-Performance Modelling and Simulation for Big Data Applications

    Get PDF
    This open access book was prepared as a Final Publication of the COST Action IC1406 โ€œHigh-Performance Modelling and Simulation for Big Data Applications (cHiPSet)โ€œ project. Long considered important pillars of the scientific method, Modelling and Simulation have evolved from traditional discrete numerical methods to complex data-intensive continuous analytical optimisations. Resolution, scale, and accuracy have become essential to predict and analyse natural and complex systems in science and engineering. When their level of abstraction raises to have a better discernment of the domain at hand, their representation gets increasingly demanding for computational and data resources. On the other hand, High Performance Computing typically entails the effective use of parallel and distributed processing units coupled with efficient storage, communication and visualisation systems to underpin complex data-intensive applications in distinct scientific and technical domains. It is then arguably required to have a seamless interaction of High Performance Computing with Modelling and Simulation in order to store, compute, analyse, and visualise large data sets in science and engineering. Funded by the European Commission, cHiPSet has provided a dynamic trans-European forum for their members and distinguished guests to openly discuss novel perspectives and topics of interests for these two communities. This cHiPSet compendium presents a set of selected case studies related to healthcare, biological data, computational advertising, multimedia, finance, bioinformatics, and telecommunications

    Flood Management in a Complex River Basin with a Real-Time Decision Support System Based on Hydrological Forecasts

    Get PDF
    During the last decades, the Upper Rhone River basin has been hit by several flood events causing significant damages in excess of 500 million Swiss Francs. From this situation, the 3rd Rhรดne river training project was planned in order to improve the flood protection in the Upper Rhone River basin in Vaud and Valais Cantons. In this framework, the MINERVE forecast system aims to contribute to a better flow control during flood events in this catchment area, taking advantage of the existing hydropower multi-reservoir network. This system also fits into the OWARNA national project of the Swiss Federal Office of Environment by establishing a national platform on natural hazards alarms. The Upper Rhone River basin has a catchment area with high mountains and large glaciers. The surface of the basin is 5521 km2 and its elevation varies between 400 and 4634 m a.s.l. Numerous hydropower schemes with large dams and reservoirs are located in the catchment area, influencing the hydrological regime. Their impact during floods can be significant as appropriate preventive operations can decrease the peak discharges in the Rhone River and its main tributaries, thus reducing the damages. The MINERVE forecast system exploits flow measurements, data from reservoirs and hydropower plants as well as probabilistic (COSMO-LEPS) and deterministic (COSMO-2 and COSMO-7) numerical weather predictions from MeteoSwiss. The MINERVE hydrological model of the catchment area follows a semi-distributed approach. The basin is split into 239 sub-catchments which are further sub-divided into 500 m elevation bands, for a total of 1050 bands. For each elevation band, precipitation, temperature and potential evapotranspiration are calculated. They are considered in order to describe the temperature-driven processes accurately, such as snow and glaciers melt. The hydrological model was implemented in the Routing System software. The object oriented programming environment allows a user-friendly modelling of the hydrological, hydraulic and operating processes. Numerical meteorological data (observed or predicted) are introduced as input in the model. Over the calibration and validation periods of the model, only observed data (precipitation, temperature and flows) was used. For operational flood forecast, the observed measurements are used to update the initial conditions of the hydrological model and the weather forecasts for the hydrological simulations. Routing System provides then hydrological predictions in the whole catchment area. Subsequently, a warning system was developed especially for the basin to provide a flood warning report. The warning system predicts the evolution of the hydrological situation at selected main check points in the catchment area. It displays three warning levels during a flood event depending on respective critical discharge thresholds. Furthermore, the multi-reservoir system is managed in an optimal way in order to limit or avoid damages during floods. A decision support tool called MINDS (MINERVE Interactive Decision Support System) has been developed for real-time decision making based on the hydrological forecasts. This tool defines preventive operation measures for the hydropower plants such as turbine and bottom outlet releases able to provide an optimal water storage during the flood peak. The overall goal of MINDS is then to retain the inflowing floods in reservoirs and to avoid spillway and turbine operations during the peak flow, taking into account all restrictions and current conditions of the network. Such a reservoir management system can therefore significantly decrease flood damages in the catchment area. The reservoir management optimisation during floods is achieved with deterministic and probabilistic forecasts. The definition of the objective function to optimise is realised with a multi-attribute decision making approach. Then, the optimisation is performed with an iterative Greedy algorithm or a SCE-UA (Shuffled Complex Evolution โ€“ University of Arizona) algorithm. The developed decision support system combines the high-quality optimisation system with its user-friendly interface. The purpose is to help decision makers by being directly involve in main steps of the decision making process as well as by understanding the measures undertaken and their consequences

    ํ•˜์ฒœ ์˜ค์—ผ๋ฌผ์งˆ ํ˜ผํ•ฉ ํ•ด์„์„ ์œ„ํ•œ ์ €์žฅ๋Œ€ ๋ชจํ˜•์˜ ๋งค๊ฐœ๋ณ€์ˆ˜ ์‚ฐ์ •๋ฒ• ๋ฐ ๊ฒฝํ—˜์‹ ๊ฐœ๋ฐœ

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ(์„์‚ฌ)--์„œ์šธ๋Œ€ํ•™๊ต ๋Œ€ํ•™์› :๊ณต๊ณผ๋Œ€ํ•™ ๊ฑด์„คํ™˜๊ฒฝ๊ณตํ•™๋ถ€,2019. 8. ์„œ์ผ์›.Analyses of solute transport and retention mechanism are essential to manage water quality and river ecosystem. As reported by tracer injection studies that have been conducted to identify solute transport mechanism, concentration curves measured in natural stream have steep rising and long tail parts. This phenomenon is due to solute exchange process between transient storage zones and the main river stream. The transient storage model (TSM) is one of the most widely used models for describing solute transport in natural stream, taking transient storage exchange process into consideration. In order to use this model, calibration of four TSM parameters is necessary. Inverse modelling using measured breakthrough curves (BTCs) from tracer injection test is general method for TSM parameter calibration. However, it is not feasible to carry out performing tracer injection tests, for every parameter calibration. For that reasons, empirical formulae with hydraulic data, which is comparatively easier to obtain, have been proposed for the purpose of parameter estimation. This study presents two methods for TSM parameter estimation. At first, inverse modelling method employing global optimization framework Shuffled Complex-Self Adaptive Hybrid EvoLution (SC-SAHEL), that incorporating famous evolutionary algorithms in water resource management field, was suggested. Second, TSM parameter empirical equations were derived adopting Multigene Genetic Programming (MGGP) based symbolic regression library GPTIPS and using Principal Components Regression (PCR). In terms of general performance, equations of this study were superior to published empirical equations.ํ•˜์ฒœ์˜ ์ˆ˜์งˆ์„ ๊ด€๋ฆฌํ•˜๊ธฐ ์œ„ํ•ด์„œ๋Š” ์ž์—ฐํ•˜์ฒœ์—์„œ ์œ ์ž…๋œ ๋ฌผ์งˆ์ด ์ด์†ก๋˜๊ณ  ์ง€์ฒด๋˜๋Š” ๋ฉ”์นด๋‹ˆ์ฆ˜์„ ๊ทœ๋ช…ํ•˜๊ณ  ์ดํ•ดํ•˜๋Š” ๊ฒƒ์ด ํ•„์š”ํ•˜๋‹ค. ํ•˜์ฒœ์—์„œ์˜ ๋ฌผ์งˆ ํ˜ผํ•ฉ์„ ์ดํ•ดํ•˜๊ธฐ ์œ„ํ•ด ์ˆ˜ํ–‰๋œ ์ถ”์ ์ž ์‹คํ—˜ ์—ฐ๊ตฌ๋“ค์— ๋”ฐ๋ฅด๋ฉด ์ž์—ฐํ•˜์ฒœ์—์„œ ๊ณ„์ธก๋˜๋Š” ๋†๋„๊ณก์„ ์—์„œ๋Š” ๊ฐ€ํŒŒ๋ฅธ ์ƒ์Šน๋ถ€์™€ ๊ธด ๊ผฌ๋ฆฌ๊ธฐ ๊ด€์ธก๋˜๋Š” ๊ฒƒ์œผ๋กœ ์•Œ๋ ค์กŒ๋‹ค. ์ด๋Ÿฌํ•œ ํ˜„์ƒ์€ ์ฃผ๋กœ ๋ฌผ์งˆ์ด ํ๋ฅด๋Š” ๋ณธ๋ฅ˜๋Œ€์™€ ์ž ์‹œ ๋ฌผ์งˆ์ด ํฌํš๋˜์—ˆ๋‹ค๊ฐ€ ์žฌ๋ฐฉ์ถœ๋˜๋Š” ๋ณธ๋ฅ˜๋Œ€์™€ ์ €์žฅ๋Œ€ ๊ฐ„์˜ ๋ฌผ์งˆ๊ตํ™˜ ํšจ๊ณผ ๋•Œ๋ฌธ์— ์ผ์–ด๋‚œ๋‹ค๊ณ  ์•Œ๋ ค์ ธ ์žˆ๋‹ค. ์ด๋Ÿฌํ•œ ์ €์žฅ๋Œ€ ๋ฌผ์งˆ๊ตํ™˜ ํšจ๊ณผ๋ฅผ ๋ชจ์‚ฌํ•˜๋Š” ์ €์žฅ๋Œ€๋ชจํ˜• ์ค‘ Transient Storage zone Model (TSM)์€ ๊ฐ€์žฅ ๊ด‘๋ฒ”์œ„ํ•˜๊ฒŒ ์ด์šฉ๋˜๋Š” ๋ชจํ˜•์œผ๋กœ, ์ด๋ฅผ ์ด์šฉํ•˜๊ธฐ ์œ„ํ•ด์„  ๋„ค ๊ฐ€์ง€์˜ ์ €์žฅ๋Œ€ ๋งค๊ฐœ๋ณ€์ˆ˜๋ฅผ ๋ณด์ •ํ•˜์—ฌ์•ผ ํ•œ๋‹ค. ๋„ค ๊ฐ€์ง€ ์ €์žฅ๋Œ€ ๋งค๊ฐœ๋ณ€์ˆ˜๋ฅผ ๊ฒฐ์ •ํ•˜๋Š” ๋ฐฉ๋ฒ•์œผ๋กœ๋Š” ์ผ๋ฐ˜์ ์œผ๋กœ ํ˜„์žฅ์‹คํ—˜์—์„œ ์ธก์ •๋œ ๋†๋„๊ณก์„ ์„ ์ด์šฉํ•œ ์—ญ์‚ฐ๋ชจํ˜•์ด ์ด์šฉ๋œ๋‹ค. ๊ทธ๋Ÿฌ๋‚˜ ๋งค๊ฐœ๋ณ€์ˆ˜๊ฐ€ ํ•„์š”ํ•  ๋•Œ๋งˆ๋‹ค ์ถ”์ ์ž์‹คํ—˜์„ ์ˆ˜ํ–‰ํ•˜์—ฌ ์—ญ์‚ฐ๋ชจํ˜•์„ ์ด์šฉํ•˜๋Š” ๊ฒƒ์€ ํ˜„์‹ค์ ์œผ๋กœ ๋ถˆ๊ฐ€๋Šฅํ•œ ๊ฒฝ์šฐ๊ฐ€ ์žˆ์–ด ์ด๋Ÿฌํ•œ ๊ฒฝ์šฐ์—๋Š” ๋น„๊ต์  ์ทจ๋“ํ•˜๊ธฐ ์‰ฌ์šด ์ˆ˜๋ฆฌ์ง€ํ˜•ํ•™์  ์ธ์ž๋“ค์„ ์ด์šฉํ•ด ๋งค๊ฐœ๋ณ€์ˆ˜๋ฅผ ์‚ฐ์ •ํ•˜๋Š” ๋ฐฉ๋ฒ•์ด ์ด์šฉ๋  ์ˆ˜ ์žˆ๋‹ค. ๋”ฐ๋ผ์„œ ๋ณธ ์—ฐ๊ตฌ์—์„œ๋Š” TSM ๋งค๊ฐœ๋ณ€์ˆ˜๋ฅผ ๊ฒฐ์ •ํ•˜๊ธฐ ์œ„ํ•ด ๋‘ ๊ฐ€์ง€ ๋ฐฉ๋ฒ•์„ ์ œ์‹œํ•˜์˜€๋‹ค. ์ฒซ ๋ฒˆ์งธ๋กœ, ์ „์—ญ ์ตœ์ ํ™” ํ”„๋ ˆ์ž„์›Œํฌ์ธ Shuffled Complex-Self Adaptive Hybrid EvoLution (SC-SAHEL)์„ ์ด์šฉํ•œ ์—ญ์‚ฐ๋ชจํ˜• ๊ธฐ๋ฐ˜ TSM ๋งค๊ฐœ๋ณ€์ˆ˜ ์‚ฐ์ • ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์‹œํ•˜์˜€๋‹ค. ๋‘˜์งธ๋กœ๋Š” ๊ธฐํ˜ธํšŒ๊ท€๋ฒ• ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ์ธ GPTIPS๋ฅผ ์ด์šฉํ•œ ๋‹ค์ค‘์œ ์ „์ž ์œ ์ „ ํ”„๋กœ๊ทธ๋ž˜๋ฐ(Multigene Genetic Programming, MGGP) ๊ณผ ์ฃผ์„ฑ๋ถ„ํšŒ๊ท€๋ฒ•(Principal Components Regression, PCR)์„ ํ†ตํ•ด ๋„ค ๊ฐ€์ง€ ๋งค๊ฐœ๋ณ€์ˆ˜ ๋ณ„๋กœ ๊ฐ ๋‘ ๊ฐœ์”ฉ์˜ ๊ฒฝํ—˜์‹์ด ๊ฐœ๋ฐœ๋˜์—ˆ๋‹ค. ๊ฐœ๋ฐœ๋œ ๊ฒฝํ—˜์‹๋“ค์˜ ์„ฑ๋Šฅํ‰๊ฐ€ ๊ฒฐ๊ณผ, ์„ ํ–‰ ์—ฐ๊ตฌ์—์„œ ์ œ์‹œ๋œ ์ €์žฅ๋Œ€ ๋งค๊ฐœ๋ณ€์ˆ˜ ์‹์— ๋น„ํ•ด ๋ณธ ์—ฐ๊ตฌ์—์„œ ์ œ์‹œ๋œ ๋ฐฉ๋ฒ•์ด ๋Œ€์ฒด์ ์œผ๋กœ ์šฐ์ˆ˜ํ•œ ๊ฒƒ์œผ๋กœ ๋‚˜ํƒ€๋‚ฌ๋‹ค. ๊ฒฐ๊ณผ์ ์œผ๋กœ ๋ณธ ์—ฐ๊ตฌ์—์„œ๋Š” ๋ถ„์„์„ ํ†ตํ•ด ์‹ค๋ฌด์ ์œผ๋กœ ํ™œ์šฉ ๊ฐ€๋Šฅํ•œ TSM ๋งค๊ฐœ๋ณ€์ˆ˜ ์‚ฐ์ • ํ”„๋ ˆ์ž„์›Œํฌ์™€ ๊ฒฝํ—˜์‹๋“ค์ด ์ œ์‹œ๋˜์—ˆ์œผ๋ฉฐ, ์ด ๋ฐฉ๋ฒ•๋“ค์€ ์ถ”์ ์ž ์‹คํ—˜ ์ž๋ฃŒ์˜ ์œ ๋ฌด์— ๋”ฐ๋ผ TSM์˜ ๋งค๊ฐœ๋ณ€์ˆ˜ ๊ฒฐ์ •์— ์œ ์šฉํ•˜๊ฒŒ ์‚ฌ์šฉ๋  ๊ฒƒ์œผ๋กœ ๊ธฐ๋Œ€๋œ๋‹ค.Chapter 1. Introduction 1 1.1 Necessity and Background of Research 1 1.2 Objectives 12 Chapter 2. Theoretical Background 15 2.1 Transient Storage Model 15 2.1.1. Mechanisms of Transient Storage 15 2.1.2. Models Accounting for Transient Storage 21 2.1.2.1 The one Zone Transient Storage Model (1Z-TSM) 24 2.1.2.2 The two Zone Transient Storage Model (2Z-TSM) 25 2.1.2.3 The Continuous Time Random Walk Approach (CTRW) 26 2.1.2.4 The Modified Advection Dispersion Model (MADE) 27 2.1.2.5 The Fractional Advection Dispersion Equation Model (FADE) 28 2.1.2.6 The Multirate Mass Transfer Model (MRMT) 29 2.1.2.7 The Advective Storage Path Model (ASP) 30 2.1.2.8 The Solute Transport in Rivers Model (STIR) 31 2.1.2.9 The Aggregate Dead Zone Model (ADZ) 34 2.2 Empirical Equations for Predicting Transient Storage Model Parameters 39 2.3 Parameter Estimation 47 2.3.1. The SC-SAHEL Framework 50 2.3.1.1 Modified Competitive Complex Evolution (MCCE) 52 2.3.1.2 Modified Frog Leaping (MFL) 52 2.3.1.3 Modified Grey Wolf Optimizer (GWO) 53 2.3.1.4 Modified Differential Evolution (DE) 53 2.4 Regression Method 54 2.4.1. The Multi-Gene Genetic Programming (MGGP) 56 2.4.1.1 The Simple Genetic Programming 56 2.4.1.2 Scaled Symbolic Regression via Multi-Gene Genetic Programming 57 2.4.2. Evolutionary Polynomial Regression (EPR) 61 2.4.2.1 Main Flow of EPR Procedure 62 Chapter 3. Model Development 66 3.1 Numerical Model 66 3.1.1. Model Validation 69 3.2 Merger of TSM-SC-SAHEL 73 3.3 Further assessments for the parameter estimation framework 76 3.3.1. Tracer Test Description 76 3.3.2. Grid Independency of Estimation 81 3.3.3. Choice of Optimization Setting 85 Chapter 4. Development of Formulae for Predicting TSM Parameter 91 4.1 Dimensional Analysis 91 4.2 Data Collection via Meta Analysis 95 4.3 Formulae Development 106 Chapter 5. Result and Discussion 110 5.1 Model Performances 110 5.2 Sensitivity Analysis 118 5.3 In-stream Application of Empirical Equations 130 Chapter 6. Conclusion 140 References 144 Appendix. I. The mean, minimum, and maximum values of the model fitness value and number of evolution using the SC-SAHEL with single-EA and multi-EA 159 Appendix. II. Used dimensionless datasets for development of empirical equations 161 ๊ตญ๋ฌธ์ดˆ๋ก 165Maste

    P5 eHealth: An Agenda for the Health Technologies of the Future

    Get PDF
    This open access volume focuses on the development of a P5 eHealth, or better, a methodological resource for developing the health technologies of the future, based on patientsโ€™ personal characteristics and needs as the fundamental guidelines for design. It provides practical guidelines and evidence based examples on how to design, implement, use and elevate new technologies for healthcare to support the management of incurable, chronic conditions. The volume further discusses the criticalities of eHealth, why it is difficult to employ eHealth from an organizational point of view or why patients do not always accept the technology, and how eHealth interventions can be improved in the future. By dealing with the state-of-the-art in eHealth technologies, this volume is of great interest to researchers in the field of physical and mental healthcare, psychologists, stakeholders and policymakers as well as technology developers working in the healthcare sector
    • โ€ฆ
    corecore