2,906 research outputs found

    NiftyNet: a deep-learning platform for medical imaging

    Get PDF
    Medical image analysis and computer-assisted intervention problems are increasingly being addressed with deep-learning-based solutions. Established deep-learning platforms are flexible but do not provide specific functionality for medical image analysis and adapting them for this application requires substantial implementation effort. Thus, there has been substantial duplication of effort and incompatible infrastructure developed across many research groups. This work presents the open-source NiftyNet platform for deep learning in medical imaging. The ambition of NiftyNet is to accelerate and simplify the development of these solutions, and to provide a common mechanism for disseminating research outputs for the community to use, adapt and build upon. NiftyNet provides a modular deep-learning pipeline for a range of medical imaging applications including segmentation, regression, image generation and representation learning applications. Components of the NiftyNet pipeline including data loading, data augmentation, network architectures, loss functions and evaluation metrics are tailored to, and take advantage of, the idiosyncracies of medical image analysis and computer-assisted intervention. NiftyNet is built on TensorFlow and supports TensorBoard visualization of 2D and 3D images and computational graphs by default. We present 3 illustrative medical image analysis applications built using NiftyNet: (1) segmentation of multiple abdominal organs from computed tomography; (2) image regression to predict computed tomography attenuation maps from brain magnetic resonance images; and (3) generation of simulated ultrasound images for specified anatomical poses. NiftyNet enables researchers to rapidly develop and distribute deep learning solutions for segmentation, regression, image generation and representation learning applications, or extend the platform to new applications.Comment: Wenqi Li and Eli Gibson contributed equally to this work. M. Jorge Cardoso and Tom Vercauteren contributed equally to this work. 26 pages, 6 figures; Update includes additional applications, updated author list and formatting for journal submissio

    The structure and stability of persistence modules

    Full text link
    We give a self-contained treatment of the theory of persistence modules indexed over the real line. We give new proofs of the standard results. Persistence diagrams are constructed using measure theory. Linear algebra lemmas are simplified using a new notation for calculations on quiver representations. We show that the stringent finiteness conditions required by traditional methods are not necessary to prove the existence and stability of the persistence diagram. We introduce weaker hypotheses for taming persistence modules, which are met in practice and are strong enough for the theory still to work. The constructions and proofs enabled by our framework are, we claim, cleaner and simpler.Comment: New version. We discuss in greater depth the interpolation lemma for persistence module

    Methodology for high resolution spatial analysis of the physical flood susceptibility of buildings in large river floodplains

    Get PDF
    The impacts of floods on buildings in urban areas are increasing due to the intensification of extreme weather events, unplanned or uncontrolled settlements and the rising vulnerability of assets. There are some approaches available for assessing the flood damage to buildings and critical infrastructure. To this point, however, it is extremely difficult to adapt these methods widely, due to the lack of high resolution classification and characterisation approaches for built structures. To overcome this obstacle, this work presents: first, a conceptual framework for understanding the physical flood vulnerability and the physical flood susceptibility of buildings, second, a methodological framework for the combination of methods and tools for a large-scale and high-resolution analysis and third, the testing of the methodology in three pilot sites with different development conditions. The conceptual framework narrows down an understanding of flood vulnerability, physical flood vulnerability and physical flood susceptibility and its relation to social and economic vulnerabilities. It describes the key features causing the physical flood susceptibility of buildings as a component of the vulnerability. The methodological framework comprises three modules: (i) methods for setting up a building topology, (ii) methods for assessing the susceptibility of representative buildings of each building type and (iii) the integration of the two modules with technological tools. The first module on the building typology is based on a classification of remote sensing data and GIS analysis involving seven building parameters, which appeared to be relevant for a classification of buildings regarding potential flood impacts. The outcome is a building taxonomic approach. A subsequent identification of representative buildings is based on statistical analyses and membership functions. The second module on the building susceptibility for representative buildings bears on the derivation of depth-physical impact functions. It relates the principal building components, including their heights, dimensions and materials, to the damage from different water levels. The material’s susceptibility is estimated based on international studies on the resistance of building materials and a fuzzy expert analysis. Then depth-physical impact functions are calculated referring to the principal components of the buildings which can be affected by different water levels. Hereby, depth-physical impact functions are seen as a means for the interrelation between the water level and the physical impacts. The third module provides the tools for implementing the methodology. This tool compresses the architecture for feeding the required data on the buildings with their relations to the building typology and the building-type specific depth-physical impact function supporting the automatic process. The methodology is tested in three flood plains pilot sites: (i) in the settlement of the Barrio Sur in MaganguĂ© and (ii) in the settlement of La Peña in Cicuco located on the flood plain of Magdalena River, Colombia and (iii) in a settlement of the city of Dresden, located on the Elbe River, Germany. The testing of the methodology covers the description of data availability and accuracy, the steps for deriving the depth-physical impact functions of representative buildings and the final display of the spatial distribution of the physical flood susceptibility. The discussion analyses what are the contributions of this work evaluating the findings of the methodology’s testing with the dissertation goals. The conclusions of the work show the contributions and limitations of the research in terms of methodological and empirical advancements and the general applicability in flood risk management.:1 INTRODUCTION 1 1.1 Background 1 1.2 State of the art 2 1.3 Problem statement 6 1.4 Objectives 6 1.5 Approach and outline 6 2 CONCEPTUAL FRAMEWORK 9 2.1 Flood vulnerability 10 2.2 Physical flood vulnerability 12 2.3 Physical flood susceptibility 14 3 METHODOLOGICAL FRAMEWORK 23 3.1 Module 1: Building taxonomy for settlements 24 3.1.1 Extraction of building features 24 3.1.2 Derivation of building parameters for setting up a building taxonomy 38 3.1.3 Selection of representative buildings for a building susceptibility assessment 51 3.2 Module 2: Physical susceptibility of representative buildings 57 3.2.1 Identification of building components 57 3.2.2 Qualification of building material susceptibility 62 3.2.3 Derivation of a depth-physical impact function 71 3.3 Module 3: Technological integration 77 3.3.1 Combination of the depth-physical impact function with the building taxonomic code 77 3.3.2 Tools supporting the physical susceptibility analysis 78 3.3.3 The users and their requirements 79 4 RESULTS OF THE METHODOLOGY TESTING 83 4.1 Pilot site “Kleinzschachwitz” – Dresden, Germany – Elbe River 83 4.1.1 Module 1: Building taxonomy – “Kleinzschachwitz” 85 4.1.2 Module 2: Physical susceptibility of representative buildings – “Kleinzschachwitz” 97 4.1.3 Module 3: Technological integration – “Kleinzschachwitz” 103 4.2 Pilot site “La Peña” – Cicuco, Colombia – Magdalena River 107 4.2.1 Module 1: Building taxonomy – “La Peña” 108 4.2.2 Module 2: Physical susceptibility of representative buildings – “La Peña” 121 4.2.3 Module 3: Technological integration– “La Peña” 129 4.3 Pilot site “Barrio Sur” – MaganguĂ©, Colombia – Magdalena River 133 4.3.1 Module 1: Building taxonomy – “Barrio Sur” 133 4.3.2 Module 2: Physical susceptibility of representative buildings – “Barrio Sur” 141 4.3.3 Module 3: Technological integration – “Barrio Sur” 147 4.4 Empirical findings 151 4.4.1 Empirical findings of Module 1 151 4.4.2 Empirical findings of Module 2 155 4.4.3 Empirical findings of Module 3 157 4.4.4 Guidance of the methodology 157 5 DISCUSSION 161 5.1 Discussion on the conceptual framework 161 5.2 Discussion on the methodological framework 161 5.2.1 Discussion on Module 1: the building taxonomic approach 162 5.2.2 Discussion on Module 2: the depth-physical impact function 164 6 CONCLUSIONS AND OUTLOOK 167 6.1 Conclusions 167 6.2 Outlook 168 REFERENCES 171 INDEX OF FIGURES 199 INDEX OF TABLES 201 APPENDICES 203In vielen StĂ€dten nehmen die Auswirkungen von Hochwasser auf GebĂ€ude aufgrund immer extremerer Wetterereignisse, unkontrollierbarer Siedlungsbauten und der steigenden VulnerabilitĂ€t von BesitztĂŒmern stetig zu. Es existieren zwar bereits AnsĂ€tze zur Beurteilung von WasserschĂ€den an GebĂ€uden und Infrastrukturknotenpunkten. Doch ist es bisher schwierig, diese Methoden großrĂ€umig anzuwenden, da es an einer prĂ€zisen Klassifizierung und Charakterisierung von GebĂ€uden und anderen baulichen Anlagen fehlt. Zu diesem Zweck sollen in dieser Arbeit erstens ein Konzept fĂŒr ein genaueres VerstĂ€ndnis der physischen VulnerabilitĂ€t von GebĂ€uden gegenĂŒber Hochwasser dargelegt, zweitens ein methodisches Verfahren zur Kombination der bestehenden Methoden und Hilfsmittel mit dem Ziel einer großrĂ€umigen und hochauflösenden Analyse erarbeitet und drittens diese Methode an drei Pilotstandorten mit unterschiedlichem Ausbauzustand erprobt werden. Die Rahmenbedingungen des Konzepts grenzen die Begriffe der VulnerabilitĂ€t, der physischen VulnerabilitĂ€t und der physischen AnfĂ€lligkeit gegenĂŒber Hochwasser ein und erörtern deren Beziehung zur sozialen und ökonomischen VulnerabilitĂ€t. Es werden die Merkmale der physischen AnfĂ€lligkeit von GebĂ€uden gegenĂŒber Hochwasser als Bestandteil der VulnerabilitĂ€t definiert. Das methodische Verfahren umfasst drei Module: (i) Methoden zur Erstellung einer GebĂ€udetypologie, (ii) Methoden zur Bewertung der AnfĂ€lligkeit reprĂ€sentativer GebĂ€ude jedes GebĂ€udetyps und (iii) die Kombination der beiden Module mit Hilfe technologischer Hilfsmittel. Das erste Modul zur GebĂ€udetypologie basiert auf der Klassifizierung von Fernerkundungsdaten und GIS-Analysen anhand von sieben GebĂ€udeparametern, die sich fĂŒr die Klassifizierung von GebĂ€uden bezĂŒglich ihres Risikopotenzials bei Hochwasser als wichtig erweisen. Daraus ergibt sich ein Ansatz zur GebĂ€udeklassifizierung. Die anschließende Ermittlung reprĂ€sentativer GebĂ€ude beruht auf statistischen Analysen und Zugehörigkeitsfunktionen. Das zweite Modul zur AnfĂ€lligkeit reprĂ€sentativer GebĂ€ude beruht auf der Ableitung von Funktion von Wasserstand und physischer Einwirkung. Es setzt die relevanten GebĂ€udemerkmale, darunter Höhe, Maße und Materialien, in Beziehung zum erwartbaren Schaden bei unterschiedlichen WasserstĂ€nden. Die MaterialanfĂ€lligkeit wird aufgrund internationaler Studien zur Festigkeit von Baustoffen sowie durch Anwendung eines Fuzzy-Logic-Expertensystems eingeschĂ€tzt. Anschließend werden Wasserstand-Schaden-Funktionen unter Einbeziehung der HauptgebĂ€udekomponenten berechnet, die durch unterschiedliche WasserstĂ€nde in Mitleidenschaft gezogen werden können. Funktion von Wasserstand und physischer Einwirkung dienen hier dazu, den jeweiligen Wasserstand und die physischen Auswirkung in Beziehung zueinander zu setzen. Das dritte Modul stellt die zur Umsetzung der Methoden notwendigen Hilfsmittel vor. Zur UnterstĂŒtzung des automatisierten Verfahrens dienen Hilfsmittel, die die GebĂ€udetypologie mit der Funktion von Wasserstand und physischer Einwirkung fĂŒr GebĂ€ude in Hochwassergebieten kombinieren. Die Methoden wurden anschließend in drei hochwassergefĂ€hrdeten Pilotstandorten getestet: (i) in den Siedlungsgebieten von Barrio Sur in MaganguĂ© und (ii) von La Pena in Cicuco, zwei Überschwemmungsgebiete des Magdalenas in Kolumbien, und (iii) im Stadtgebiet von Dresden, das an der Elbe liegt. Das Testverfahren umfasst die Beschreibung der DatenverfĂŒgbarkeit und genauigkeit, die einzelnen Schritte zur Analyse der. Funktion von Wasserstand und physischer Einwirkung reprĂ€sentativer GebĂ€ude sowie die Darstellung der rĂ€umlichen Verteilung der physischen AnfĂ€lligkeit fĂŒr Hochwasser. In der Diskussion wird der Beitrag dieser Arbeit zur Beurteilung der Erkenntnisse der getesteten Methoden anhand der Ziele dieser Dissertation analysiert. Die Folgerungen beleuchten abschließend die Fortschritte und auch Grenzen der Forschung hinsichtlich methodischer und empirischer Entwicklungen sowie deren allgemeine Anwendbarkeit im Bereich des Hochwasserschutzes.:1 INTRODUCTION 1 1.1 Background 1 1.2 State of the art 2 1.3 Problem statement 6 1.4 Objectives 6 1.5 Approach and outline 6 2 CONCEPTUAL FRAMEWORK 9 2.1 Flood vulnerability 10 2.2 Physical flood vulnerability 12 2.3 Physical flood susceptibility 14 3 METHODOLOGICAL FRAMEWORK 23 3.1 Module 1: Building taxonomy for settlements 24 3.1.1 Extraction of building features 24 3.1.2 Derivation of building parameters for setting up a building taxonomy 38 3.1.3 Selection of representative buildings for a building susceptibility assessment 51 3.2 Module 2: Physical susceptibility of representative buildings 57 3.2.1 Identification of building components 57 3.2.2 Qualification of building material susceptibility 62 3.2.3 Derivation of a depth-physical impact function 71 3.3 Module 3: Technological integration 77 3.3.1 Combination of the depth-physical impact function with the building taxonomic code 77 3.3.2 Tools supporting the physical susceptibility analysis 78 3.3.3 The users and their requirements 79 4 RESULTS OF THE METHODOLOGY TESTING 83 4.1 Pilot site “Kleinzschachwitz” – Dresden, Germany – Elbe River 83 4.1.1 Module 1: Building taxonomy – “Kleinzschachwitz” 85 4.1.2 Module 2: Physical susceptibility of representative buildings – “Kleinzschachwitz” 97 4.1.3 Module 3: Technological integration – “Kleinzschachwitz” 103 4.2 Pilot site “La Peña” – Cicuco, Colombia – Magdalena River 107 4.2.1 Module 1: Building taxonomy – “La Peña” 108 4.2.2 Module 2: Physical susceptibility of representative buildings – “La Peña” 121 4.2.3 Module 3: Technological integration– “La Peña” 129 4.3 Pilot site “Barrio Sur” – MaganguĂ©, Colombia – Magdalena River 133 4.3.1 Module 1: Building taxonomy – “Barrio Sur” 133 4.3.2 Module 2: Physical susceptibility of representative buildings – “Barrio Sur” 141 4.3.3 Module 3: Technological integration – “Barrio Sur” 147 4.4 Empirical findings 151 4.4.1 Empirical findings of Module 1 151 4.4.2 Empirical findings of Module 2 155 4.4.3 Empirical findings of Module 3 157 4.4.4 Guidance of the methodology 157 5 DISCUSSION 161 5.1 Discussion on the conceptual framework 161 5.2 Discussion on the methodological framework 161 5.2.1 Discussion on Module 1: the building taxonomic approach 162 5.2.2 Discussion on Module 2: the depth-physical impact function 164 6 CONCLUSIONS AND OUTLOOK 167 6.1 Conclusions 167 6.2 Outlook 168 REFERENCES 171 INDEX OF FIGURES 199 INDEX OF TABLES 201 APPENDICES 203El impacto de las inundaciones sobre los edificios en zonas urbanas es cada vez mayor debido a la intensificaciĂłn de los fenĂłmenos meteorolĂłgicos extremos, asentamientos no controlados o no planificados y su creciente vulnerabilidad. Hay mĂ©todos disponibles para evaluar los daños por inundaciĂłn en edificios e infraestructuras crĂ­ticas. Sin embargo, es muy difĂ­cil implementar estos mĂ©todos sistemĂĄticamente en grandes ĂĄreas debido a la falta de clasificaciĂłn y caracterizaciĂłn de estructuras construidas en resoluciones detalladas. Para superar este obstĂĄculo, este trabajo se enfoca, en primer lugar, en desarrollar un marco conceptual para comprender la vulnerabilidad y susceptibilidad fĂ­sica de edificios por inudaciones, en segundo lugar, en desarrollar un marco metodolĂłgico para la combinaciĂłn de los mĂ©todos y herramientas para una anĂĄlisis de alta resoluciĂłn y en tercer lugar, la prueba de la metodologĂ­a en tres sitios experimentales, con distintas condiciones de desarrollo. El marco conceptual se enfoca en comprender la vulnerabilidad y susceptibility de las edificaciones frente a inundaciones, y su relaciĂłn con la vulnerabilidad social y econĂłmica. En Ă©l se describen las principales caracterĂ­sticas fĂ­sicas de la susceptibilidad de edificicaiones como un componente de la vulnerabilidad. El marco metodolĂłgico consta de tres mĂłdulos: (i) mĂ©todos para la derivaciĂłn de topologĂ­a de construcciones, (ii) mĂ©todos para evaluar la susceptibilidad de edificios representativos y (iii) la integraciĂłn de los dos mĂłdulos a travĂ©s herramientas tecnolĂłgicas. El primer mĂłdulo de topologĂ­a de construcciones se basa en una clasificaciĂłn de datos de sensoramiento rĂ©moto y procesamiento SIG para la extracciĂłn de siete parĂĄmetros de las edficaciones. Este mĂłdulo parece ser aplicable para una clasificaciĂłn de los edificios en relaciĂłn con los posibles impactos de las inundaciones. El resultado es una taxonomĂ­a de las edificaciones y una posterior identificaciĂłn de edificios representativos que se basa en anĂĄlisis estadĂ­sticos y funciones de pertenencia. El segundo mĂłdulo consiste en el anĂĄlisis de susceptibilidad de las construcciones representativas a travĂ©s de funciones de profundidad del impacto fĂ­sico. Las cuales relacionan los principales componentes de la construcciĂłn, incluyendo sus alturas, dimensiones y materiales con los impactos fĂ­sicos a diferentes niveles de agua. La susceptibilidad del material se calcula con base a estudios internacionales sobre la resistencia de los materiales y un anĂĄlisis a travĂ©s de sistemas expertos difusos. AquĂ­, las funciones de profundidad de impacto fĂ­sico son considerados como un medio para la interrelaciĂłn entre el nivel del agua y los impactos fĂ­sicos. El tercer mĂłdulo proporciona las herramientas necesarias para la aplicaciĂłn de la metodologĂ­a. Estas herramientas tecnolĂłgicas consisten en la arquitectura para la alimentaciĂłn de los datos relacionados a la tipologĂ­a de construcciones con las funciones de profundidad del impacto fĂ­sico apoyado en procesos automĂĄticos. La metodologĂ­a es probada en tres sitios piloto: (i) en el Barrio Sur en MaganguĂ© y (ii) en la barrio de La Peña en Cicuco situado en la llanura inundable del RĂ­o Magdalena, Colombia y (iii) en barrio Kleinzschachwitz de la ciudad de Dresden, situado a orillas del rĂ­o Elba, en Alemania. Las pruebas de la metodologĂ­a abarca la descripciĂłn de la disponibilidad de los datos y la precisiĂłn, los pasos a seguir para obtener las funciones profundidad de impacto fĂ­sico de edificios representativos y la presentaciĂłn final de la distribuciĂłn espacial de la susceptibilidad fĂ­sica frente inundaciones El discusiĂłn analiza las aportaciones de este trabajo y evalua los resultados de la metodologĂ­a con relaciĂłn a los objetivos. Las conclusiones del trabajo, muestran los aportes y limitaciones de la investigaciĂłn en tĂ©rminos de avances metodolĂłgicos y empĂ­ricos y la aplicabilidad general de gestiĂłn del riesgo de inundaciones.:1 INTRODUCTION 1 1.1 Background 1 1.2 State of the art 2 1.3 Problem statement 6 1.4 Objectives 6 1.5 Approach and outline 6 2 CONCEPTUAL FRAMEWORK 9 2.1 Flood vulnerability 10 2.2 Physical flood vulnerability 12 2.3 Physical flood susceptibility 14 3 METHODOLOGICAL FRAMEWORK 23 3.1 Module 1: Building taxonomy for settlements 24 3.1.1 Extraction of building features 24 3.1.2 Derivation of building parameters for setting up a building taxonomy 38 3.1.3 Selection of representative buildings for a building susceptibility assessment 51 3.2 Module 2: Physical susceptibility of representative buildings 57 3.2.1 Identification of building components 57 3.2.2 Qualification of building material susceptibility 62 3.2.3 Derivation of a depth-physical impact function 71 3.3 Module 3: Technological integration 77 3.3.1 Combination of the depth-physical impact function with the building taxonomic code 77 3.3.2 Tools supporting the physical susceptibility analysis 78 3.3.3 The users and their requirements 79 4 RESULTS OF THE METHODOLOGY TESTING 83 4.1 Pilot site “Kleinzschachwitz” – Dresden, Germany – Elbe River 83 4.1.1 Module 1: Building taxonomy – “Kleinzschachwitz” 85 4.1.2 Module 2: Physical susceptibility of representative buildings – “Kleinzschachwitz” 97 4.1.3 Module 3: Technological integration – “Kleinzschachwitz” 103 4.2 Pilot site “La Peña” – Cicuco, Colombia – Magdalena River 107 4.2.1 Module 1: Building taxonomy – “La Peña” 108 4.2.2 Module 2: Physical susceptibility of representative buildings – “La Peña” 121 4.2.3 Module 3: Technological integration– “La Peña” 129 4.3 Pilot site “Barrio Sur” – MaganguĂ©, Colombia – Magdalena River 133 4.3.1 Module 1: Building taxonomy – “Barrio Sur” 133 4.3.2 Module 2: Physical susceptibility of representative buildings – “Barrio Sur” 141 4.3.3 Module 3: Technological integration – “Barrio Sur” 147 4.4 Empirical findings 151 4.4.1 Empirical findings of Module 1 151 4.4.2 Empirical findings of Module 2 155 4.4.3 Empirical findings of Module 3 157 4.4.4 Guidance of the methodology 157 5 DISCUSSION 161 5.1 Discussion on the conceptual framework 161 5.2 Discussion on the methodological framework 161 5.2.1 Discussion on Module 1: the building taxonomic approach 162 5.2.2 Discussion on Module 2: the depth-physical impact function 164 6 CONCLUSIONS AND OUTLOOK 167 6.1 Conclusions 167 6.2 Outlook 168 REFERENCES 171 INDEX OF FIGURES 199 INDEX OF TABLES 201 APPENDICES 20

    On Volumetric Shape Reconstruction from Implicit Forms

    Get PDF
    International audienceIn this paper we report on the evaluation of volumetric shape reconstruction methods that consider as input implicit forms in 3D. Many visual applications build implicit representations of shapes that are converted into explicit shape representations using geometric tools such as the Marching Cubes algorithm. This is the case with image based reconstructions that produce point clouds from which implicit functions are computed, with for instance a Poisson reconstruction approach. While the Marching Cubes method is a versatile solution with proven efficiency, alternative solutions exist with different and complementary properties that are of interest for shape modeling. In this paper, we propose a novel strategy that builds on Centroidal Voronoi Tessellations (CVTs). These tessellations provide volumetric and surface representations with strong regularities in addition to provably more accurate approximations of the implicit forms considered. In order to compare the existing strategies, we present an extensive evaluation that analyzes various properties of the main strategies for implicit to explicit volumetric conversions: Marching cubes, Delaunay refinement and CVTs, including accuracy and shape quality of the resulting shape mesh

    Fraud detection for online banking for scalable and distributed data

    Get PDF
    Online fraud causes billions of dollars in losses for banks. Therefore, online banking fraud detection is an important field of study. However, there are many challenges in conducting research in fraud detection. One of the constraints is due to unavailability of bank datasets for research or the required characteristics of the attributes of the data are not available. Numeric data usually provides better performance for machine learning algorithms. Most transaction data however have categorical, or nominal features as well. Moreover, some platforms such as Apache Spark only recognizes numeric data. So, there is a need to use techniques e.g. One-hot encoding (OHE) to transform categorical features to numerical features, however OHE has challenges including the sparseness of transformed data and that the distinct values of an attribute are not always known in advance. Efficient feature engineering can improve the algorithm’s performance but usually requires detailed domain knowledge to identify correct features. Techniques like Ripple Down Rules (RDR) are suitable for fraud detection because of their low maintenance and incremental learning features. However, high classification accuracy on mixed datasets, especially for scalable data is challenging. Evaluation of RDR on distributed platforms is also challenging as it is not available on these platforms. The thesis proposes the following solutions to these challenges: ‱ We developed a technique Highly Correlated Rule Based Uniformly Distribution (HCRUD) to generate highly correlated rule-based uniformly-distributed synthetic data. ‱ We developed a technique One-hot Encoded Extended Compact (OHE-EC) to transform categorical features to numeric features by compacting sparse-data even if all distinct values are unknown. ‱ We developed a technique Feature Engineering and Compact Unified Expressions (FECUE) to improve model efficiency through feature engineering where the domain of the data is not known in advance. ‱ A Unified Expression RDR fraud deduction technique (UE-RDR) for Big data has been proposed and evaluated on the Spark platform. Empirical tests were executed on multi-node Hadoop cluster using well-known classifiers on bank data, synthetic bank datasets and publicly available datasets from UCI repository. These evaluations demonstrated substantial improvements in terms of classification accuracy, ruleset compactness and execution speed.Doctor of Philosoph

    Improve the Low Energy Sensitivity of the HAWC Observatory

    Get PDF
    The high altitude water cherenkov gamma-ray observatory (HAWC) has been fully operational since March of 2015 in Mexico at 4,100 meters above sea level on the hillside of the Sierra Negra Volcano. It consists of an array of 300 water cherenkov detectors, each equipped with four photo-multiplier tubes. HAWC operates 24-hours per day with a wide field-of-view (FOV, ∌ 2 sr) and a high duty cycle (∌ 95%). These make it a powerful survey and monitoring experiment for mapping the gamma ray sky at very high energies (VHE, 100 GeV to 100 TeV) and to study sources with varying intensities. Thus HAWC is well suited to detect gamma-ray counterparts of possible flaring sources seen in neutrino events observed by IceCube or gravitational wave events observed by LIGO/Virgo. Extra-galactic sources including active galactic nuclei and gamma ray bursts are characterized by power-law spectra with most of the observed photon flux at 1 Tev and below. This corresponds to the lower energy range for HAWC. To participate in this science it is essential to optimize HAWC’s performance for gamma rays below ∌ 1 TeV. This is a particular challenge as in HAWC gamma rays below ∌ 1 TeV have a low signal-to-noise ratio, the events have limited and incomplete information and the HAWC Monte Carlo simulation does not well model all aspects of the events. Fortunately HAWC data includes a well characterized gamma ray source: the Crab nebula. Thus we use the significance level and the angular resolution of the Crab to quantify our gamma ray detection sensitivity improvements. Two critical factors are involved: the interpretation of HAWC raw detector signals (referred to as data reconstruction) and the rejection of cosmic ray background (referred to as gamma hadron separation). While this thesis focuses on different optimizations for (hadron) background rejection, both factors are addressed. An example of applying one improved analysis on searches for nearby AGNs is presented

    The physical properties of star forming galaxies in the low redshift universe

    Full text link
    (modified) We present a comprehensive study of the physical properties of \~10^5 galaxies with measurable star formation in the SDSS. By comparing physical information extracted from the emission lines with continuum properties, we build up a picture of the nature of star-forming galaxies at z<0.2. We take out essentially all aperture bias using resolved imaging, allowing an accurate estimate of the total SFRs in galaxies. We determine the SFR density to be 1.915^{+0.02}_{-0.01}(rand.)^{+0.14}_{-0.42} (sys.) h70 10^{-2} Msun/yr/Mpc^3 at z=0.1 (for a Kroupa IMF) and we study the distribution of star formation as a function of various physical parameters. The majority of the star formation in the low redshift universe takes place in moderately massive galaxies (10^10-10^11 Msun), typically in HSB disk galaxies. Roughly 15% of all star formation takes place in galaxies that show some sign of an active nucleus. About 20% occurs in starburst galaxies. We show that the present to past-average star formation rate, the Scalo b-parameter; is almost constant over almost three orders of magnitude in mass, declining only at M*>10^10 Msun. The volume averaged b parameter is 0.408^{+0.005}_{-0.002} (rand).^{+0.029}_{-0.090} (sys.) h70^{-1}. We use this value constrain the star formation history of the universe. In agreement with other work we find a correlation between bb and morphological type, as well as a tight correlation between the 4000AA break (D4000) and b. We discuss how D4000 can be used to estimate b parameters for high redshift galaxies.Comment: Accepted for MNRAS. Replaced with accepted version. A section on comparison with other methods of SFR estimation added and various updates have been made. The main results are almost unchange

    Expert System with an Embedded Imaging Module for Diagnosing Lung Diseases

    Get PDF
    Lung diseases are one of the major causes of suffering and death in the world. Improved survival rate could be obtained if the diseases can be detected at its early stage. Specialist doctors with the expertise and experience to interpret medical images and diagnose complex lung diseases are scarce. In this work, a rule-based expert system with an embedded imaging module is developed to assist the general physicians in hospitals and clinics to diagnose lung diseases whenever the services of specialist doctors are not available. The rule-based expert system contains a large knowledge base of data from various categories such as patient's personal and medical history, clinical symptoms, clinical test results and radiological information. An imaging module is integrated into the expert system for the enhancement of chest X-Ray images. The goal of this module is to enhance the chest X-Ray images so that it can provide details similar to more expensive methods such as MRl and CT scan. A new algorithm which is a modified morphological grayscale top hat transform is introduced to increase the visibility of lung nodules in chest X-Rays. Fuzzy inference technique is used to predict the probability of malignancy of the nodules. The output generated by the expert system was compared with the diagnosis made by the specialist doctors. The system is able to produce results\ud which are similar to the diagnosis made by the doctors and is acceptable by clinical standards

    Spatio-temporal Modelling of Accessibility to Train Stations for Park and Ride (PnR) Users

    Get PDF
    Accessibility has been of critical importance to physical planning over the past 60 years. This study mainly focuses a spatial methodology framework to understand measure and model the Park and Ride (PnR) users’ accessibility to train stations, specifically including the characteristics of catchment areas, directional accessibility to train stations, spatial modelling of train stations’ catchment areas, and spatio-temporal modelling of accessibility to train stations
    • 

    corecore