108 research outputs found

    Representation Learning With Convolutional Neural Networks

    Get PDF
    Deep learning methods have achieved great success in the areas of Computer Vision and Natural Language Processing. Recently, the rapidly developing field of deep learning is concerned with questions surrounding how we can learn meaningful and effective representations of data. This is because the performance of machine learning approaches is heavily dependent on the choice and quality of data representation, and different kinds of representation entangle and hide the different explanatory factors of variation behind the data. In this dissertation, we focus on representation learning with deep neural networks for different data formats including text, 3D polygon shapes, and brain fiber tracts. First, we propose a topic-based word representation learning approach for text classification. The proposed approach takes global semantic relationship between words over the whole corpus into consideration and encodes the relationships into distributed vector representations with continuous Skip-gram model. The learned representations which capture a large number of precise syntactic and semantic word relationships are taken as input of Convolution Neural Networks for classification. Our experimental results show the effectiveness of the proposed method on indexing of biomedical articles, behavior code annotation of clinical text fragments, and classification of news groups. Second, we present a 3D polygon shape representation learning framework for shape segmentation. We propose Directionally Convolutional Network (DCN) that extends convolution operations from images to the polygon mesh surface with rotation-invariant property. Based on the proposed DCN, we learn effective shape representations from raw geometric features and then classify each face of a given polygon into predefined semantic parts. Through extensive experiments, we demonstrate that our framework outperforms the current state-of-the-arts. Third, we propose to learn effective and meaningful representations for brain fiber tracts using deep learning frameworks. We handle the highly unbalanced dataset by introducing asymmetrical loss function for easily classified samples and hard classified ones. The training loss avoids to be dominated by the easy samples and the training step is more efficient. In addition, we learn more effective and meaningful representations by introducing deeper network and metric learning approaches. Furthermore, we propose to improve the interpretability of our framework by inducing attention mechanism. Our experimental results show that our proposed framework outperforms current golden standard significantly on the real-world dataset

    Deep generative modelling of the imaged human brain

    Get PDF
    Human-machine symbiosis is a very promising opportunity for the field of neurology given that the interpretation of the imaged human brain is a trivial feat for neither entity. However, before machine learning systems can be used in real world clinical situations, many issues with automated analysis must first be solved. In this thesis I aim to address what I consider the three biggest hurdles to the adoption of automated machine learning interpretative systems. For each issue, I will first elucidate the reader on its importance given the overarching narratives of both neurology and machine learning, and then showcase my proposed solutions to these issues through the use of deep generative models of the imaged human brain. First, I start by addressing what is an uncontroversial and universal sign of intelligence: the ability to extrapolate knowledge to unseen cases. Human neuroradiologists have studied the anatomy of the healthy brain and can therefore, with some success, identify most pathologies present on an imaged brain, even without having ever been previously exposed to them. Current discriminative machine learning systems require vast amounts of labelled data in order to accurately identify diseases. In this first part I provide a generative framework that permits machine learning models to more efficiently leverage unlabelled data for better diagnoses with either none or small amounts of labels. Secondly, I address a major ethical concern in medicine: equitable evaluation of all patients, regardless of demographics or other identifying characteristics. This is, unfortunately, something that even human practitioners fail at, making the matter ever more pressing: unaddressed biases in data will become biases in the models. To address this concern I suggest a framework through which a generative model synthesises demographically counterfactual brain imaging to successfully reduce the proliferation of demographic biases in discriminative models. Finally, I tackle the challenge of spatial anatomical inference, a task at the centre of the field of lesion-deficit mapping, which given brain lesions and associated cognitive deficits attempts to discover the true functional anatomy of the brain. I provide a new Bayesian generative framework and implementation that allows for greatly improved results on this challenge, hopefully, paving part of the road towards a greater and more complete understanding of the human brain

    Applications of Gradient Representations in Resting-State fMRI

    Get PDF
    Classical models of brain organization have often considered the brain to be made up of a mosaic of patches that are demarcated by discrete boundaries, often defined histologically. In contrast, emerging views have pointed towards an alternative paradigm – referred to as gradients – by conceptualizing brain organization as sets of organizational axes that characterizes spatial variation of differing connectivity principles over the extent of a region. Such organizational axes provide a well-suited framework for elucidating underpinnings of brain connectivity and has garnered widespread attention across various domains of neuroimaging. This work seeks to explore various applications of gradient estimation techniques, in combination with resting-state functional connectivity data, across the fields of basic, comparative, and clinical neuroscience. First, gradient estimation was performed on resting-state functional connectivity (RSFC) patterns of the primary somatosensory cortex to unveil a secondary organizational axis that spans the region’s anterior-posterior axis, akin to circuitry fundamental to sensory cortical information processing. Second, gradient techniques were used in a cross-species comparison study to unify connectivity principles of humans and marmosets by mapping them simultaneously onto a set of organizational axes. In doing so, this provided a systematic framework to compare the functional architecture of both species, facilitating novel insight of a well-integrated default-mode network in humans, compared to marmosets. Third, connectivity gradients, along with a myriad of other resting-state fMRI features were used to explore the implications of focal lesion pathophysiology on functional organization of the thalamus in individuals with Multiple Sclerosis. A lack of focal changes to resting-state related features was observed suggesting the limited role of focal thalamic lesions to functional organization in MS. Together, these different avenues of research highlight the capacity for a gradient-centric view in neuroimaging to provide profound insights into brain organization, and its utility across the applications of basic, comparative, and clinical neuroscience

    Méthodes mathématiques d’analyse d’image pour les études de population transversales et longitudinales

    Get PDF
    In medicine, large scale population analysis aim to obtain statistical information in order to understand better diseases, identify their risk factors, develop preventive and curative treatments and improve the quality of life of the patients.In this thesis, we first introduce the medical context of Alzheimer’s disease, recall some concepts of statistical learning and the challenges that typically occurwhen applied in medical imaging. The second part focus on cross-sectional studies,i.e. at a single time point. We present an efficient method to classify white matter lesions based on support vector machines. Then we discuss the use of manifoldlearning techniques for image and shape analysis. Finally, we present extensions ofLaplacian eigenmaps to improve the low-dimension representations of patients usingthe combination of imaging and clinical data. The third part focus on longitudinalstudies, i.e. between several time points. We quantify the hippocampus deformations of patients via the large deformation diffeomorphic metric mapping frameworkto build disease progression classifiers. We introduce novel strategies and spatialregularizations for the classification and identification of biomarkers.En médecine, les analyses de population à grande échelle ont pour but d’obtenir des informations statistiques pour mieux comprendre des maladies, identifier leurs facteurs de risque, développer des traitements préventifs et curatifs et améliorer la qualité de vie des patients.Dans cette thèse, nous présentons d’abord le contexte médical de la maladie d’Alzheimer, rappelons certains concepts d’apprentissage statistique et difficultés rencontrées lors de l’application en imagerie médicale. Dans la deuxième partie,nous nous intéressons aux analyses transversales, c-a-d ayant un seul point temporel.Nous présentons une méthode efficace basée sur les séparateurs à vaste marge (SVM)permettant de classifier des lésions dans la matière blanche. Ensuite, nous étudions les techniques d’apprentissage de variétés pour l’analyse de formes et d’images, et présentons deux extensions des Laplacian eigenmaps améliorant la représentation de patients en faible dimension grâce à la combinaison de données d’imagerie et cliniques. Dans la troisième partie, nous nous intéressons aux analyses longitudinales, c-a-d entre plusieurs points temporels. Nous quantifions les déformations des hippocampus de patients via le modèle des larges déformations par difféomorphismes pour classifier les évolutions de la maladie. Nous introduisons de nouvelles stratégies et des régularisations spatiales pour la classification et l’identification de marqueurs biologiques

    Decoupling function and anatomy in atlases of functional connectivity patterns: Language mapping in tumor patients

    Get PDF
    In this paper we construct an atlas that summarizes functional connectivity characteristics of a cognitive process from a population of individuals. The atlas encodes functional connectivity structure in a low-dimensional embedding space that is derived from a diffusion process on a graph that represents correlations of fMRI time courses. The functional atlas is decoupled from the anatomical space, and thus can represent functional networks with variable spatial distribution in a population. In practice the atlas is represented by a common prior distribution for the embedded fMRI signals of all subjects. We derive an algorithm for fitting this generative model to the observed data in a population. Our results in a language fMRI study demonstrate that the method identifies coherent and functionally equivalent regions across subjects. The method also successfully maps functional networks from a healthy population used as a training set to individuals whose language networks are affected by tumors.National Science Foundation (U.S.). Division of Information & Intelligent Systems (Collaborative Research in Computational Neuroscience Grant 0904625)National Science Foundation (U.S.) (CAREER Grant 0642971)National Institutes of Health (U.S.) (National Center for Research Resources (U.S.)/Neuroimaging Analysis Center (U.S.) P41-RR13218)National Institutes of Health (U.S.) (National Institute for Biomedical Imaging and Bioengineering (U.S.)/Neuroimaging Analysis Center (U.S.) P41-EB-015902)National Institutes of Health (U.S.) (National Institute for Biomedical Imaging and Bioengineering (U.S.)/National Alliance for Medical Image Computing (U.S.) U54-EB005149)National Institutes of Health (U.S.) (U41RR019703)National Institutes of Health (U.S.) (Eunice Kennedy Shriver National Institute of Child Health and Human Development (U.S.) R01HD067312)National Institutes of Health (U.S.) (P01CA067165)Brain Science FoundationKlarman Family FoundationEuropean Commission (FP7/2007–2013) n°257528 (KHRESMOI))European Commission (330003 (FABRIC))Austrian Science Fund (P 22578-B19 (PULMARCH)

    Advancing efficiency and robustness of neural networks for imaging

    Get PDF
    Enabling machines to see and analyze the world is a longstanding research objective. Advances in computer vision have the potential of influencing many aspects of our lives as they can enable machines to tackle a variety of tasks. Great progress in computer vision has been made, catalyzed by recent progress in machine learning and especially the breakthroughs achieved by deep artificial neural networks. Goal of this work is to alleviate limitations of deep neural networks that hinder their large-scale adoption for real-world applications. To this end, it investigates methodologies for constructing and training deep neural networks with low computational requirements. Moreover, it explores strategies for achieving robust performance on unseen data. Of particular interest is the application of segmenting volumetric medical scans because of the technical challenges it imposes, as well as its clinical importance. The developed methodologies are generic and of relevance to a broader computer vision and machine learning audience. More specifically, this work introduces an efficient 3D convolutional neural network architecture, which achieves high performance for segmentation of volumetric medical images, an application previously hindered by high computational requirements of 3D networks. It then investigates sensitivity of network performance on hyper-parameter configuration, which we interpret as overfitting the model configuration to the data available during development. It is shown that ensembling a set of models with diverse configurations mitigates this and improves generalization. The thesis then explores how to utilize unlabelled data for learning representations that generalize better. It investigates domain adaptation and introduces an architecture for adversarial networks tailored for adaptation of segmentation networks. Finally, a novel semi-supervised learning method is proposed that introduces a graph in the latent space of a neural network to capture relations between labelled and unlabelled samples. It then regularizes the embedding to form a compact cluster per class, which improves generalization.Open Acces

    Towards autonomous diagnostic systems with medical imaging

    Get PDF
    Democratizing access to high quality healthcare has highlighted the need for autonomous diagnostic systems that a non-expert can use. Remote communities, first responders and even deep space explorers will come to rely on medical imaging systems that will provide them with Point of Care diagnostic capabilities. This thesis introduces the building blocks that would enable the creation of such a system. Firstly, we present a case study in order to further motivate the need and requirements of autonomous diagnostic systems. This case study primarily concerns deep space exploration where astronauts cannot rely on communication with earth-bound doctors to help them through diagnosis, nor can they make the trip back to earth for treatment. Requirements and possible solutions about the major challenges faced with such an application are discussed. Moreover, this work describes how a system can explore its perceived environment by developing a Multi Agent Reinforcement Learning method that allows for implicit communication between the agents. Under this regime agents can share the knowledge that benefits them all in achieving their individual tasks. Furthermore, we explore how systems can understand the 3D properties of 2D depicted objects in a probabilistic way. In Part II, this work explores how to reason about the extracted information in a causally enabled manner. A critical view on the applications of causality in medical imaging, and its potential uses is provided. It is then narrowed down to estimating possible future outcomes and reasoning about counterfactual outcomes by embedding data on a pseudo-Riemannian manifold and constraining the latent space by using the relativistic concept of light cones. By formalizing an approach to estimating counterfactuals, a computationally lighter alternative to the abduction-action-prediction paradigm is presented through the introduction of Deep Twin Networks. Appropriate partial identifiability constraints for categorical variables are derived and the method is applied in a series of medical tasks involving structured data, images and videos. All methods are evaluated in a wide array of synthetic and real life tasks that showcase their abilities, often achieving state-of-the-art performance or matching the existing best performance while requiring a fraction of the computational cost.Open Acces

    Precision Monitoring for Disease Progression in Patients with Multiple Sclerosis: A Deep Learning Approach

    Get PDF
    Artificial intelligence has tremendous potential in a range of clinical applications. Leveraging recent advances in deep learning, the works in this thesis has generated a range of technologies for patients with Multiple Sclerosis (MS) that facilitate precision monitoring using routine MRI and clinical assessments; and contribute to realising the goal of personalised disease management. MS is a chronic inflammatory demyelinating disease of the central nervous system (CNS), characterised by focal demyelinating plaques in the brain and spinal cord; and progressive neurodegeneration. Despite success in cohort studies and clinical trials, the measurement of disease activity using conventional imaging biomarkers in real-world clinical practice is limited to qualitative assessment of lesion activity, which is time consuming and prone to human error. Quantitative measures, such as T2 lesion load, volumetric assessment of lesion activity and brain atrophy, are constrained by challenges associated with handling real-world data variances. In this thesis, DeepBVC was developed for robust brain atrophy assessment through imaging synthesis, while a lesion segmentation model was developed using a novel federated learning framework, Fed-CoT, to leverage large data collaborations. With existing quantitative brain structural analyses, this work has developed an effective deep learning analysis pipeline, which delivers a fully automated suite of MS-specific clinical imaging biomarkers to facilitate the precision monitoring of patients with MS and response to disease modifying therapy. The framework for individualised MRI-guided management in this thesis was complemented by a disease prognosis model, based on a Large Language Model, providing insights into the risks of clinical worsening over the subsequent 3 years. The value and performance of the MS biomarkers in this thesis are underpinned by extensive validation in real-world, multi-centre data from more than 1030 patients
    • …
    corecore