41 research outputs found

    Analysis of sparsity- and nonlocality-reinforced convolutional neural networks for image denoising

    Get PDF
    Different neural networks are built using the same elements, but they vary a lot by architecture and show different denoising quality results. The master thesis aims to analyze how different network hyperparameters, input transformations (sparsity), and nonlocal filters (nonlocality) impact the performance in image denoising tasks. The thesis work provides rich experimentally based research and analysis in the image denoising field. A number of different denoising methods have been considered, and state-of-the-art denoising algorithms were examined. As a result of the parameter analysis, the vast majority of the learning-based algorithms and deep networks were improved in terms of denoising capability

    Efficient Deep Learning-based Estimation of the Vital Signs on Smartphones

    Full text link
    Nowadays, due to the widespread use of smartphones in everyday life and the improvement of computational capabilities of these devices, many complex tasks can now be deployed on them. Concerning the need for continuous monitoring of vital signs, especially for the elderly or those with certain types of diseases, the development of algorithms that can estimate vital signs using smartphones has attracted researchers worldwide. Such algorithms estimate vital signs (heart rate and oxygen saturation level) by processing an input PPG signal. These methods often apply multiple pre-processing steps to the input signal before the prediction step. This can increase the computational complexity of these methods, meaning only a limited number of mobile devices can run them. Furthermore, multiple pre-processing steps also require the design of a couple of hand-crafted stages to obtain an optimal result. This research proposes a novel end-to-end solution to mobile-based vital sign estimation by deep learning. The proposed method does not require any pre-processing. Due to the use of fully convolutional architecture, the parameter count of our proposed model is, on average, a quarter of the ordinary architectures that use fully-connected layers as the prediction heads. As a result, the proposed model has less over-fitting chance and computational complexity. A public dataset for vital sign estimation, including 62 videos collected from 35 men and 27 women, is also provided. The experimental results demonstrate state-of-the-art estimation accuracy.Comment: 6 pages, 9 figures, 4 table

    Deep Learning in Visual Computing and Signal Processing

    Get PDF

    Automatic Segmentation of Intramedullary Multiple Sclerosis Lesions

    Get PDF
    Contexte: La moelle Ă©piniĂšre est un composant essentiel du systĂšme nerveux central. Elle contient des neurones responsables d’importantes fonctionnalitĂ©s et assure la transmission d’informations motrices et sensorielles entre le cerveau et le systĂšme nerveux pĂ©riphĂ©rique. Un endommagement de la moelle Ă©piniĂšre, causĂ© par un choc ou une maladie neurodĂ©gĂ©nĂ©rative, peut mener Ă  un sĂ©rieux handicap, pouvant entraĂźner des incapacitĂ©s fonctionnelles, de la paralysie et/ou de la douleur. Chez les patients atteints de sclĂ©rose en plaques (SEP), la moelle Ă©piniĂšre est frĂ©quemment affectĂ©e par de l’atrophie et/ou des lĂ©sions. L’imagerie par rĂ©sonance magnĂ©tique (IRM) conventionnelle est largement utilisĂ©e par des chercheurs et des cliniciens pour Ă©valuer et caractĂ©riser, de façon non-invasive, des altĂ©rations micro-structurelles. Une Ă©valuation quantitative des atteintes structurelles portĂ©es Ă  la moelle Ă©piniĂšre (e.g. sĂ©vĂ©ritĂ© de l’atrophie, extension des lĂ©sions) est essentielle pour le diagnostic, le pronostic et la supervision sur le long terme de maladies, telles que la SEP. De plus, le dĂ©veloppement de biomarqueurs impartiaux est indispensable pour Ă©valuer l’effet de nouveaux traitements thĂ©rapeutiques. La segmentation de la moelle Ă©piniĂšre et des lĂ©sions intramĂ©dullaires de SEP sont, par consĂ©quent, pertinentes d’un point de vue clinique, aussi bien qu’une Ă©tape nĂ©cessaire vers l’interprĂ©tation d’images RM multiparamĂ©triques. Cependant, la segmentation manuelle est une tĂąche extrĂȘmement chronophage, fastidieuse et sujette Ă  des variations inter- et intra-expert. Il y a par consĂ©quent un besoin d’automatiser les mĂ©thodes de segmentations, ce qui pourrait faciliter l’efficacitĂ© procĂ©dures d’analyses. La segmentation automatique de lĂ©sions est compliquĂ© pour plusieurs raisons: (i) la variabilitĂ© des lĂ©sions en termes de forme, taille et position, (ii) les contours des lĂ©sions sont la plupart du temps difficilement discernables, (iii) l’intensitĂ© des lĂ©sions sur des images MR sont similaires Ă  celles de structures visiblement saines. En plus de cela, rĂ©aliser une segmentation rigoureuse sur l’ensemble d’une base de donnĂ©es multi-centrique d’IRM est rendue difficile par l’importante variabilitĂ© des protocoles d’acquisition (e.g. rĂ©solution, orientation, champ de vue de l’image). MalgrĂ© de considĂ©rables rĂ©cents dĂ©veloppements dans le traitement d’images MR de moelle Ă©piniĂšre, il n’y a toujours pas de mĂ©thode disponible pouvant fournir une segmentation rigoureuse et fiable de la moelle Ă©piniĂšre pour un large spectre de pathologies et de protocoles d’acquisition. Concernant les lĂ©sions intramĂ©dullaires, une recherche approfondie dans la littĂ©rature n’a pas pu fournir une mĂ©thode disponible de segmentation automatique. Objectif: DĂ©velopper un systĂšme complĂštement automatique pour segmenter la moelle Ă©piniĂšre et les lĂ©sions intramĂ©dullaires sur des IRM conventionnelles humaines. MĂ©thode: L’approche prĂ©sentĂ©e est basĂ©e de deux rĂ©seaux de neurones Ă  convolution mis en cascade. La mĂ©thode a Ă©tĂ© pensĂ©e pour faire face aux principaux obstacles que prĂ©sentent les donnĂ©es IRM de moelle Ă©piniĂšre. Le procĂ©dĂ© de segmentation a Ă©tĂ© entrainĂ© et validĂ© sur une base de donnĂ©es privĂ©e composĂ©e de 1943 images, acquises dans 30 diffĂ©rents centres avec des protocoles hĂ©tĂ©rogĂšnes. Les sujets scannĂ©s comportent 459 sujets sains, 471 patients SEP et 112 avec d’autres pathologies affectant la moelle Ă©piniĂšre. Le module de segmentation de la moelle Ă©piniĂšre a Ă©tĂ© comparĂ© Ă  une mĂ©thode existante reconnue par la communautĂ©, PropSeg. RĂ©sultats: L’approche basĂ©e sur les rĂ©seaux de neurones Ă  convolution a fourni de meilleurs rĂ©sultats que PropSeg, atteignant un Dice mĂ©dian (intervalle inter-quartiles) de 94.6 (4.6) vs. 87.9 (18.3) %. Pour les lĂ©sions, notre segmentation automatique a permis d'obtenir un Dice de 60.0 (21.4) % en le comparant Ă  la segmentation manuelle, un ratio de vrai positifs de 83 (34) %, et une prĂ©cision de 77 (44) %. Conclusion: Une mĂ©thode complĂštement automatique et innovante pour segmenter la moelle Ă©piniĂšre et les lĂ©sions SEP intramĂ©dullaires sur des donnĂ©es IRM a Ă©tĂ© conçue durant ce projet de maĂźtrise. La mĂ©thode a Ă©tĂ© abondamment validĂ©e sur une base de donnĂ©es clinique. La robustesse de la mĂ©thode de segmentation de moelle Ă©piniĂšre a Ă©tĂ© dĂ©montrĂ©e, mĂȘme sur des cas pathologiques. Concernant la segmentation des lĂ©sions, les rĂ©sultats sont encourageants, malgrĂ© un taux de faux positifs relativement Ă©levĂ©. Je crois en l’impact que peut potentiellement avoir ces outils pour la communautĂ© de chercheurs. Dans cette optique, les mĂ©thodes ont Ă©tĂ© intĂ©grĂ©es et documentĂ©es dans un logiciel en accĂšs-ouvert, la “Spinal Cord Toolbox”. Certains des outils dĂ©veloppĂ©s pendant ce projet de MaĂźtrise sont dĂ©jĂ  utilisĂ©s par des analyses d’études cliniques, portant sur des patients SEP et sclĂ©rose latĂ©rale amyotrophique.----------ABSTRACT Context: The spinal cord is a key component of the central nervous system, which contains neurons responsible for complex functions, and ensures the conduction of motor and sensory information between the brain and the peripheral nervous system. Damage to the spinal cord, through trauma or neurodegenerative diseases, can lead to severe impairment, including functional disabilities, paralysis and/or pain. In multiple sclerosis (MS) patients, the spinal cord is frequently affected by atrophy and/or lesions. Conventional magnetic resonance imaging (MRI) is widely used by researchers and clinicians to non-invasively assess and characterize spinal cord microstructural changes. Quantitative assessment of the structural damage to the spinal cord (e.g. atrophy severity, lesion extent) is essential for the diagnosis, prognosis and longitudinal monitoring of diseases, such as MS. Furthermore, the development of objective biomarkers is essential to evaluate the effect of new therapeutic treatments. Spinal cord and intramedullary MS lesions segmentation is consequently clinically relevant, as well as a necessary step towards the interpretation of multi-parametric MR images. However, manual segmentation is highly time-consuming, tedious and prone to intra- and inter-rater variability. There is therefore a need for automated segmentation methods to facilitate the efficiency of analysis pipelines. Automatic lesion segmentation is challenging for various reasons: (i) lesion variability in terms of shape, size and location, (ii) lesion boundaries are most of the time not well defined, (iii) lesion intensities on MR data are confounding with those of normal-appearing structures. Moreover, achieving robust segmentation across multi-center MRI data is challenging because of the broad variability of data features (e.g. resolution, orientation, field of view). Despite recent substantial developments in spinal cord MRI processing, there is still no method available that can yield robust and reliable spinal cord segmentation across the very diverse spinal pathologies and data features. Regarding the intramedullary lesions, a thorough search of the relevant literature did not yield available method of automatic segmentation. Goal: To develop a fully-automatic framework for segmenting the spinal cord and intramedullary MS lesions from conventional human MRI data. Method: The presented approach is based on a cascade of two Convolutional Neural Networks (CNN). The method has been designed to face the main challenges of ‘real world’ spinal cord MRI data. It was trained and validated on a private dataset made up of 1943 MR volumes, acquired in different 30 sites with heterogeneous acquisition protocols. Scanned subjects involve 459 healthy controls, 471 MS patients and 112 with other spinal pathologies. The proposed spinal cord segmentation method was compared to a state-of-the-art spinal cord segmentation method, PropSeg. Results: The CNN-based approach achieved better results than PropSeg, yielding a median (interquartile range) Dice of 94.6 (4.6) vs. 87.9 (18.3) % when compared to the manual segmentation. For the lesion segmentation task, our method provided a median Dice-overlap with the manual segmentation of 60.0 (21.4) %, a lesion-based true positive rate of 83 (34) % and a lesion-based precision de 77 (44) %. Conclusion: An original fully-automatic method to segment the spinal cord and intramedullary MS lesions on MRI data has been devised during this Master’s project. The method was validated extensively against a clinical dataset. The robustness of the spinal cord segmentation has been demonstrated, even on challenging pathological cases. Regarding the lesion segmentation, the results are encouraging despite the fairly high false positive rate. I believe in the potential value of these developed tools for the research community. In this vein, the methods are integrated and documented into an open-source software, the Spinal Cord Toolbox. Some of the tools developed during this Master’s project are already integrated into automated analysis pipelines of clinical studies, including MS and Amyotrophic Lateral Sclerosis patients

    Region of Interest Localization Methods for Publicly Available Palmprint Databases

    Get PDF
    So far, there exist many publicly available palmprint databases. However, not all of them have provided the corresponding region of interest (ROI) images. If everyone uses their own extracted ROI images for performance testing, the final accuracy is not strictly comparable. Since ROI localization is the critical stage of palmprint recognition. The location precision has a significant impact on the final recognition accuracy, especially in unconstrained scenarios. This problem has limited the applications of palmprint recognition. However, many currently published surveys only focus on feature extraction and classification methods. Throughout these years, many new ROI localization methods have been proposed. In this chapter, we will group the existing ROI localization methods into different categories, analyze their basic ideas, reproduce some of the codes, make comparisons of their performances, and provide further directions. We hope this could be a useful reference for further research

    Deep Learning in Visual Computing and Signal Processing

    Get PDF
    Deep learning is a subfield of machine learning, which aims to learn a hierarchy of features from input data. Nowadays, researchers have intensively investigated deep learning algorithms for solving challenging problems in many areas such as image classification, speech recognition, signal processing, and natural language processing. In this study, we not only review typical deep learning algorithms in computer vision and signal processing but also provide detailed information on how to apply deep learning to specific areas such as road crack detection, fault diagnosis, and human activity detection. Besides, this study also discusses the challenges of designing and training deep neural networks
    corecore