41 research outputs found
Analysis of sparsity- and nonlocality-reinforced convolutional neural networks for image denoising
Different neural networks are built using the same elements, but they vary a lot by architecture and show different denoising quality results. The master thesis aims to analyze how different network hyperparameters, input transformations (sparsity), and nonlocal filters (nonlocality) impact the performance in image denoising tasks. The thesis work provides rich experimentally based research and analysis in the image denoising field. A number of different denoising methods have been considered, and state-of-the-art denoising algorithms were examined. As a result of the parameter analysis, the vast majority of the learning-based algorithms and deep networks were improved in terms of denoising capability
Efficient Deep Learning-based Estimation of the Vital Signs on Smartphones
Nowadays, due to the widespread use of smartphones in everyday life and the
improvement of computational capabilities of these devices, many complex tasks
can now be deployed on them. Concerning the need for continuous monitoring of
vital signs, especially for the elderly or those with certain types of
diseases, the development of algorithms that can estimate vital signs using
smartphones has attracted researchers worldwide. Such algorithms estimate vital
signs (heart rate and oxygen saturation level) by processing an input PPG
signal. These methods often apply multiple pre-processing steps to the input
signal before the prediction step. This can increase the computational
complexity of these methods, meaning only a limited number of mobile devices
can run them. Furthermore, multiple pre-processing steps also require the
design of a couple of hand-crafted stages to obtain an optimal result. This
research proposes a novel end-to-end solution to mobile-based vital sign
estimation by deep learning. The proposed method does not require any
pre-processing. Due to the use of fully convolutional architecture, the
parameter count of our proposed model is, on average, a quarter of the ordinary
architectures that use fully-connected layers as the prediction heads. As a
result, the proposed model has less over-fitting chance and computational
complexity. A public dataset for vital sign estimation, including 62 videos
collected from 35 men and 27 women, is also provided. The experimental results
demonstrate state-of-the-art estimation accuracy.Comment: 6 pages, 9 figures, 4 table
Automatic Segmentation of Intramedullary Multiple Sclerosis Lesions
Contexte: La moelle Ă©piniĂšre est un composant essentiel du systĂšme nerveux central. Elle contient des neurones responsables dâimportantes fonctionnalitĂ©s et assure la transmission dâinformations motrices et sensorielles entre le cerveau et le systĂšme nerveux pĂ©riphĂ©rique. Un endommagement de la moelle Ă©piniĂšre, causĂ© par un choc ou une maladie neurodĂ©gĂ©nĂ©rative, peut mener Ă un sĂ©rieux handicap, pouvant entraĂźner des incapacitĂ©s fonctionnelles, de la paralysie et/ou de la douleur. Chez les patients atteints de sclĂ©rose en plaques (SEP), la moelle Ă©piniĂšre est frĂ©quemment affectĂ©e par de lâatrophie et/ou des lĂ©sions. Lâimagerie par rĂ©sonance magnĂ©tique (IRM) conventionnelle est largement utilisĂ©e par des chercheurs et des cliniciens pour Ă©valuer et caractĂ©riser, de façon non-invasive, des altĂ©rations micro-structurelles. Une Ă©valuation quantitative des atteintes structurelles portĂ©es Ă la moelle Ă©piniĂšre (e.g. sĂ©vĂ©ritĂ© de lâatrophie, extension des lĂ©sions) est essentielle pour le diagnostic, le pronostic et la supervision sur le long terme de maladies, telles que la SEP. De plus, le dĂ©veloppement de biomarqueurs impartiaux est indispensable pour Ă©valuer lâeffet de nouveaux traitements thĂ©rapeutiques. La segmentation de la moelle Ă©piniĂšre et des lĂ©sions intramĂ©dullaires de SEP sont, par consĂ©quent, pertinentes dâun point de vue clinique, aussi bien quâune Ă©tape nĂ©cessaire vers lâinterprĂ©tation dâimages RM multiparamĂ©triques. Cependant, la segmentation manuelle est une tĂąche extrĂȘmement chronophage, fastidieuse et sujette Ă des variations inter- et intra-expert. Il y a par consĂ©quent un besoin dâautomatiser les mĂ©thodes de segmentations, ce qui pourrait faciliter lâefficacitĂ© procĂ©dures dâanalyses. La segmentation automatique de lĂ©sions est compliquĂ© pour plusieurs raisons: (i) la variabilitĂ© des lĂ©sions en termes de forme, taille et position, (ii) les contours des lĂ©sions sont la plupart du temps difficilement discernables, (iii) lâintensitĂ© des lĂ©sions sur des images MR sont similaires Ă celles de structures visiblement saines. En plus de cela, rĂ©aliser une segmentation rigoureuse sur lâensemble dâune base de donnĂ©es multi-centrique dâIRM est rendue difficile par lâimportante variabilitĂ© des protocoles dâacquisition (e.g. rĂ©solution, orientation, champ de vue de lâimage). MalgrĂ© de considĂ©rables rĂ©cents dĂ©veloppements dans le traitement dâimages MR de moelle Ă©piniĂšre, il nây a toujours pas de mĂ©thode disponible pouvant fournir une segmentation rigoureuse et fiable de la moelle Ă©piniĂšre pour un large spectre de pathologies et de protocoles dâacquisition. Concernant les lĂ©sions intramĂ©dullaires, une recherche approfondie dans la littĂ©rature nâa pas pu fournir une mĂ©thode disponible de segmentation automatique.
Objectif: Développer un systÚme complÚtement automatique pour segmenter la moelle épiniÚre et les lésions intramédullaires sur des IRM conventionnelles humaines.
MĂ©thode: Lâapproche prĂ©sentĂ©e est basĂ©e de deux rĂ©seaux de neurones Ă convolution mis en cascade. La mĂ©thode a Ă©tĂ© pensĂ©e pour faire face aux principaux obstacles que prĂ©sentent les donnĂ©es IRM de moelle Ă©piniĂšre. Le procĂ©dĂ© de segmentation a Ă©tĂ© entrainĂ© et validĂ© sur une base de donnĂ©es privĂ©e composĂ©e de 1943 images, acquises dans 30 diffĂ©rents centres avec des protocoles hĂ©tĂ©rogĂšnes. Les sujets scannĂ©s comportent 459 sujets sains, 471 patients SEP et 112 avec dâautres pathologies affectant la moelle Ă©piniĂšre. Le module de segmentation de la moelle Ă©piniĂšre a Ă©tĂ© comparĂ© Ă une mĂ©thode existante reconnue par la communautĂ©, PropSeg.
RĂ©sultats: Lâapproche basĂ©e sur les rĂ©seaux de neurones Ă convolution a fourni de meilleurs rĂ©sultats que PropSeg, atteignant un Dice mĂ©dian (intervalle inter-quartiles) de 94.6 (4.6) vs. 87.9 (18.3) %. Pour les lĂ©sions, notre segmentation automatique a permis d'obtenir un Dice de 60.0 (21.4) % en le comparant Ă la segmentation manuelle, un ratio de vrai positifs de 83 (34) %, et une prĂ©cision de 77 (44) %.
Conclusion: Une mĂ©thode complĂštement automatique et innovante pour segmenter la moelle Ă©piniĂšre et les lĂ©sions SEP intramĂ©dullaires sur des donnĂ©es IRM a Ă©tĂ© conçue durant ce projet de maĂźtrise. La mĂ©thode a Ă©tĂ© abondamment validĂ©e sur une base de donnĂ©es clinique. La robustesse de la mĂ©thode de segmentation de moelle Ă©piniĂšre a Ă©tĂ© dĂ©montrĂ©e, mĂȘme sur des cas pathologiques. Concernant la segmentation des lĂ©sions, les rĂ©sultats sont encourageants, malgrĂ© un taux de faux positifs relativement Ă©levĂ©. Je crois en lâimpact que peut potentiellement avoir ces outils pour la communautĂ© de chercheurs. Dans cette optique, les mĂ©thodes ont Ă©tĂ© intĂ©grĂ©es et documentĂ©es dans un logiciel en accĂšs-ouvert, la âSpinal Cord Toolboxâ. Certains des outils dĂ©veloppĂ©s pendant ce projet de MaĂźtrise sont dĂ©jĂ utilisĂ©s par des analyses dâĂ©tudes cliniques, portant sur des patients SEP et sclĂ©rose latĂ©rale amyotrophique.----------ABSTRACT
Context: The spinal cord is a key component of the central nervous system, which contains neurons responsible for complex functions, and ensures the conduction of motor and sensory information between the brain and the peripheral nervous system. Damage to the spinal cord, through trauma or neurodegenerative diseases, can lead to severe impairment, including functional disabilities, paralysis and/or pain. In multiple sclerosis (MS) patients, the spinal cord is frequently affected by atrophy and/or lesions. Conventional magnetic resonance imaging (MRI) is widely used by researchers and clinicians to non-invasively assess and characterize spinal cord microstructural changes. Quantitative assessment of the structural damage to the spinal cord (e.g. atrophy severity, lesion extent) is essential for the diagnosis, prognosis and longitudinal monitoring of diseases, such as MS. Furthermore, the development of objective biomarkers is essential to evaluate the effect of new therapeutic treatments. Spinal cord and intramedullary MS lesions segmentation is consequently clinically relevant, as well as a necessary step towards the interpretation of multi-parametric MR images. However, manual segmentation is highly time-consuming, tedious and prone to intra- and inter-rater variability. There is therefore a need for automated segmentation methods to facilitate the efficiency of analysis pipelines. Automatic lesion segmentation is challenging for various reasons: (i) lesion variability in terms of shape, size and location, (ii) lesion boundaries are most of the time not well defined, (iii) lesion intensities on MR data are confounding with those of normal-appearing structures. Moreover, achieving robust segmentation across multi-center MRI data is challenging because of the broad variability of data features (e.g. resolution, orientation, field of view). Despite recent substantial developments in spinal cord MRI processing, there is still no method available that can yield robust and reliable spinal cord segmentation across the very diverse spinal pathologies and data features. Regarding the intramedullary lesions, a thorough search of the relevant literature did not yield available method of automatic segmentation.
Goal: To develop a fully-automatic framework for segmenting the spinal cord and intramedullary MS lesions from conventional human MRI data.
Method: The presented approach is based on a cascade of two Convolutional Neural Networks (CNN). The method has been designed to face the main challenges of âreal worldâ spinal cord MRI data. It was trained and validated on a private dataset made up of 1943 MR volumes, acquired in different 30 sites with heterogeneous acquisition protocols. Scanned subjects involve 459 healthy controls, 471 MS patients and 112 with other spinal pathologies. The proposed spinal cord segmentation method was compared to a state-of-the-art spinal cord segmentation method, PropSeg.
Results: The CNN-based approach achieved better results than PropSeg, yielding a median (interquartile range) Dice of 94.6 (4.6) vs. 87.9 (18.3) % when compared to the manual segmentation. For the lesion segmentation task, our method provided a median Dice-overlap with the manual segmentation of 60.0 (21.4) %, a lesion-based true positive rate of 83 (34) % and a lesion-based precision de 77 (44) %.
Conclusion: An original fully-automatic method to segment the spinal cord and intramedullary MS lesions on MRI data has been devised during this Masterâs project. The method was validated extensively against a clinical dataset. The robustness of the spinal cord segmentation has been demonstrated, even on challenging pathological cases. Regarding the lesion segmentation, the results are encouraging despite the fairly high false positive rate. I believe in the potential value of these developed tools for the research community. In this vein, the methods are integrated and documented into an open-source software, the Spinal Cord Toolbox. Some of the tools developed during this Masterâs project are already integrated into automated analysis pipelines of clinical studies, including MS and Amyotrophic Lateral Sclerosis patients
Region of Interest Localization Methods for Publicly Available Palmprint Databases
So far, there exist many publicly available palmprint databases. However, not all of them have provided the corresponding region of interest (ROI) images. If everyone uses their own extracted ROI images for performance testing, the final accuracy is not strictly comparable. Since ROI localization is the critical stage of palmprint recognition. The location precision has a significant impact on the final recognition accuracy, especially in unconstrained scenarios. This problem has limited the applications of palmprint recognition. However, many currently published surveys only focus on feature extraction and classification methods. Throughout these years, many new ROI localization methods have been proposed. In this chapter, we will group the existing ROI localization methods into different categories, analyze their basic ideas, reproduce some of the codes, make comparisons of their performances, and provide further directions. We hope this could be a useful reference for further research
Deep Learning in Visual Computing and Signal Processing
Deep learning is a subfield of machine learning, which aims to learn a hierarchy of features from input data. Nowadays, researchers have intensively investigated deep learning algorithms for solving challenging problems in many areas such as image classification, speech recognition, signal processing, and natural language processing. In this study, we not only review typical deep learning algorithms in computer vision and signal processing but also provide detailed information on how to apply deep learning to specific areas such as road crack detection, fault diagnosis, and human activity detection. Besides, this study also discusses the challenges of designing and training deep neural networks