40 research outputs found
Cross-Spectral Face Recognition Between Near-Infrared and Visible Light Modalities.
In this thesis, improvement of face recognition performance with the use of images from the visible (VIS) and near-infrared (NIR) spectrum is attempted. Face recognition systems can be adversely affected by scenarios which encounter a significant amount of illumination variation across images of the same subject. Cross-spectral face recognition systems using images collected across the VIS and NIR spectrum can counter the ill-effects of illumination variation by standardising both sets of images. A novel preprocessing technique is proposed, which attempts the transformation of faces across both modalities to a feature space with enhanced correlation. Direct matching across the modalities is not possible due to the inherent spectral differences between NIR and VIS face images. Compared to a VIS light source, NIR radiation has a greater penetrative depth when incident on human skin. This fact, in addition to the greater number of scattering interactions within the skin by rays from the NIR spectrum can alter the morphology of the human face enough to disable a direct match with the corresponding VIS face. Several ways to bridge the gap between NIR-VIS faces have been proposed previously. Mostly of a data-driven approach, these techniques include standardised photometric normalisation techniques and subspace projections. A generative approach driven by a true physical model has not been investigated till now. In this thesis, it is proposed that a large proportion of the scattering interactions present in the NIR spectrum can be accounted for using a model for subsurface scattering. A novel subsurface scattering inversion (SSI) algorithm is developed that implements an inversion approach based on translucent surface rendering by the computer graphics field, whereby the reversal of the first order effects of subsurface scattering is attempted. The SSI algorithm is then evaluated against several preprocessing techniques, and using various permutations of feature extraction and subspace projection algorithms. The results of this evaluation show an improvement in cross spectral face recognition performance using SSI over existing Retinex-based approaches. The top performing combination of an existing photometric normalisation technique, Sequential Chain, is seen to be the best performing with a Rank 1 recognition rate of 92. 5%. In addition, the improvement in performance using non-linear projection models shows an element of non-linearity exists in the relationship between NIR and VIS
Cancer diagnosis using deep learning: A bibliographic review
In this paper, we first describe the basics of the field of cancer diagnosis, which includes steps of cancer diagnosis followed by the typical classification methods used by doctors, providing a historical idea of cancer classification techniques to the readers. These methods include Asymmetry, Border, Color and Diameter (ABCD) method, seven-point detection method, Menzies method, and pattern analysis. They are used regularly by doctors for cancer diagnosis, although they are not considered very efficient for obtaining better performance. Moreover, considering all types of audience, the basic evaluation criteria are also discussed. The criteria include the receiver operating characteristic curve (ROC curve), Area under the ROC curve (AUC), F1 score, accuracy, specificity, sensitivity, precision, dice-coefficient, average accuracy, and Jaccard index. Previously used methods are considered inefficient, asking for better and smarter methods for cancer diagnosis. Artificial intelligence and cancer diagnosis are gaining attention as a way to define better diagnostic tools. In particular, deep neural networks can be successfully used for intelligent image analysis. The basic framework of how this machine learning works on medical imaging is provided in this study, i.e., pre-processing, image segmentation and post-processing. The second part of this manuscript describes the different deep learning techniques, such as convolutional neural networks (CNNs), generative adversarial models (GANs), deep autoencoders (DANs), restricted Boltzmann’s machine (RBM), stacked autoencoders (SAE), convolutional autoencoders (CAE), recurrent neural networks (RNNs), long short-term memory (LTSM), multi-scale convolutional neural network (M-CNN), multi-instance learning convolutional neural network (MIL-CNN). For each technique, we provide Python codes, to allow interested readers to experiment with the cited algorithms on their own diagnostic problems. The third part of this manuscript compiles the successfully applied deep learning models for different types of cancers. Considering the length of the manuscript, we restrict ourselves to the discussion of breast cancer, lung cancer, brain cancer, and skin cancer. The purpose of this bibliographic review is to provide researchers opting to work in implementing deep learning and artificial neural networks for cancer diagnosis a knowledge from scratch of the state-of-the-art achievements
Image Quality Improvement of Medical Images using Deep Learning for Computer-aided Diagnosis
Retina image analysis is an important screening tool for early detection of multiple dis eases such as diabetic retinopathy which greatly impairs visual function. Image analy sis and pathology detection can be accomplished both by ophthalmologists and by the
use of computer-aided diagnosis systems. Advancements in hardware technology led to
more portable and less expensive imaging devices for medical image acquisition. This
promotes large scale remote diagnosis by clinicians as well as the implementation of
computer-aided diagnosis systems for local routine disease screening. However, lower cost equipment generally results in inferior quality images. This may jeopardize the
reliability of the acquired images and thus hinder the overall performance of the diagnos tic tool. To solve this open challenge, we carried out an in-depth study on using different
deep learning-based frameworks for improving retina image quality while maintaining
the underlying morphological information for the diagnosis. Our results demonstrate
that using a Cycle Generative Adversarial Network for unpaired image-to-image trans lation leads to successful transformations of retina images from a low- to a high-quality
domain. The visual evidence of this improvement was quantitatively affirmed by the two
proposed validation methods. The first used a retina image quality classifier to confirm a
significant prediction label shift towards a quality enhance. On average, a 50% increase
of images being classified as high-quality was verified. The second analysed the perfor mance modifications of a diabetic retinopathy detection algorithm upon being trained
with the quality-improved images. The latter led to strong evidence that the proposed
solution satisfies the requirement of maintaining the images’ original information for
diagnosis, and that it assures a pathology-assessment more sensitive to the presence of
pathological signs. These experimental results confirm the potential effectiveness of our
solution in improving retina image quality for diagnosis. Along with the addressed con tributions, we analysed how the construction of the data sets representing the low-quality
domain impacts the quality translation efficiency. Our findings suggest that by tackling
the problem more selectively, that is, constructing data sets more homogeneous in terms
of their image defects, we can obtain more accentuated quality transformations
Upper airways segmentation using principal curvatures
Esta tesis propone una nueva técnica para segmentar las vías aéreas superiores. Esta propuesta
permite la extracción de estructuras curvilíneas usando curvaturas principales. La propuesta
permite la extracción de éstas estructuras en imágenes 2D y 3D. Entre las principales novedades
se encuentra la propuesta de un nuevo criterio de parada en la propagación del algoritmo de
realce de contraste (operador multi-escala de tipo sombrero alto). De la misma forma, el criterio
de parada propuesto es usado para detener los algoritmos de difusión anisotrópica. Además, un
nuevo criterio es propuesto para seleccionar las curvaturas principales que conforman las
estructuras curvilíneas, que se basa en los criterios propuestos por Steger, Deng et. al. y
Armande et. al. Además, se propone un nuevo algoritmo para realizar la supresión de nomáximos
que permite reducir la presencia de discontinuidades en el borde de las estructuras
curvilíneas. Para extraer los bordes de las estructuras curvilíneas, se utiliza un algoritmo de
enlace que incluye un nuevo criterio de distancia para reducir la aparición de agujeros en la
estructura final. Finalmente, con base en los resultados obtenidos, se utiliza un algoritmo
morfológico para cerrar los agujeros y se aplica un algoritmo de crecimiento de regiones para
obtener la segmentación final de las vías respiratorias superiores.This dissertation proposes a new approach to segment the upper airways. This proposal allows
the extraction of curvilinear structures based on the principal curvatures. The proposal
allows extracting these structures from 2D and 3D images. Among the main novelties is the
proposal of a new stopping criterion to stop the propagation of the contrast enhancement algorithm
(multiscale top-hat morphological operator). In the same way, the proposed stopping
criterion is used to stop the anisotropic diffusion algorithms. In addition, a new criterion is
proposed to select the principal curvatures that make up the curvilinear structures, which is
based on the criteria proposed by Steger, Deng et. al. and Armande et. al. Furthermore, a
new algorithm to perform the non-maximum suppression that allows reducing the presence
of discontinuities in the border of curvilinear structures is proposed. To extract the edges of
the curvilinear structures, a linking algorithm is used that includes a new distance criterion to
reduce the appearance of gaps in the final structure. Finally, based on the obtained results, a
morphological algorithm is used to close the gaps and a region growing algorithm to obtain
the final upper airways segmentation is applied.Doctor en IngenieríaDoctorad
Aerospace Medicine and Biology: A continuing bibliography with indexes, supplement 178
This bibliography lists 230 reports, articles, and other documents introduced into the NASA scientific and technical information system in February 1978
Sea-Surface Object Detection Based on Electro-Optical Sensors: A Review
Sea-surface object detection is critical for navigation safety of autonomous ships. Electrooptical (EO) sensors, such as video cameras, complement radar on board in detecting small obstacle
sea-surface objects. Traditionally, researchers have used horizon detection, background subtraction, and
foreground segmentation techniques to detect sea-surface objects. Recently, deep learning-based object
detection technologies have been gradually applied to sea-surface object detection. This article demonstrates a comprehensive overview of sea-surface object-detection approaches where the advantages
and drawbacks of each technique are compared, covering four essential aspects: EO sensors and image
types, traditional object-detection methods, deep learning methods, and maritime datasets collection. In
particular, sea-surface object detections based on deep learning methods are thoroughly analyzed and
compared with highly influential public datasets introduced as benchmarks to verify the effectiveness of
these approaches. The arti