12,121 research outputs found
Screen Content Image Segmentation Using Sparse-Smooth Decomposition
Sparse decomposition has been extensively used for different applications
including signal compression and denoising and document analysis. In this
paper, sparse decomposition is used for image segmentation. The proposed
algorithm separates the background and foreground using a sparse-smooth
decomposition technique such that the smooth and sparse components correspond
to the background and foreground respectively. This algorithm is tested on
several test images from HEVC test sequences and is shown to have superior
performance over other methods, such as the hierarchical k-means clustering in
DjVu. This segmentation algorithm can also be used for text extraction, video
compression and medical image segmentation.Comment: Asilomar Conference on Signals, Systems and Computers, IEEE, 2015,
(to Appear
Face recognition technologies for evidential evaluation of video traces
Human recognition from video traces is an important task in forensic investigations and evidence evaluations. Compared with other biometric traits, face is one of the most popularly used modalities for human recognition due to the fact that its collection is non-intrusive and requires less cooperation from the subjects. Moreover, face images taken at a long distance can still provide reasonable resolution, while most biometric modalities, such as iris and fingerprint, do not have this merit. In this chapter, we discuss automatic face recognition technologies for evidential evaluations of video traces. We first introduce the general concepts in both forensic and automatic face recognition , then analyse the difficulties in face recognition from videos . We summarise and categorise the approaches for handling different uncontrollable factors in difficult recognition conditions. Finally we discuss some challenges and trends in face recognition research in both forensics and biometrics . Given its merits tested in many deployed systems and great potential in other emerging applications, considerable research and development efforts are expected to be devoted in face recognition in the near future
Generating and analyzing synthetic finger vein images
Abstract: The finger-vein biometric offers higher degree of security, personal privacy and strong anti-spoofing capabilities than most other biometric modalities employed today. Emerging privacy concerns with the database acquisition and lack of availability of large scale finger-vein database have posed challenges in exploring this technology for large scale applications. This paper details the first such attempt to synthesize finger-vein images and presents analysis of synthesized images for the biometrics authentication. We generate a database of 50,000 finger vein images, corresponding to 5000 different subjects, with 10 different synthesized finger-vein images from each of the subject. We use tractable probability models to compare synthesized finger-vein images with the real finger- vein images for their image variability. This paper also presents matching accuracy using the synthesized finger-vein database from 5000 different subjects, using 225000 genuine and 1249750000 impostor matching scores, which suggests significant promises from this finger-vein biometric modality for large scale biometrics applications
TV-GAN: Generative Adversarial Network Based Thermal to Visible Face Recognition
This work tackles the face recognition task on images captured using thermal
camera sensors which can operate in the non-light environment. While it can
greatly increase the scope and benefits of the current security surveillance
systems, performing such a task using thermal images is a challenging problem
compared to face recognition task in the Visible Light Domain (VLD). This is
partly due to the much smaller amount number of thermal imagery data collected
compared to the VLD data. Unfortunately, direct application of the existing
very strong face recognition models trained using VLD data into the thermal
imagery data will not produce a satisfactory performance. This is due to the
existence of the domain gap between the thermal and VLD images. To this end, we
propose a Thermal-to-Visible Generative Adversarial Network (TV-GAN) that is
able to transform thermal face images into their corresponding VLD images
whilst maintaining identity information which is sufficient enough for the
existing VLD face recognition models to perform recognition. Some examples are
presented in Figure 1. Unlike the previous methods, our proposed TV-GAN uses an
explicit closed-set face recognition loss to regularize the discriminator
network training. This information will then be conveyed into the generator
network in the forms of gradient loss. In the experiment, we show that by using
this additional explicit regularization for the discriminator network, the
TV-GAN is able to preserve more identity information when translating a thermal
image of a person which is not seen before by the TV-GAN
Toward Flare-Free Images: A Survey
Lens flare is a common image artifact that can significantly degrade image
quality and affect the performance of computer vision systems due to a strong
light source pointing at the camera. This survey provides a comprehensive
overview of the multifaceted domain of lens flare, encompassing its underlying
physics, influencing factors, types, and characteristics. It delves into the
complex optics of flare formation, arising from factors like internal
reflection, scattering, diffraction, and dispersion within the camera lens
system. The diverse categories of flare are explored, including scattering,
reflective, glare, orb, and starburst types. Key properties such as shape,
color, and localization are analyzed. The numerous factors impacting flare
appearance are discussed, spanning light source attributes, lens features,
camera settings, and scene content. The survey extensively covers the wide
range of methods proposed for flare removal, including hardware optimization
strategies, classical image processing techniques, and learning-based methods
using deep learning. It not only describes pioneering flare datasets created
for training and evaluation purposes but also how they were created. Commonly
employed performance metrics such as PSNR, SSIM, and LPIPS are explored.
Challenges posed by flare's complex and data-dependent characteristics are
highlighted. The survey provides insights into best practices, limitations, and
promising future directions for flare removal research. Reviewing the
state-of-the-art enables an in-depth understanding of the inherent complexities
of the flare phenomenon and the capabilities of existing solutions. This can
inform and inspire new innovations for handling lens flare artifacts and
improving visual quality across various applications
Exploring Deep Learning Image Super-Resolution for Iris Recognition
In this work we test the ability of deep learning methods to provide an
end-to-end mapping between low and high resolution images applying it to the
iris recognition problem. Here, we propose the use of two deep learning
single-image super-resolution approaches: Stacked Auto-Encoders (SAE) and
Convolutional Neural Networks (CNN) with the most possible lightweight
structure to achieve fast speed, preserve local information and reduce
artifacts at the same time. We validate the methods with a database of 1.872
near-infrared iris images with quality assessment and recognition experiments
showing the superiority of deep learning approaches over the compared
algorithms.Comment: Published at Proc. 25th European Signal Processing Conference,
EUSIPCO 201
- …