Using Photorealistic Face Synthesis and Domain Adaptation to Improve Facial Expression Analysis

Authors: Bozorgtabar Behzad; Ekenel Hazim Kemal; Rad Mohammad Saeed; Thiran Jean-Philippe
Publication date: 14/04/2019

Synthesizing realistic faces across domains to train deep models has attracted increasing attention in facial expression analysis, as it helps improve expression recognition accuracy despite a small number of real training images. However, learning from synthetic face images can be problematic because of the distribution discrepancy between low-quality synthetic images and real face images, and the learned model may not achieve the desired performance when applied to real-world scenarios. To this end, we propose a new attribute-guided face image synthesis method that translates between multiple image domains using a single model. In addition, we adopt the proposed model to learn from synthetic faces by matching the feature distributions between different domains while preserving each domain's characteristics. We evaluate the effectiveness of the proposed approach at generating realistic face images on several face datasets. We demonstrate that expression recognition performance can be enhanced by our face synthesis model. Moreover, we conduct experiments on a near-infrared dataset containing facial expression videos of drivers to assess performance on in-the-wild data for driver emotion recognition.

Comment: 8 pages, 8 figures, 5 tables, accepted by FG 2019. arXiv admin note: substantial text overlap with arXiv:1905.0028

arXiv.org e-Print Archive

Infoscience - École polytechnique fédérale de Lausanne

Crossref

Unsupervised Domain Adaptation for Face Recognition in Unlabeled Videos

Authors: Chandraker Manmohan; Liu Sifei; Sohn Kihyuk; Yang Ming-Hsuan; Yu Xiang; Zhong Guangyu
Publication date: 07/08/2017

Despite rapid advances in face recognition, there remains a clear gap between the performance of still image-based face recognition and video-based face recognition, due to the vast difference in visual quality between the domains and the difficulty of curating diverse large-scale video datasets. This paper addresses both of those challenges through an image-to-video feature-level domain adaptation approach that learns discriminative video frame representations. The framework uses large-scale unlabeled video data to reduce the gap between the domains while transferring discriminative knowledge from large-scale labeled still images. Given a face recognition network that is pretrained in the image domain, the adaptation is achieved by (i) distilling knowledge from the network to a video adaptation network through feature matching, (ii) performing feature restoration through synthetic data augmentation and (iii) learning a domain-invariant feature through a domain adversarial discriminator. We further improve performance through a discriminator-guided feature fusion that boosts high-quality frames while eliminating those degraded by video domain-specific factors. Experiments on the YouTube Faces and IJB-A datasets demonstrate that each module contributes to our feature-level domain adaptation framework and substantially improves video face recognition performance to achieve state-of-the-art accuracy. We demonstrate qualitatively that the network learns to suppress diverse artifacts in videos, such as pose, illumination or occlusion, without being explicitly trained for them.

Comment: accepted for publication at International Conference on Computer Vision (ICCV) 201
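Two of the ingredients named in this abstract, feature matching for distillation and discriminator-guided fusion of frame features, can be illustrated with a minimal sketch. The abstract gives no formulas, so everything below is an assumption made for illustration: the function names `feature_matching_loss` and `fuse_frames`, the use of a mean squared difference as the matching loss, the `min_score` threshold, and the reading of a discriminator score as a per-frame quality weight. It is not the authors' implementation.

```python
def feature_matching_loss(student_feat, teacher_feat):
    """Mean squared difference between two equal-length feature vectors.

    Assumption: the pretrained image network (teacher) and the video
    adaptation network (student) emit features of the same dimension.
    """
    if len(student_feat) != len(teacher_feat):
        raise ValueError("feature vectors must have the same length")
    squared = [(s - t) ** 2 for s, t in zip(student_feat, teacher_feat)]
    return sum(squared) / len(squared)


def fuse_frames(frame_feats, disc_scores, min_score=0.0):
    """Score-weighted average of per-frame feature vectors.

    Assumption: each score is a positive number where a higher value
    means the discriminator judges the frame to be closer to the
    still-image domain. Frames scoring at or below `min_score` are dropped.
    """
    kept = [(f, s) for f, s in zip(frame_feats, disc_scores) if s > min_score]
    if not kept:
        raise ValueError("no frame passed the score threshold")
    total = sum(s for _, s in kept)
    dim = len(kept[0][0])
    return [sum(f[d] * s for f, s in kept) / total for d in range(dim)]


if __name__ == "__main__":
    print(feature_matching_loss([1.0, 2.0], [1.0, 4.0]))
    print(fuse_frames([[1.0, 0.0], [3.0, 4.0]], [0.25, 0.75]))
```

Dropping low-scoring frames before averaging is one simple way to mimic "eliminating" degraded frames; a softer weighting scheme would be an equally plausible reading of the abstract.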

arXiv.org e-Print Archive

Crossref

Characterization of linear-mode avalanche photodiodes in standard CMOS

Authors: Diéguez Barrientos Àngel; López de Miguel Manuel; Palacio Fernando; Vilella Figueras Eva; Vilà i Arbonès Anna Maria
Publication venue: Elsevier BV
Publication date: 07/10/2015

Linear-mode avalanche photodiodes (APDs) can be fabricated in standard CMOS processes to obtain high multiplication gains that allow the number of incident photons to be determined with great precision. This idea can be exploited in several application domains, such as image sensors, optical communications and quantum information. In this work, we present a linear-mode APD fabricated in a 0.35 µm CMOS process and report its noise and gain characterization by means of two different experimental set-ups. Good agreement is observed between the results obtained with the two methods.
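The abstract does not describe the two experimental set-ups, so the following is only a generic textbook sketch of what gain and noise characterization of an APD typically computes. It assumes the multiplication gain is estimated as the ratio of dark-corrected photocurrents at the operating bias and at a low, unity-gain reference bias, and it uses McIntyre's formula for the excess noise factor, F(M) = kM + (1 - k)(2 - 1/M), where k is the ionization coefficient ratio. The function and parameter names are hypothetical.

```python
def multiplication_gain(i_light, i_dark, i_light_ref, i_dark_ref):
    """Gain as a ratio of dark-corrected photocurrents.

    i_light, i_dark: currents at the operating bias (amperes).
    i_light_ref, i_dark_ref: currents at a reference bias where the
    gain is assumed to be 1.
    """
    reference = i_light_ref - i_dark_ref
    if reference <= 0:
        raise ValueError("reference photocurrent must be positive")
    return (i_light - i_dark) / reference


def mcintyre_excess_noise(gain, k):
    """McIntyre excess noise factor for gain M >= 1 and ratio 0 <= k <= 1."""
    if gain < 1 or not 0.0 <= k <= 1.0:
        raise ValueError("require gain >= 1 and 0 <= k <= 1")
    return k * gain + (1.0 - k) * (2.0 - 1.0 / gain)


if __name__ == "__main__":
    m = multiplication_gain(1.1e-6, 1.0e-7, 1.1e-7, 1.0e-8)
    print(m, mcintyre_excess_noise(m, 0.1))
```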

Diposit Digital de la Universitat de Barcelona

Beyond 2D-grids: a dependence maximization view on image browsing

Authors: Buntine WL; Kersting K; Quadrianto N; Tuytelaars T
Publication venue: Association for Computing Machinery (ACM)
Publication date: 01/01/2010

Ideally, one would like to perform image search using an intuitive and friendly approach. Many existing image search engines, however, present users with sets of images arranged only in some default order on the screen, typically by relevance to a query. While this certainly has its advantages, a more flexible and intuitive way would arguably be to sort images into arbitrary structures such as grids, hierarchies, or spheres so that images that are visually or semantically alike are placed together. This paper focuses on designing such a navigation system for image browsers. This is a challenging task because an arbitrary layout structure makes it difficult, if not impossible, to compute cross-similarities between images and structure coordinates, the main ingredient of traditional layout approaches. For this reason, we resort to a recently developed machine learning technique: kernelized sorting. It is a general technique for matching pairs of objects from different domains without requiring cross-domain similarity measures, and hence it elegantly allows sorting images into arbitrary structures. Moreover, we extend it so that some images can be preselected, for instance to form the tip of the hierarchy, which allows users to subsequently navigate through the search results in the lower levels in an intuitive way.
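The idea behind kernelized sorting can be sketched on a toy problem: compute one kernel matrix over the objects and another over the layout positions, center both, and pick the assignment that maximizes the dependence between them, so that no cross-domain similarity is ever needed. The sketch below is an illustration under stated assumptions, not the paper's algorithm: it uses an RBF kernel on both sides and finds the best assignment by brute-force enumeration, which is only feasible for a handful of items (practical solvers iterate over linear assignment problems instead). All function names are hypothetical.

```python
import math
from itertools import permutations


def rbf_kernel(points, gamma=1.0):
    """Pairwise RBF kernel matrix for a list of coordinate lists."""
    def sqdist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return [[math.exp(-gamma * sqdist(p, q)) for q in points] for p in points]


def center(K):
    """Double-center a square kernel matrix (subtract row/column means)."""
    n = len(K)
    row = [sum(r) / n for r in K]
    col = [sum(K[i][j] for i in range(n)) / n for j in range(n)]
    total = sum(row) / n
    return [[K[i][j] - row[i] - col[j] + total for j in range(n)]
            for i in range(n)]


def dependence(Kc, Lc, perm):
    """Alignment of the two centered kernels when object i goes to slot perm[i]."""
    n = len(Kc)
    return sum(Kc[i][j] * Lc[perm[i]][perm[j]]
               for i in range(n) for j in range(n))


def kernelized_sort_bruteforce(objects, slots, gamma=1.0):
    """Return the slot index for each object that maximizes the dependence."""
    if len(objects) != len(slots):
        raise ValueError("need as many slots as objects")
    Kc = center(rbf_kernel(objects, gamma))
    Lc = center(rbf_kernel(slots, gamma))
    return max(permutations(range(len(objects))),
               key=lambda p: dependence(Kc, Lc, p))


if __name__ == "__main__":
    # Four one-dimensional "image features" placed onto a 1x4 strip of slots.
    features = [[2.0], [0.0], [3.0], [1.0]]
    strip = [[0.0], [1.0], [2.0], [3.0]]
    print(kernelized_sort_bruteforce(features, strip))
```

On this toy input, similar features end up in neighbouring slots. The layout is only determined up to mirroring the strip, because the objective compares within-domain similarities and is therefore blind to such symmetries.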

CiteSeerX

Crossref

Fraunhofer-ePrints

Sussex Research Online

CUED - Cambridge University Engineering Department

Mathematical Imaging and Surface Processing

Publication venue: Zürich : EMS Publ. House
Publication date: 01/01/2016

Within the last decade, image and geometry processing have become increasingly rigorous, with solid foundations in mathematics. Both areas are research fields at the intersection of different mathematical disciplines, ranging from geometry and calculus of variations to PDE analysis and numerical analysis. The workshop brought together scientists from all these areas, and a fruitful interplay took place. There was a lively exchange of ideas between geometry and image processing application areas, which showed up in a number of ways in this workshop. For example, optimal transport, first applied in computer vision, is now used to define a distance measure between 3D shapes; spectral analysis, a tool in image processing, can be applied to surface classification and matching; and so on. We have also seen the use of Riemannian geometry as a powerful tool to improve the analysis of multivalued images. This volume collects the abstracts of all the presentations covering this wide spectrum of tools and application domains.

Repositorium für Naturwissenschaften und Technik