Search CORE

3,153 research outputs found

End-to-End Photo-Sketch Generation via Fully Convolutional Representation Learning

Author: Ding Shengyong
Lin Liang
Wu Xian
Zhang Lei
Zhang Liliang
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 11/04/2015
Field of study

Sketch-based face recognition is an interesting task in vision and multimedia research, yet it is quite challenging due to the great difference between face photos and sketches. In this paper, we propose a novel approach for photo-sketch generation, aiming to automatically transform face photos into detail-preserving personal sketches. Unlike the traditional models synthesizing sketches based on a dictionary of exemplars, we develop a fully convolutional network to learn the end-to-end photo-sketch mapping. Our approach takes whole face photos as inputs and directly generates the corresponding sketch images with efficient inference and learning, in which the architecture are stacked by only convolutional kernels of very small sizes. To well capture the person identity during the photo-sketch transformation, we define our optimization objective in the form of joint generative-discriminative minimization. In particular, a discriminative regularization term is incorporated into the photo-sketch generation, enhancing the discriminability of the generated person sketches against other individuals. Extensive experiments on several standard benchmarks suggest that our approach outperforms other state-of-the-art methods in both photo-sketch generation and face sketch verification.Comment: 8 pages, 6 figures. Proceeding in ACM International Conference on Multimedia Retrieval (ICMR), 201

arXiv.org e-Print Archive

Crossref

High-Quality Facial Photo-Sketch Synthesis Using Multi-Adversarial Networks

Author: Patel Vishal M.
Sindagi Vishwanath A.
Wang Lidan
Publication venue
Publication date: 02/03/2018
Field of study

Synthesizing face sketches from real photos and its inverse have many applications. However, photo/sketch synthesis remains a challenging problem due to the fact that photo and sketch have different characteristics. In this work, we consider this task as an image-to-image translation problem and explore the recently popular generative models (GANs) to generate high-quality realistic photos from sketches and sketches from photos. Recent GAN-based methods have shown promising results on image-to-image translation problems and photo-to-sketch synthesis in particular, however, they are known to have limited abilities in generating high-resolution realistic images. To this end, we propose a novel synthesis framework called Photo-Sketch Synthesis using Multi-Adversarial Networks, (PS2-MAN) that iteratively generates low resolution to high resolution images in an adversarial way. The hidden layers of the generator are supervised to first generate lower resolution images followed by implicit refinement in the network to generate higher resolution images. Furthermore, since photo-sketch synthesis is a coupled/paired translation problem, we leverage the pair information using CycleGAN framework. Both Image Quality Assessment (IQA) and Photo-Sketch Matching experiments are conducted to demonstrate the superior performance of our framework in comparison to existing state-of-the-art solutions. Code available at: https://github.com/lidan1/PhotoSketchMAN.Comment: Accepted by 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018)(Oral

arXiv.org e-Print Archive

Crossref

r-BTN: Cross-domain Face Composite and Synthesis from Limited Facial Patches

Author: Qi Hairong
Song Yang
Zhang Zhifei
Publication venue
Publication date: 06/12/2017
Field of study

We start by asking an interesting yet challenging question, "If an eyewitness can only recall the eye features of the suspect, such that the forensic artist can only produce a sketch of the eyes (e.g., the top-left sketch shown in Fig. 1), can advanced computer vision techniques help generate the whole face image?" A more generalized question is that if a large proportion (e.g., more than 50%) of the face/sketch is missing, can a realistic whole face sketch/image still be estimated. Existing face completion and generation methods either do not conduct domain transfer learning or can not handle large missing area. For example, the inpainting approach tends to blur the generated region when the missing area is large (i.e., more than 50%). In this paper, we exploit the potential of deep learning networks in filling large missing region (e.g., as high as 95% missing) and generating realistic faces with high-fidelity in cross domains. We propose the recursive generation by bidirectional transformation networks (r-BTN) that recursively generates a whole face/sketch from a small sketch/face patch. The large missing area and the cross domain challenge make it difficult to generate satisfactory results using a unidirectional cross-domain learning structure. On the other hand, a forward and backward bidirectional learning between the face and sketch domains would enable recursive estimation of the missing region in an incremental manner (Fig. 1) and yield appealing results. r-BTN also adopts an adversarial constraint to encourage the generation of realistic faces/sketches. Extensive experiments have been conducted to demonstrate the superior performance from r-BTN as compared to existing potential solutions.Comment: Accepted by AAAI 201

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

An Experimental and Numerical Investigation of Nitrogen Dioxide Emissions Characteristics of Compression Ignition Dual Fuel Engines

Author: Liu Shiyu
Publication venue: The Research Repository @ WVU
Publication date: 01/01/2013
Field of study

Detailed experimental research was conducted to explore the impact of the addition of gaseous fuels, including H2 and natural gas (NG), and engine load on the emissions of NO2, NO, and NOx from dual fuel engines. The addition of less than 2% of H2 or NG was shown to dramatically increase the emissions of NO2 until a maximum level of NO2 emissions was reached. The increased NO 2 emissions were due to the conversion of NO to NO2. The maximum NO2/NOx ratio obtained with the addition of H2 was 3.2 to 5.0 times that of diesel operation. The maximum NO 2/NOx ratio obtained with the addition of NG was 3.4 to 4.3 times that of diesel operation. Further increasing the amount of gaseous fuel beyond the point of maximum NO2 emissions resulted in a reduction of NO2 emissions. Detailed examination of factors having the potential to affect the formation of NOx and NO2 in compression ignition engines reported a firm correlation between the emissions of NO 2 and emissions of unburned H2 and methane (CH4), and their relative emissions. The presence of unburned gaseous fuels that survived the main combustion process appears to be one of the main factors contributing to the enhanced conversion of NO to NO2. This was supported by the experimental data reported in the literature. The presence of fumigation fuels outside the diesel spray plume might be the main factor contributing to the increased emissions of NO2 from dual fuel engines. The spontaneous combustion of fumigation fuels that are entrained into the diesel spray plume may not contribute to the increased emissions of NO 2. In comparison, the correlations between the increased emissions of NO2 and the variation in bulk mixture temperature and heat release process including maximum heat release rate, and combustion duration were weak.;A single zone, zero-dimensional, constant volume numerical model with detailed chemistry was used to simulate the oxidization process of the gaseous fuel, as well as its effect on the conversion of NO to NO2 after the post-combustion mixing of the gaseous fuel surviving the main combustion process with the NOx-containing combustion products. The gaseous fuel examined included CH4, H2, and carbon monoxide (CO). The simulation results revealed the significant effects of the fuel mixed, its initial concentration in the mixture, and the initial temperature on the oxidization of gaseous fuel, the conversion of NO to NO2, and the destruction of NO2 to NO after the completion of the oxidation process.;The single zone zero-dimensional model was further modified to a variable volume model with the volume of the combustion chamber calculated using the geometry of the 1999 Cummins engine and engine speed. The modified variable volume model with detailed chemistry was used to improve the simulation of the effect on the conversion of NO to NO2 of the post-combustion mixing of surviving gaseous fuel with NOx-containing combustion products. The spatial variation of the local bulk mixture temperature with the progress of the combustion process and the variation of cylinder volume during the expansion process was taken into account by a pseudo temperature at the top dead center (TDC) noted as Tpseudo TDC defined in this research. The simulation identified the importance of the phasing of postcombustion mixing on the oxidation of gaseous fuel and its effect on the conversion of NO to NO2.;A preliminary sensitivity analysis was also conducted to identify the reactions having significant effect on the conversion of NO to NO2 and its destruction to NO. Among the four reactions associated with the formation and destruction of NO2, R186 was identified as the main reaction to the formation of NO2 during the oxidation process of H 2 and CO. This was due to the high concentration of HO2 formed during the oxidation process of H2 and CO in the combustion product. The destruction of NO2 to NO occurred through R187 and R189. (Abstract shortened by UMI.)

The Research Repository @ WVU (West Virginia University)

Face Image Modality Recognition and Photo-Sketch Matching

Author: Liu Bingjie
Publication venue: The Research Repository @ WVU
Publication date: 01/01/2017
Field of study

Face is an important physical characteristic of human body, and is widely used in many crucial applications, such as video surveillance, criminal investigation, and security access system. Based on realistic demand, such as useful face images in dark environment and criminal profile, different modalities of face images appeared, e.g. three-dimensional (3D), near infrared (NIR), and thermal infrared (TIR) face images. Thus, researches with various face image modalities become a hot area. Most of them are set on knowing the modality of face images in advance, which contains a few limitations. In this thesis, we present approaches for face image modality recognition to extend the possibility of cross-modality researches as well as handle new modality-mixed face images. Furthermore, a large facial image database is assembled with five commonly used modalities such as 3D, NIR, TIR, sketch, and visible light spectrum (VIS). Based on the analysis of results, a feature descriptor based on convolutional neural network with linear kernel SVM did an optimal performance.;As we mentioned above, face images are widely used in crucial applications, and one of them is using the sketch of suspect\u27s face, which based on the witness\u27 description, to assist law enforcement. Since it is difficult to capture face photos of the suspect during a criminal activity, automatic retrieving photos based on the suspect\u27s facial sketch is used for locating potential suspects. In this thesis, we perform photo-sketch matching by synthesizing the corresponding pseudo sketch from a given photo. There are three methods applied in this thesis, which are respectively based on style transfer, DualGAN, and cycle-consistent adversarial networks. Among the results of these methods, style transfer based method did a poor performance in photo-sketch matching, since it is an unsupervised one which is not purposeful in photo to sketch synthesis problem while the others need to train pointed models in synthesis stage

The Research Repository @ WVU (West Virginia University)