SketchyGAN: Towards Diverse and Realistic Sketch to Image Synthesis

Author: Chen Wengling
Hays James
Publication date: 12/04/2018

Synthesizing realistic images from human-drawn sketches is a challenging problem in computer graphics and vision. Existing approaches either need exact edge maps or rely on retrieval of existing photographs. In this work, we propose a novel Generative Adversarial Network (GAN) approach that synthesizes plausible images from 50 categories, including motorcycles, horses, and couches. We demonstrate a fully automatic data augmentation technique for sketches, and we show that the augmented data is helpful to our task. We introduce a new network building block, suitable for both the generator and the discriminator, which improves the information flow by injecting the input image at multiple scales. Compared to state-of-the-art image translation methods, our approach generates more realistic images and achieves significantly higher Inception Scores. Comment: Accepted to CVPR 201

arXiv.org e-Print Archive

Crossref

Style Separation and Synthesis via Generative Adversarial Networks

Author: Alexei
Chen Dongdong
Chi Jingze
Diederik
Dumoulin Vincent
Gatys Leon A.
Glorot Xavier
Huang Xun
Ioffe Sergey
Isola Phillip
Johnson Justin
Kim Taeksoo
Li Chuan
Li Chuan
Li Yijun
Pathak Deepak
Radford Alec
Reed Scott E.
Shen Wei
Ulyanov Dmitry
Wang Xiaolong
Zhang Jian
Zhang Rui
Zhu Jun-Yan
Publication venue: Association for Computing Machinery (ACM)
Publication date: 06/11/2018

Style synthesis has attracted great interest recently, while few works focus on its dual problem, "style separation". In this paper, we propose the Style Separation and Synthesis Generative Adversarial Network (S3-GAN) to simultaneously perform style separation and style synthesis on object photographs of specific categories. Based on the assumption that the object photographs lie on a manifold, and that contents and styles are independent, we employ S3-GAN to build mappings between the manifold and a latent vector space for separating and synthesizing the contents and styles. The S3-GAN consists of an encoder network, a generator network, and an adversarial network. The encoder network performs style separation by mapping an object photograph to a latent vector. The two halves of the latent vector represent the content and style, respectively. The generator network performs style synthesis by taking a concatenated vector as input. The concatenated vector contains the style half vector of the style target image and the content half vector of the content target image. Once the images are obtained from the generator network, an adversarial network is imposed to generate more photo-realistic images. Experiments on the CelebA and UT Zappos 50K datasets demonstrate that the S3-GAN is capable of style separation and synthesis simultaneously, and can capture various styles in a single model.
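The half-vector recombination described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: plain Python lists stand in for encoder outputs, the helper names `split_latent` and `recombine` are hypothetical, and placing the content half first in the concatenation is an assumption, since the abstract does not state the order.

```python
# Illustrative sketch only; names and the concatenation order are assumptions.

def split_latent(z):
    """Split a latent vector into (content_half, style_half)."""
    if len(z) % 2 != 0:
        raise ValueError("latent vector must have even length")
    mid = len(z) // 2
    return z[:mid], z[mid:]


def recombine(z_content_target, z_style_target):
    """Build a generator input from two encoded images.

    Takes the content half of the content target and the style half
    of the style target, then concatenates them (content first, by assumption).
    """
    content, _ = split_latent(z_content_target)
    _, style = split_latent(z_style_target)
    return content + style


# Stand-in latent vectors; a real encoder network would produce these.
z_a = [0.1, 0.2, 0.3, 0.4]  # latent of the content target image
z_b = [0.9, 0.8, 0.7, 0.6]  # latent of the style target image
mixed = recombine(z_a, z_b)
```

In a real model the mixed vector would be passed to the generator network; here it only shows which halves are kept from each image.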

arXiv.org e-Print Archive

Crossref

Analysis-by-synthesis: Pedestrian tracking with crowd simulation models in a multi-camera video network

Author: Bhanu B
Jin Z
Publication venue: eScholarship, University of California
Publication date: 01/05/2015

For tracking systems consisting of multiple cameras with overlapping fields of view, homography-based approaches are widely adopted to significantly reduce occlusions among pedestrians by sharing information among multiple views. However, these approaches make only preliminary use of information in real-world coordinates. Therefore, in this paper, a multi-camera tracking system with integrated crowd simulation is proposed in order to explore the possibility of making homography information more helpful. Two crowd simulators with different simulation strategies are used to investigate the influence of the simulation strategy on the final tracking performance. The performance is evaluated using the multiple object tracking precision and accuracy (MOTP and MOTA) metrics, both for all the camera views and for the results obtained under real-world coordinates. The experimental results demonstrate that crowd simulators boost the tracking performance significantly, especially for crowded scenes with higher density. In addition, a more realistic simulation strategy helps to further improve the overall tracking result.
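For readers unfamiliar with the two metrics named in this abstract, the sketch below computes them using the commonly cited CLEAR MOT definitions. Whether the paper uses exactly this variant is an assumption (some benchmarks define MOTP by bounding-box overlap rather than distance), and the function and parameter names are hypothetical.

```python
# Sketch of the commonly used CLEAR MOT definitions; the exact variant
# used in the paper is an assumption.

def mota(false_negatives, false_positives, id_switches, num_ground_truth):
    """Accuracy: 1 - (misses + false positives + identity switches) / ground-truth objects.

    All counts are summed over every frame of the sequence.
    """
    if num_ground_truth <= 0:
        raise ValueError("num_ground_truth must be positive")
    errors = false_negatives + false_positives + id_switches
    return 1.0 - errors / num_ground_truth


def motp(match_distances):
    """Precision: mean localization error over all matched object-hypothesis pairs."""
    if not match_distances:
        raise ValueError("at least one match is required")
    return sum(match_distances) / len(match_distances)


# Made-up counts purely for illustration, not results from the paper.
example_mota = mota(false_negatives=10, false_positives=5,
                    id_switches=5, num_ground_truth=100)
example_motp = motp([0.2, 0.4])
```

With the distance-based definition, a lower MOTP means tighter localization, while a higher MOTA means fewer tracking errors overall.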

eScholarship - University of California