236 research outputs found
Sliced Wasserstein Generative Models
In generative modeling, the Wasserstein distance (WD) has emerged as a useful
metric to measure the discrepancy between generated and real data
distributions. Unfortunately, it is challenging to approximate the WD of
high-dimensional distributions. In contrast, the sliced Wasserstein distance
(SWD) factorizes high-dimensional distributions into their multiple
one-dimensional marginal distributions and is thus easier to approximate. In
this paper, we introduce novel approximations of the primal and dual SWD.
Instead of using a large number of random projections, as it is done by
conventional SWD approximation methods, we propose to approximate SWDs with a
small number of parameterized orthogonal projections in an end-to-end deep
learning fashion. As concrete applications of our SWD approximations, we design
two types of differentiable SWD blocks to equip modern generative
frameworks---Auto-Encoders (AE) and Generative Adversarial Networks (GAN). In
the experiments, we not only show the superiority of the proposed generative
models on standard image synthesis benchmarks, but also demonstrate the
state-of-the-art performance on challenging high resolution image and video
generation in an unsupervised manner.Comment: This paper is accepted by CVPR 2019, accidentally uploaded as a new
submission (arXiv:1904.05408, which has been withdrawn). The code is
available at this https URL https:// github.com/musikisomorphie/swd.gi
Quality Aware Generative Adversarial Networks
Generative Adversarial Networks (GANs) have become a very popular tool for
implicitly learning high-dimensional probability distributions. Several
improvements have been made to the original GAN formulation to address some of
its shortcomings like mode collapse, convergence issues, entanglement, poor
visual quality etc. While a significant effort has been directed towards
improving the visual quality of images generated by GANs, it is rather
surprising that objective image quality metrics have neither been employed as
cost functions nor as regularizers in GAN objective functions. In this work, we
show how a distance metric that is a variant of the Structural SIMilarity
(SSIM) index (a popular full-reference image quality assessment algorithm), and
a novel quality aware discriminator gradient penalty function that is inspired
by the Natural Image Quality Evaluator (NIQE, a popular no-reference image
quality assessment algorithm) can each be used as excellent regularizers for
GAN objective functions. Specifically, we demonstrate state-of-the-art
performance using the Wasserstein GAN gradient penalty (WGAN-GP) framework over
CIFAR-10, STL10 and CelebA datasets.Comment: 10 pages, NeurIPS 201
Generative Adversarial Networks (GANs): Challenges, Solutions, and Future Directions
Generative Adversarial Networks (GANs) is a novel class of deep generative
models which has recently gained significant attention. GANs learns complex and
high-dimensional distributions implicitly over images, audio, and data.
However, there exists major challenges in training of GANs, i.e., mode
collapse, non-convergence and instability, due to inappropriate design of
network architecture, use of objective function and selection of optimization
algorithm. Recently, to address these challenges, several solutions for better
design and optimization of GANs have been investigated based on techniques of
re-engineered network architectures, new objective functions and alternative
optimization algorithms. To the best of our knowledge, there is no existing
survey that has particularly focused on broad and systematic developments of
these solutions. In this study, we perform a comprehensive survey of the
advancements in GANs design and optimization solutions proposed to handle GANs
challenges. We first identify key research issues within each design and
optimization technique and then propose a new taxonomy to structure solutions
by key research issues. In accordance with the taxonomy, we provide a detailed
discussion on different GANs variants proposed within each solution and their
relationships. Finally, based on the insights gained, we present the promising
research directions in this rapidly growing field.Comment: 42 pages, Figure 13, Table
- …