Probabilistic Image Colorization

Authors: Alexander Kolesnikov, Christoph H. Lampert, Amelie Royer
Publication date: 01/01/2017

We develop a probabilistic technique for colorizing grayscale natural images. In light of the intrinsic uncertainty of this task, the proposed probabilistic framework has numerous desirable properties. In particular, our model is able to produce multiple plausible and vivid colorizations for a given grayscale image and is one of the first colorization models to provide a proper stochastic sampling scheme. Moreover, our training procedure is supported by a rigorous theoretical framework that does not require any ad hoc heuristics and allows for efficient modeling and learning of the joint pixel color distribution. We demonstrate strong quantitative and qualitative experimental results on the CIFAR-10 dataset and the challenging ILSVRC 2012 dataset.

arXiv.org e-Print Archive

FloWaveNet : A Generative Flow for Raw Audio

Authors: Jaehyeon Kim, Sungwon Kim, Sang-gil Lee, Jongyoon Song, Sungroh Yoon
Publication date: 20/05/2019

Most modern text-to-speech architectures use a WaveNet vocoder for synthesizing high-fidelity waveform audio, but there have been limitations, such as high inference time, in its practical application due to its ancestral sampling scheme. The recently suggested Parallel WaveNet and ClariNet have achieved real-time audio synthesis capability by incorporating inverse autoregressive flow for parallel sampling. However, these approaches require a two-stage training pipeline with a well-trained teacher network and can only produce natural sound by using probability distillation along with auxiliary loss terms. We propose FloWaveNet, a flow-based generative model for raw audio synthesis. FloWaveNet requires only a single-stage training procedure and a single maximum likelihood loss, without any additional auxiliary terms, and it is inherently parallel due to the characteristics of generative flow. The model can efficiently sample raw audio in real-time, with clarity comparable to previous two-stage parallel models. The code and samples for all models, including our FloWaveNet, are publicly available.

Comment: 9 pages, ICML'201

arXiv.org e-Print Archive

Fostering the potential endogenous development of European regions: a spatial dynamic panel data analysis of the Cohesion Policy on regional convergence over the period 1980-2005

Authors: Lionel Vedrine, Nadine Turpin, Salima Bouayad-Agha

Keywords: Dynamic panels, GMM, Regional Convergence, Spatial Dependence, Structural Funds

Auxiliary Guided Autoregressive Variational Autoencoders

Authors: Thomas Lucas, Jakob Verbeek
Publication date: 10/09/2018

Generative modeling of high-dimensional data is a key problem in machine learning. Successful approaches include latent variable models and autoregressive models. The complementary strengths of these approaches, to model global and local image statistics respectively, suggest hybrid models that encode global image structure into latent variables while autoregressively modeling low-level detail. Previous approaches to such hybrid models restrict the capacity of the autoregressive decoder to prevent degenerate models that ignore the latent variables and rely only on autoregressive modeling. Our contribution is a training procedure relying on an auxiliary loss function that controls which information is captured by the latent variables and what is left to the autoregressive decoder. Our approach can leverage arbitrarily powerful autoregressive decoders, achieves state-of-the-art quantitative performance among models with latent variables, and generates qualitatively convincing samples.

Comment: Published as a conference paper at ECML-PKDD 201

arXiv.org e-Print Archive

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server