Search CORE

14,000 research outputs found

Dynamic Variational Autoencoders for Visual Process Modeling

Author: Sagel Alexander
Shen Hao
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 12/02/2020
Field of study

This work studies the problem of modeling visual processes by leveraging deep generative architectures for learning linear, Gaussian representations from observed sequences. We propose a joint learning framework, combining a vector autoregressive model and Variational Autoencoders. This results in an architecture that allows Variational Autoencoders to simultaneously learn a non-linear observation as well as a linear state model from sequences of frames. We validate our approach on artificial sequences and dynamic textures

arXiv.org e-Print Archive

Crossref

Deep Markov Random Field for Image Modeling

Author: A Dempster
Alex Graves
C Dong
E Ising
GE Hinton
GR Cross
H Rue
J Duchi
J Pearl
J Portilla
J Shotton
J Wright
James Hays
Jean-François Lalonde
M Schuster
PJ Werbos
R Timofte
S Geman
SC Zhu
SZ Li
WT Freeman
Publication venue
Publication date: 07/09/2016
Field of study

Markov Random Fields (MRFs), a formulation widely used in generative image modeling, have long been plagued by the lack of expressive power. This issue is primarily due to the fact that conventional MRFs formulations tend to use simplistic factors to capture local patterns. In this paper, we move beyond such limitations, and propose a novel MRF model that uses fully-connected neurons to express the complex interactions among pixels. Through theoretical analysis, we reveal an inherent connection between this model and recurrent neural networks, and thereon derive an approximated feed-forward network that couples multiple RNNs along opposite directions. This formulation combines the expressive power of deep neural networks and the cyclic dependency structure of MRF in a unified model, bringing the modeling capability to a new level. The feed-forward approximation also allows it to be efficiently learned from data. Experimental results on a variety of low-level vision tasks show notable improvement over state-of-the-arts.Comment: Accepted at ECCV 201

arXiv.org e-Print Archive

Crossref

Function-led design of multifunctional stimuli-responsive superhydrophobic surface based on hierarchical graphene-titania nanocoating

Author: Liu Yang
Publication venue
Publication date: 12/12/2017
Field of study

Multifunctional smart superhydrophobic surface with full-spectrum tunable wettability control is fabricated through the self-assembly of the graphene and titania nanofilm double-layer coating. Advanced microfluidic manipulative functions, including directional water transport, adhesion & spreading controls, droplet storage & transfer, and droplet sensing array, can be readily realized on this smart surface. An in-depth mechanism study regarding the underlying secrets of the tunable wettability and the UV-induced superhydrophilic conversion of anatase titania are also presented

arXiv.org e-Print Archive

The Hong Kong Polytechnic University Pao Yue-kong Library

ST-GAN: Spatial Transformer Generative Adversarial Networks for Image Compositing

Author: Lin Chen-Hsuan
Lucey Simon
Shechtman Eli
Wang Oliver
Yumer Ersin
Publication venue
Publication date: 05/03/2018
Field of study

We address the problem of finding realistic geometric corrections to a foreground object such that it appears natural when composited into a background image. To achieve this, we propose a novel Generative Adversarial Network (GAN) architecture that utilizes Spatial Transformer Networks (STNs) as the generator, which we call Spatial Transformer GANs (ST-GANs). ST-GANs seek image realism by operating in the geometric warp parameter space. In particular, we exploit an iterative STN warping scheme and propose a sequential training strategy that achieves better results compared to naive training of a single generator. One of the key advantages of ST-GAN is its applicability to high-resolution images indirectly since the predicted warp parameters are transferable between reference frames. We demonstrate our approach in two applications: (1) visualizing how indoor furniture (e.g. from product images) might be perceived in a room, (2) hallucinating how accessories like glasses would look when matched with real portraits.Comment: Accepted to CVPR 2018 (website & code: https://chenhsuanlin.bitbucket.io/spatial-transformer-GAN/

arXiv.org e-Print Archive

Crossref

Generative Image Modeling Using Spatial LSTMs

Author: Bethge Matthias
Theis Lucas
Publication venue
Publication date: 18/09/2015
Field of study

Modeling the distribution of natural images is challenging, partly because of strong statistical dependencies which can extend over hundreds of pixels. Recurrent neural networks have been successful in capturing long-range dependencies in a number of problems but only recently have found their way into generative image models. We here introduce a recurrent image model based on multi-dimensional long short-term memory units which are particularly suited for image modeling due to their spatial structure. Our model scales to images of arbitrary size and its likelihood is computationally tractable. We find that it outperforms the state of the art in quantitative comparisons on several image datasets and produces promising results when used for texture synthesis and inpainting

arXiv.org e-Print Archive

CiteSeerX

MPG.PuRe