Autoencoding beyond pixels using a learned similarity metric
We present an autoencoder that leverages learned representations to better
measure similarities in data space. By combining a variational autoencoder with
a generative adversarial network we can use learned feature representations in
the GAN discriminator as basis for the VAE reconstruction objective. Thereby,
we replace element-wise errors with feature-wise errors to better capture the
data distribution while offering invariance towards e.g. translation. We apply
our method to images of faces and show that it outperforms VAEs with
element-wise similarity measures in terms of visual fidelity. Moreover, we show
that the method learns an embedding in which high-level abstract visual
features (e.g. wearing glasses) can be modified using simple arithmetic.
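The core idea of the abstract above — replacing an element-wise (pixel-space) reconstruction error with a feature-wise error computed in a discriminator's learned representation — can be sketched as follows. This is a minimal toy illustration, not the paper's implementation: the "discriminator features" here are a hypothetical fixed random linear map with a ReLU, standing in for an intermediate layer of a trained GAN discriminator.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical feature extractor standing in for an intermediate layer of
# the GAN discriminator (in the paper, these features are learned jointly).
W = rng.normal(size=(64, 16))

def features(x):
    # Linear map followed by ReLU: a stand-in for discriminator activations.
    return np.maximum(x @ W, 0.0)

x = rng.normal(size=(4, 64))                 # a batch of "data" vectors
x_rec = x + 0.1 * rng.normal(size=x.shape)   # a noisy "reconstruction"

# Standard VAE objective: element-wise squared error in data space.
elem_err = np.mean((x - x_rec) ** 2)

# VAE-GAN replacement: squared error between learned feature representations.
feat_err = np.mean((features(x) - features(x_rec)) ** 2)

print(elem_err, feat_err)
```

Because the error is measured after a learned (here, simulated) feature map, small input perturbations that the features absorb — such as slight translations — contribute less to the objective than they would pixel-wise.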
How Does Our Visual System Achieve Shift and Size Invariance?
The question of shift and size invariance in the primate visual system is discussed. After a short review of the relevant neurobiology and psychophysics, a more detailed analysis of computational models is given. The two main types of networks considered are the dynamic routing circuit model and invariant feature networks, such as the neocognitron. Some specific open questions in the context of these models are raised, and possible solutions are discussed.
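The "invariant feature network" idea mentioned above (of which the neocognitron is an example) can be illustrated in miniature: a local feature detector followed by global max pooling produces the same output no matter where the pattern appears in the input. The 1-D kernel and signals below are toy assumptions chosen only to make the point.

```python
import numpy as np

# Hypothetical 1-D feature detector (a fixed 3-tap kernel).
kernel = np.array([1.0, -1.0, 1.0])

def response(signal):
    # Valid cross-correlation with the kernel, then global max pooling.
    out = [signal[i:i + 3] @ kernel for i in range(len(signal) - 2)]
    return max(out)

pattern = np.array([1.0, -1.0, 1.0])
left = np.concatenate([pattern, np.zeros(5)])    # pattern at the left edge
right = np.concatenate([np.zeros(5), pattern])   # same pattern, shifted right

# The pooled response is identical for both positions: shift invariance.
print(response(left), response(right))
```

Size invariance requires more machinery (e.g. detectors at multiple scales, or the routing circuits the abstract contrasts with), since a simple convolution-plus-pooling stage is only invariant to translation.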