
1,692 research outputs found

Self Adversarial Training for Human Pose Estimation

Author: arjovsky
belagiannis
berthelot
bulat
cao
carreira
chen
chen
chu
gkioxari
gong
goodfellow
gulrajani
insafutdinov
isola
ledig
lifshitz
luc
mirza
newell
pan
pishchulin
radford
rafi
ramakrishna
tompson
wei
zhao
Publication date: 15/08/2017

This paper presents a deep-learning-based approach to the problem of human pose estimation. We employ generative adversarial networks as our learning paradigm, in which we set up two stacked hourglass networks with the same architecture, one as the generator and the other as the discriminator. The generator is used as a human pose estimator once training is done. The discriminator distinguishes ground-truth heatmaps from generated ones and back-propagates the adversarial loss to the generator. This process enables the generator to learn plausible human body configurations and is shown to improve prediction accuracy.

Comment: CVPR 2017 Workshop on Visual Understanding of Humans in Crowd Scene and the 1st Look Into Person (LIP) Challenge
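
The generator/discriminator interplay described in this abstract can be illustrated with a minimal sketch. It assumes a standard binary cross-entropy GAN objective combined with a mean-squared-error heatmap term; the abstract does not state the exact loss, so the function names and the `adv_weight` parameter are hypothetical, and the discriminator outputs are treated as plain probabilities.

```python
import math

def bce(p, target):
    """Binary cross-entropy for a single probability p (clamped away from 0 and 1)."""
    eps = 1e-12
    p = min(max(p, eps), 1.0 - eps)
    return -(target * math.log(p) + (1.0 - target) * math.log(1.0 - p))

def mse(pred, truth):
    """Mean squared error between two flattened heatmaps of equal length."""
    return sum((a - b) ** 2 for a, b in zip(pred, truth)) / len(pred)

def discriminator_loss(d_real, d_fake):
    # Assumption: the discriminator outputs the probability that a heatmap is
    # ground truth. It is trained to score real heatmaps as 1 and generated ones as 0.
    return bce(d_real, 1.0) + bce(d_fake, 0.0)

def generator_loss(pred, truth, d_fake, adv_weight=0.01):
    # Heatmap regression term plus an adversarial term that is small when the
    # discriminator is fooled. adv_weight is a hypothetical balancing factor.
    return mse(pred, truth) + adv_weight * bce(d_fake, 1.0)
```

In this sketch the adversarial term shrinks as the discriminator's score for a generated heatmap approaches 1, which is one way the generator could be pushed toward plausible body configurations.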

arXiv.org e-Print Archive

MedGAN: Medical Image Translation using GANs

Author: Armanious Karim
Fischer Marc
Gatidis Sergios
Hepp Tobias
Jiang Chenming
Küstner Thomas
Nikolaou Konstantin
Yang Bin
Publication venue: Elsevier BV
Publication date: 04/04/2019

Image-to-image translation is considered a new frontier in the field of medical image analysis, with numerous potential applications. However, a large portion of recent approaches offer individualized solutions based on specialized task-specific architectures or require refinement through non-end-to-end training. In this paper, we propose a new framework, named MedGAN, for medical image-to-image translation which operates on the image level in an end-to-end manner. MedGAN builds upon recent advances in the field of generative adversarial networks (GANs) by merging the adversarial framework with a new combination of non-adversarial losses. We utilize a discriminator network as a trainable feature extractor which penalizes the discrepancy between the translated medical images and the desired modalities. Moreover, style-transfer losses are utilized to match the textures and fine structures of the desired target images to the translated images. Additionally, we present a new generator architecture, titled CasNet, which enhances the sharpness of the translated medical outputs through progressive refinement via encoder-decoder pairs. Without any application-specific modifications, we apply MedGAN to three different tasks: PET-CT translation, correction of MR motion artefacts and PET image denoising. Perceptual analysis by radiologists and quantitative evaluations illustrate that MedGAN outperforms other existing translation approaches.

Comment: 16 pages, 8 figures
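
As a rough illustration of what a style-transfer loss can look like, the sketch below uses the common Gram-matrix formulation from neural style transfer. The abstract does not say that MedGAN uses exactly this form, so the function names, the normalization by the number of spatial positions, and the toy feature values are all assumptions.

```python
def gram_matrix(features):
    """features: list of C channels, each a flat list of N activations.
    Returns the C x C matrix of channel correlations, averaged over positions."""
    n = len(features[0])
    return [
        [sum(a * b for a, b in zip(fi, fj)) / n for fj in features]
        for fi in features
    ]

def style_loss(feat_translated, feat_target):
    """Squared Frobenius distance between the two Gram matrices."""
    g_t = gram_matrix(feat_translated)
    g_y = gram_matrix(feat_target)
    return sum(
        (a - b) ** 2
        for row_t, row_y in zip(g_t, g_y)
        for a, b in zip(row_t, row_y)
    )

# Toy feature maps: 2 channels, 2 spatial positions each (hypothetical values).
translated = [[1.0, 2.0], [3.0, 4.0]]
target = [[2.0, 1.0], [4.0, 3.0]]  # same activations, positions swapped
```

Because the Gram matrix sums over spatial positions, `style_loss(translated, target)` is zero for these toy inputs even though the layouts differ. That is the sense in which such a loss compares texture statistics rather than pixel-wise structure.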

arXiv.org e-Print Archive

King's Research Portal

Super-Resolution for Overhead Imagery Using DenseNets and Adversarial Learning

Author: Bosch Marc
Gifford Christopher M.
Rodriguez Pedro A.
Publication date: 28/11/2017

Recent advances in Generative Adversarial Learning allow for new modalities of image super-resolution by learning low-to-high-resolution mappings. In this paper we present our work using Generative Adversarial Networks (GANs) with applications to overhead and satellite imagery. We have experimented with several state-of-the-art architectures. We propose a GAN-based architecture using densely connected convolutional neural networks (DenseNets) to super-resolve overhead imagery by a factor of up to 8x. We have also investigated the resolution limits of these networks. We report results on several publicly available datasets, including SpaceNet data and the IARPA Multi-View Stereo Challenge, and compare performance with other state-of-the-art architectures.

Comment: 9 pages, 9 figures, WACV 2018 submission
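
Two pieces of arithmetic behind this abstract can be sketched briefly: how channel counts grow inside a densely connected block, and what an 8x factor means for image size. The helper names and example numbers are hypothetical, and the sketch assumes the 8x factor applies to each spatial dimension, which the abstract does not state explicitly.

```python
def dense_block_input_channels(initial_channels, growth_rate, num_layers):
    """In a dense block, each layer receives the concatenation of the block
    input and the outputs of all earlier layers in the block."""
    return [initial_channels + i * growth_rate for i in range(num_layers)]

def dense_block_output_channels(initial_channels, growth_rate, num_layers):
    """Channels leaving the block: the input plus one growth_rate-sized
    feature map per layer."""
    return initial_channels + num_layers * growth_rate

def super_resolved_size(height, width, factor):
    """Output size when each spatial dimension is scaled by `factor`
    (assumption: the factor is per dimension, not per pixel count)."""
    return height * factor, width * factor

# Hypothetical configuration, not taken from the paper.
inputs_per_layer = dense_block_input_channels(64, 32, 4)
low_res_to_high_res = super_resolved_size(32, 32, 8)
```

Under these assumptions a 32 x 32 tile becomes 256 x 256, so the network has to produce 64 output pixels for every input pixel.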

arXiv.org e-Print Archive