Interaction between high-level and low-level image analysis for semantic video object extraction
ABC-CNN: An Attention Based Convolutional Neural Network for Visual Question Answering
We propose a novel attention-based deep learning architecture for the visual question answering (VQA) task. Given an image and a natural language question about that image, VQA generates a natural language answer. Generating the correct answer requires the model's attention to focus on the regions relevant to the question, because different questions inquire about the attributes of different image regions. We introduce an attention-based configurable convolutional neural network (ABC-CNN) to learn such question-guided attention. ABC-CNN determines an attention map for an image-question pair by convolving the image feature map with configurable convolutional kernels derived from the question's semantics. We evaluate the ABC-CNN architecture on three benchmark VQA datasets: Toronto COCO-QA, DAQUAR, and the VQA dataset. The ABC-CNN model achieves significant improvements over state-of-the-art methods on these datasets. The question-guided attention generated by ABC-CNN is also shown to reflect the regions that are highly relevant to the questions.
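The core mechanism described above can be sketched in a few lines: project the question embedding into a convolutional kernel, correlate it with the image feature map, and normalize the result into an attention map. This is a minimal numpy illustration; the shapes, the random projection matrix, and the 3x3 kernel size are assumptions for demonstration, not the paper's trained parameters.

```python
import numpy as np

def softmax2d(x):
    """Softmax over all spatial positions of a 2-D score map."""
    e = np.exp(x - x.max())
    return e / e.sum()

def question_guided_attention(feature_map, q_embed, rng, k=3):
    """Sketch of ABC-CNN-style attention.

    feature_map: (C, H, W) image feature map.
    q_embed:     (D,) question embedding.
    The linear projection W_proj stands in for a learned layer.
    """
    C, H, W = feature_map.shape
    # Project the question embedding into a C x k x k convolutional kernel.
    W_proj = rng.standard_normal((C * k * k, q_embed.size)) * 0.01
    kernel = (W_proj @ q_embed).reshape(C, k, k)
    # Correlate the question-configured kernel with the feature map
    # (same-size output via zero padding).
    pad = k // 2
    fpad = np.pad(feature_map, ((0, 0), (pad, pad), (pad, pad)))
    scores = np.zeros((H, W))
    for i in range(H):
        for j in range(W):
            scores[i, j] = np.sum(fpad[:, i:i + k, j:j + k] * kernel)
    # Normalize scores into a spatial attention map that sums to 1.
    return softmax2d(scores)

rng = np.random.default_rng(0)
fmap = rng.standard_normal((8, 5, 5))   # toy 8-channel, 5x5 feature map
q = rng.standard_normal(16)             # toy 16-d question embedding
att = question_guided_attention(fmap, q, rng)
print(att.shape)
```

The attention map can then reweight the feature map before answer prediction; in practice the projection would be learned end-to-end rather than drawn at random.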
The Missing Data Encoder: Cross-Channel Image Completion with Hide-And-Seek Adversarial Network
Image completion is the problem of generating whole images from fragments only. It encompasses inpainting (generating a patch given its surroundings), reverse inpainting/extrapolation (generating the periphery given the central patch), and colorization (generating one or several channels given the others). In this paper, we employ a deep network to perform image completion, with adversarial training as well as perceptual and completion losses, and call it the "missing data encoder" (MDE). We consider several configurations based on how the seed fragments are chosen. We show that training MDE for "random extrapolation and colorization" (MDE-REC), i.e., using random channel-independent fragments, allows a better capture of image semantics and geometry. MDE training makes use of a novel "hide-and-seek" adversarial loss, in which the discriminator seeks the original non-masked regions while the generator tries to hide them. We validate our models both qualitatively and quantitatively on several datasets, demonstrating their usefulness for image completion, unsupervised representation learning, and face occlusion handling.
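The hide-and-seek objective described above can be sketched as a per-pixel adversarial game: the discriminator predicts, for each pixel, whether it was kept from the original image, and the generator is trained on the opposite target. This numpy sketch assumes a per-pixel sigmoid discriminator output and a binary cross-entropy form; the exact loss in the paper may differ.

```python
import numpy as np

def bce(pred, target, eps=1e-7):
    """Mean binary cross-entropy between probabilities and binary targets."""
    pred = np.clip(pred, eps, 1 - eps)
    return -(target * np.log(pred) + (1 - target) * np.log(1 - pred)).mean()

def hide_and_seek_losses(disc_logits, visible_mask):
    """Sketch of the hide-and-seek adversarial loss.

    disc_logits:  (H, W) discriminator scores over the completed image.
    visible_mask: (H, W) binary map, 1 where original pixels were kept.
    """
    p = 1.0 / (1.0 + np.exp(-disc_logits))  # per-pixel "was visible" prob.
    d_loss = bce(p, visible_mask)           # seeker: locate the real regions
    g_loss = bce(p, 1.0 - visible_mask)     # hider: make them undetectable
    return d_loss, g_loss

mask = (np.random.default_rng(1).random((4, 4)) > 0.5).astype(float)
logits = np.zeros((4, 4))  # an uninformative discriminator: p = 0.5 everywhere
d, g = hide_and_seek_losses(logits, mask)
print(round(d, 4), round(g, 4))
```

With uninformative logits both losses equal log 2, the equilibrium of the game; training pushes the discriminator below it and the generator toward it.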