Search CORE

36 research outputs found

Learning Visual Question Answering by Bootstrapping Hard Attention

Author: A Mack
A Rohrbach
DJ Simons
DJ Simons
DL Sheinberg
M Malinowski
S Harnad
S Hochreiter
S Singh
T Çukur
Publication venue
Publication date: 01/08/2018
Field of study

Attention mechanisms in biological perception are thought to select subsets of perceptual information for more sophisticated processing which would be prohibitive to perform on all sensory inputs. In computer vision, however, there has been relatively little exploration of hard attention, where some information is selectively ignored, in spite of the success of soft attention, where information is re-weighted and aggregated, but never filtered out. Here, we introduce a new approach for hard attention and find it achieves very competitive performance on a recently-released visual question answering datasets, equalling and in some cases surpassing similar soft attention architectures while entirely ignoring some features. Even though the hard attention mechanism is thought to be non-differentiable, we found that the feature magnitudes correlate with semantic relevance, and provide a useful signal for our mechanism's attentional selection criterion. Because hard attention selects important features of the input information, it can also be more efficient than analogous soft attention mechanisms. This is especially important for recent approaches that use non-local pairwise operations, whereby computational and memory costs are quadratic in the size of the set of features.Comment: ECCV 201

arXiv.org e-Print Archive

Crossref

Adults' Awareness of Faces Follows Newborns' Looking Preferences

Author: B de Gelder
B de Gelder
BC Duchaine
BN Pasley
CC Goren
CY Kim
DA Leopold
DL Sheinberg
E McKone
E Valenza
E Yang
F Fang
F Moradi
F Tong
G Golarai
G Yovel
G Zhou
I Amihai
J Morton
L Pessoa
MA Williams
Marius V. Peelen
Mark W. Greenlee
MH Johnson
MH Johnson
MH Johnson
MH Johnson
ML Anderson
MV Peelen
N Kanwisher
N Tsuchiya
N Tsuchiya
N Tsuchiya
NN Oosterhof
P Sterzer
P Sterzer
P Sterzer
P Sterzer
P Tomalski
P Tomalski
Philipp Sterzer
R Blake
R Blake
R Fox
S Dehaene
S Gilad
Stein T
T Farroni
T Farroni
T Stein
Timo Stein
TJ Andrews
WJM Levelt
Y Jiang
Y Jiang
Publication venue: Public Library of Science
Publication date: 21/12/2011
Field of study

From the first days of life, humans preferentially orient towards upright faces, likely reflecting innate subcortical mechanisms. Here, we show that binocular rivalry can reveal face detection mechanisms in adults that are surprisingly similar to inborn face detection mechanism. We used continuous flash suppression (CFS), a variant of binocular rivalry, to render stimuli invisible at the beginning of each trial and measured the time upright and inverted stimuli needed to overcome such interocular suppression. Critically, specific stimulus properties previously shown to modulate looking preferences in neonates similarly modulated adults' awareness of faces presented during CFS. First, the advantage of upright faces in overcoming CFS was strongly modulated by contrast polarity and direction of illumination. Second, schematic patterns consisting of three dark blobs were suppressed for shorter durations when the arrangement of these blobs respected the face-like configuration of the eyes and the mouth, and this effect was modulated by contrast polarity. No such effects were obtained in a binocular control experiment not involving CFS, suggesting a crucial role for face-sensitive mechanisms operating outside of conscious awareness. These findings indicate that visual awareness of faces in adults is governed by perceptual mechanisms that are sensitive to similar stimulus properties as those modulating newborns' face preferences

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Bistable Percepts in the Brain: fMRI Contrasts Monocular Pattern Rivalry and Binocular Rivalry

The neural correlates of binocular rivalry have been actively debated in recent years, and are of considerable interest as they may shed light on mechanisms of conscious awareness. In a related phenomenon, monocular rivalry, a composite image is shown to both eyes. The subject experiences perceptual alternations in which the two stimulus components alternate in clarity or salience. The experience is similar to perceptual alternations in binocular rivalry, although the reduction in visibility of the suppressed component is greater for binocular rivalry, especially at higher stimulus contrasts. We used fMRI at 3T to image activity in visual cortex while subjects perceived either monocular or binocular rivalry, or a matched non-rivalrous control condition. The stimulus patterns were left/right oblique gratings with the luminance contrast set at 9%, 18% or 36%. Compared to a blank screen, both binocular and monocular rivalry showed a U-shaped function of activation as a function of stimulus contrast, i.e. higher activity for most areas at 9% and 36%. The sites of cortical activation for monocular rivalry included occipital pole (V1, V2, V3), ventral temporal, and superior parietal cortex. The additional areas for binocular rivalry included lateral occipital regions, as well as inferior parietal cortex close to the temporoparietal junction (TPJ). In particular, higher-tier areas MT+ and V3A were more active for binocular than monocular rivalry for all contrasts. In comparison, activation in V2 and V3 was reduced for binocular compared to monocular rivalry at the higher contrasts that evoked stronger binocular perceptual suppression, indicating that the effects of suppression are not limited to interocular suppression in V1

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Incremental grouping of image elements in vision

Author: A Cohen
A Mack
A Mack
A Pooresmaeili
A Sutter
A Thiele
AF Kramer
AG Leventhal
AK Engel
AM Richard
AM Treisman
AM Treisman
AM Treisman
AM Treisman
AO Holcombe
B DeSchepper
B Julesz
BA Eriksen
BC Motter
BJ Baars
BJ Scholl
BJA Palanca
C Bundesen
C Haimson
C Lefebvre
C Malsburg von der
C Malsburg von der
C Russell
CI Baker
CJ McAdams
CM Moore
CP Hung
D Crundall
D Crundall
D Kahneman
D Mumford
DB Liston
DG Pelli
DJ Felleman
DJ Field
DJ Freedman
DL Sheinberg
DY Tsao
E Blaser
E Borenstein
E Kobatake
E Sharon
EK Miller
F Velde van der
FG Ashby
FH Hamker
G Kayaert
G Kreiman
G Sáry
GC Baylis
GC Baylis
GM Ghose
GW Humphreys
H Komatsu
H Supèr
HB Barlow
HE Egeth
HS Scholte
I Kovács
I Kovács
I Rock
ID Gilchrist
J Avrahami
J Beck
J Driver
J Driver
J Driver
J Duncan
J Duncan
J Duncan
J Malik
JB Mattingly
JD Schall
JE Hoffman
JH Reynolds
JJ Knierim
JK Tsotsos
JM Wolfe
JM Wolfe
JM Wolfe
JM Wolfe
JR Bergen
K Fukushima
K Fukushima
K Koffka
K Tanaka
KA Schneider
KE Schmidt
KM Armstrong
KM O'Craven
KR Gegenfurtner
L Chelazzi
L Chelazzi
L Harms
L Itti
LB Ekstrom
LC Sincich
M Behrmann
M Bravo
M Carrasco
M Kubovy
M Pavlovskaya
M Riesenhuber
M Riesenhuber
M Wertheimer
M-E Large
MA Peterson
MB Ben-Av
MJ Bravo
MK Kapadia
MN Shadlen
MS Livingstone
MW Oram
N Cowan
N Donnelly
NP Bichot
O Ben-Shahar
P Cavanagh
P Jolicoeur
P Jolicoeur
P Jolicoeur
P Jolicoeur
P McLeod
PA Salin
Pieter R. Roelfsema
PJ Kellman
PR Roelfsema
PR Roelfsema
PR Roelfsema
PR Roelfsema
PR Roelfsema
PR Roelfsema
PR Roelfsema
PR Roelfsema
PR Roelfsema
PR Roelfsema
PR Roelfsema
PS Khayat
PW Halligan
R Desimone
R Eckhorn
R Egly
R Houtkamp
R Houtkamp
R Kimchi
R Kimchi
R Pringle
R Sireteanu
RB Ivry
RF Hess
RJ Watt
Roos Houtkamp
RQ Quiroga
S Celebrini
S Coren
S Dehaene
S Dehaene
S Edelman
S Grossberg
S Grossberg
S Ling
S Thorpe
S Treue
S Treue
S Ullman
S Zeki
SE Palmer
SE Palmer
SE Palmer
SE Watson
SJ Luck
SL Brincat
SL Brincat
SL Franconeri
SP Vecera
SP Vecera
SP Vecera
SR Mitroff
SS Wolfson
SW Zucker
T Moore
TD Albright
TJ Buschman
TJ Vickery
U Neisser
VAF Lamme
VAF Lamme
VAF Lamme
VAF Lamme
W Prinzmetal
W Prinzmetal
W Singer
WA Phillips
WH Bosking
WP Banks
WY Chan
Y Sugase
YS Bonneh
Z Gigus
Z Kourtzi
Z Li
ZJ He
Publication venue: Springer-Verlag
Publication date: 01/01/2011
Field of study

One important task for the visual system is to group image elements that belong to an object and to segregate them from other objects and the background. We here present an incremental grouping theory (IGT) that addresses the role of object-based attention in perceptual grouping at a psychological level and, at the same time, outlines the mechanisms for grouping at the neurophysiological level. The IGT proposes that there are two processes for perceptual grouping. The first process is base grouping and relies on neurons that are tuned to feature conjunctions. Base grouping is fast and occurs in parallel across the visual scene, but not all possible feature conjunctions can be coded as base groupings. If there are no neurons tuned to the relevant feature conjunctions, a second process called incremental grouping comes into play. Incremental grouping is a time-consuming and capacity-limited process that requires the gradual spread of enhanced neuronal activity across the representation of an object in the visual cortex. The spread of enhanced neuronal activity corresponds to the labeling of image elements with object-based attention

Crossref

VU Research Portal

PubMed Central

Visual categorization shapes feature selectivity in the primate temporal cortex

Author: A Delorme
CG Gross
DJ Freedman
DL Sheinberg
DL Sheinberg
I Gauthier
I Gauthier
JS Bruner
JW Tanaka
K Tanaka
N Sigala
Natasha Sigala
Nikos K. Logothetis
NK Logothetis
P Schyns
PH Schiller
R Vogels
R Vogels
RM Nosofsky
S Judge
SF Sands
T Sugihara
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2002
Field of study

The way that we perceive and interact with objects depends on our previous experience with them. For example, a bird expert is more likely to recognize a bird as a sparrow, a sandpiper or a cockatiel than a non-expert. Neurons in the inferior temporal cortex have been shown to be important in the representation of visual objects; however, it is unknown which object features are represented and how these representations are affected by categorization training. Here we show that feature selectivity in the macaque inferior temporal cortex is shaped by categorization of objects on the basis of their visual features. Specifically, we recorded from single neurons while monkeys performed a categorization task with two sets of parametric stimuli. Each stimulus set consisted of four varying features, but only two of the four were important for the categorization task (diagnostic features). We found enhanced neuronal representation of the diagnostic features relative to the non-diagnostic ones. These findings demonstrate that stimulus features important for categorization are instantiated in the activity of single units (neurons) in the primate inferior temporal corte

Crossref

Oxford University Research Archive

The University of Manchester - Institutional Repository

Sussex Research Online

MPG.PuRe