Search CORE

23 research outputs found

Saliency Benchmarking Made Easy: Separating Models, Maps and Metrics

Author: A Borji
A Borji
A Borji
A Nuthmann
AM Treisman
BT Vincent
BW Tatler
BW Tatler
BW Tatler
CA Rothkopf
H Adeli
HH Schütt
J Xiao
K Koehler
L Itti
L Itti
L Itti
L Zhang
M Cerf
M Kümmerer
N Riche
N Riche
N Wilming
ND Bruce
NDB Bruce
O Meur Le
RJ Peters
S Barthelme
SSS Kruthiventi
T Jost
W Einhauser
W Kienzle
Z Bylinskii
Z Li
Publication venue
Publication date: 25/07/2018
Field of study

Dozens of new models on fixation prediction are published every year and compared on open benchmarks such as MIT300 and LSUN. However, progress in the field can be difficult to judge because models are compared using a variety of inconsistent metrics. Here we show that no single saliency map can perform well under all metrics. Instead, we propose a principled approach to solve the benchmarking problem by separating the notions of saliency models, maps and metrics. Inspired by Bayesian decision theory, we define a saliency model to be a probabilistic model of fixation density prediction and a saliency map to be a metric-specific prediction derived from the model density which maximizes the expected performance on that metric given the model density. We derive these optimal saliency maps for the most commonly used saliency metrics (AUC, sAUC, NSS, CC, SIM, KL-Div) and show that they can be computed analytically or approximated with high precision. We show that this leads to consistent rankings in all metrics and avoids the penalties of using one saliency map for all metrics. Our method allows researchers to have their model compete on many different metrics with state-of-the-art in those metrics: "good" models will perform well in all metrics.Comment: published at ECCV 201

arXiv.org e-Print Archive

Crossref

Salience-based selection: attentional capture by distractors less salient than the target

Author: A Found
A Kristjansson
A Soltani
A Treisman
AB Leber
AB Leber
Anja Isabel Koch
AR Koene
BA Purcell
C Hickey
C Hickey
C Koch
CL Folk
D Gao
D Gao
D Kahneman
DM Beck
G Schwarz
GW Humphreys
H Deubel
H-C Nothdurft
Harriet Goschy
HE Egeth
Hermann Joseph Müller
HJ Müller
HJ Müller
J de Fockert
J Hodsoll
J Palmer
J Palmer
J Theeuwes
J Theeuwes
J Theeuwes
J Vandekerckhove
JA Nelder
JH Reynolds
JI Gold
JM Wolfe
JM Wolfe
JM Wolfe
Joy J. Geng
JP Gottlieb
JP Gottlieb
JW Bisley
KR Cave
KR Cave
L Itti
L Itti
M Eimer
M Kiss
M Usher
M Usher
M Zehetleitner
M Zehetleitner
M Zehetleitner
M Zehetleitner
M Zehetleitner
M Zehetleitner
M Zehetleitner
M Zehetleitner
Michael Zehetleitner
MP Eckstein
N Lavie
NDB Bruce
P Verghese
P Verghese
R Desimone
R Desimone
R Ratcliff
R Ratcliff
RH Phaf
SJ Luck
T Töllner
T van Zandt
V Maljkovic
W van Zoest
W van Zoest
Z Li
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2013
Field of study

Current accounts of attentional capture predict the most salient stimulus to be invariably selected first. However, existing salience and visual search models assume noise in the map computation or selection process. Consequently, they predict the first selection to be stochastically dependent on salience, implying that attention could even be captured first by the second most salient (instead of the most salient) stimulus in the field. Yet, capture by less salient distractors has not been reported and salience-based selection accounts claim that the distractor has to be more salient in order to capture attention. We tested this prediction using an empirical and modeling approach of the visual search distractor paradigm. For the empirical part, we manipulated salience of target and distractor parametrically and measured reaction time interference when a distractor was present compared to absent. Reaction time interference was strongly correlated with distractor salience relative to the target. Moreover, even distractors less salient than the target captured attention, as measured by reaction time interference and oculomotor capture. In the modeling part, we simulated first selection in the distractor paradigm using behavioral measures of salience and considering the time course of selection including noise. We were able to replicate the result pattern we obtained in the empirical part. We conclude that each salience value follows a specific selection time distribution and attentional capture occurs when the selection time distributions of target and distractor overlap. Hence, selection is stochastic in nature and attentional capture occurs with a certain probability depending on relative salience

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

Open Access LMU ( Ludwig-Maximilians-Univ. München)

PubMed Central

Publikationsserver der Katholischen Universität Eichstätt-Ingolstadt

Birkbeck Institutional Research Online

The Francis Crick Institute

Emergence of Visual Saliency from Natural Scenes via Context-Mediated Probability Distributions Coding

Author: A Hyvarinen
A Hyvarinen
A Olmos
A Torralba
AJ Bell
AM Treisman
B Julesz
BA Olshausen
BA Olshausen
BW Tatler
C Kayser
C Koch
D Field
D Gao
D Gao
D Gao
EP Simoncelli
EP Simoncelli
F Attneave
G Felsen
GC DeAngelis
H Barlow
HJ Seo
JH van Hateren
JH van Hateren
Jinhua Xu
JJ Atick
JM Wolf
Joe Z. Tsien
L Itti
L Itti
L Itti
L Itti
L Zhang
L Zhang
L Zhaoping
M Carandini
Matjaz Perc
MS Caywood
NC Rust
ND Bruce
NDB Bruce
O Le Meur
PO Hoyer
RP Rao
RPN Rao
T Wachtler
TD Albright
WE Vinje
WJ Ma
WS Geisler
X Chen
Y Karklin
Z Li
Zhiyong Yang
Publication venue: Public Library of Science
Publication date: 29/12/2010
Field of study

Visual saliency is the perceptual quality that makes some items in visual scenes stand out from their immediate contexts. Visual saliency plays important roles in natural vision in that saliency can direct eye movements, deploy attention, and facilitate tasks like object detection and scene understanding. A central unsolved issue is: What features should be encoded in the early visual cortex for detecting salient features in natural scenes? To explore this important issue, we propose a hypothesis that visual saliency is based on efficient encoding of the probability distributions (PDs) of visual variables in specific contexts in natural scenes, referred to as context-mediated PDs in natural scenes. In this concept, computational units in the model of the early visual system do not act as feature detectors but rather as estimators of the context-mediated PDs of a full range of visual variables in natural scenes, which directly give rise to a measure of visual saliency of any input stimulus. To test this hypothesis, we developed a model of the context-mediated PDs in natural scenes using a modified algorithm for independent component analysis (ICA) and derived a measure of visual saliency based on these PDs estimated from a set of natural scenes. We demonstrated that visual saliency based on the context-mediated PDs in natural scenes effectively predicts human gaze in free-viewing of both static and dynamic natural scenes. This study suggests that the computation based on the context-mediated PDs of visual variables in natural scenes may underlie the neural mechanism in the early visual cortex for detecting salient features in natural scenes

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Measures and Limits of Models of Fixation Selection

Author: A Acik
A Açik
A Chao
A Hwang
A Torralba
B Tatler
B Tatler
BW Tatler
C Kanan
D Holste
D Parkhurst
G Kootstra
G Miller
H Frey
I Nemenman
J Gilland
J Harel
J Hausser
KA Ehinger
L Elazary
L Itti
L Itti
L Itti
L Zhang
LW Renninger
M Cerf
M Cerf
M Recarte
N Parikh
NDB Bruce
Niklas Wilming
NJ Butko
Peter König
R Krichevsky
R Peters
RJ Baddeley
S Trybula
T Betz
T Fawcett
T Schürmann
Thomas Wennekers
Tim C. Kietzmann
Torsten Betz
V Yanulevskaya
W Einhäuser
W Kienzle
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Models of fixation selection are a central tool in the quest to understand how the human mind selects relevant information. Using this tool in the evaluation of competing claims often requires comparing different models' relative performance in predicting eye movements. However, studies use a wide variety of performance measures with markedly different properties, which makes a comparison difficult. We make three main contributions to this line of research: First we argue for a set of desirable properties, review commonly used measures, and conclude that no single measure unites all desirable properties. However the area under the ROC curve (a classification measure) and the KL-divergence (a distance measure of probability distributions) combine many desirable properties and allow a meaningful comparison of critical model performance. We give an analytical proof of the linearity of the ROC measure with respect to averaging over subjects and demonstrate an appropriate correction of entropy-based measures like KL-divergence for small sample sizes in the context of eye-tracking data. Second, we provide a lower bound and an upper bound of these measures, based on image-independent properties of fixation data and between subject consistency respectively. Based on these bounds it is possible to give a reference frame to judge the predictive power of a model of fixation selection . We provide open-source python code to compute the reference frame. Third, we show that the upper, between subject consistency bound holds only for models that predict averages of subject populations. Departing from this we show that incorporating subject-specific viewing behavior can generate predictions which surpass that upper bound. Taken together, these findings lay out the required information that allow a well-founded judgment of the quality of any model of fixation selection and should therefore be reported when a new model is introduced

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Methods for comparing scanpaths and saliency maps: strengths and weaknesses

Author: A Delorme
A Toet
A Torralba
A Yarbus
B Efron
B Follet
BM Velichkovsky
BM Velichkovsky
BW Tatler
BW Tatler
BW Tatler
CB Trevarthen
CM Privitera
D Green
DE Irwin
EAB Over
GA Rousselet
H Jarodzka
J Simola
JM Henderson
JR Antes
K Rayner
K Rayner
L Itti
L Itti
L Itti
L Zhang
LW Renninger
M Bindemann
M Chanceaux
MA Just
NDB Bruce
NDB Bruce
O Meur Le
Olivier Le Meur
P Viviani
PJA Unema
Q Zhao
R Nisbett
RA Wagner
RJ Peters
RJ Peters
RR Althoff
S Josephson
S Pannasch
SK Mannan
SK Mannan
SK Mannan
T Baccino
T Baccino
T Baccino
T Jost
Thierry Baccino
U Rajashekar
VI Levenshtein
W Einhauser
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Predicting Eye Fixations on Complex Visual Stimuli Using Local Symmetry

Most bottom-up models that predict human eye fixations are based on contrast features. The saliency model of Itti, Koch and Niebur is an example of such contrast-saliency models. Although the model has been successfully compared to human eye fixations, we show that it lacks preciseness in the prediction of fixations on mirror-symmetrical forms. The contrast model gives high response at the borders, whereas human observers consistently look at the symmetrical center of these forms. We propose a saliency model that predicts eye fixations using local mirror symmetry. To test the model, we performed an eye-tracking experiment with participants viewing complex photographic images and compared the data with our symmetry model and the contrast model. The results show that our symmetry model predicts human eye fixations significantly better on a wide variety of images including many that are not selected for their symmetrical content. Moreover, our results show that especially early fixations are on highly symmetrical areas of the images. We conclude that symmetry is a strong predictor of human eye fixations and that it can be used as a predictor of the order of fixation

Crossref

Proceedings - University of Groningen

University of Groningen

Springer - Publisher Connector

ARTS repository - University of Groningen

PubMed Central

UvA-DARE

International Migration, Integration and Social Cohesion online publications

Swepub

Dissertations of the University of Groningen

Entity linking for biomedical literature

Author: AB Abacha
B Liu
Boliang Zhang
CE Shannon
D Milne
Daniel Howsmon
Deborah McGuinness
H Fang
H Ji
H Ji
Heng Ji
HJ Dai
J Biesiada
J Zheng
James Hendler
Jin G Zheng
Juergen Hahn
L Hirschman
L Hunter
L Page
L Ratinov
LM Akella
M Frisch
M Miwa
M Pennacchiotti
NDB Bruce
P Ferragina
PN Mendes
R Mihalcea
S Cucerzan
S Kulkarni
T Cassidy
T Lipniacki
V Punyakanok
W Shen
X Cheng
X Liu
Y Guo
Y Sun
Y Usami
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

A Subjective Quality Assessment of Objects Based on Visual Attention

Author: D Cai
L Itti
L Ma
NDB Bruce
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Modeling and integration of situational awareness and soldier target search

Author: Alt J
Bruce NDB
Darken C
Darken C
Doll T
Hoffman DD
Johnson J
Lehmann EL
Mazz JP
Medina J
Reece DA
Washburn A
Publication venue: 'SAGE Publications'
Publication date
Field of study

Crossref

Integration of Multi-modal Cues in Synthetic Attention Processes to Drive Virtual Agent Behavior

Author: C Bordeux
C Peters
Christopher Peters
CW Reynolds
DM Kuiper
F Krueger
H Noser
J Oijen van
M Luck
NDB Bruce
T Balint
T Conde
T Haubrich
T Pejsa
Y Kim
Publication venue
Publication date: 26/08/2017
Field of study

Simulations and serious games require realistic behavior of multiple intelligent agents in real-time. One particular issue is how attention and multi-modal sensory memory can be modeled in a natural but effective way, such that agents controllably react to salient objects or are distracted by other multi-modal cues from their current intention. We propose a conceptual framework that provides a solution with adherence to three main design goals: natural behavior, real-time performance, and controllability. As a proof of concept, we implement three major components and showcase effectiveness in a real-time game engine scenario. Within the exemplified scenario, a visual sensor is combined with static saliency probes and auditory cues. The attention model weighs bottom-up attention against intention-related top-down processing, controllable by a designer using memory and attention inhibitor parameters. We demonstrate our case and discuss future extensions

Crossref

pub H-BRS - Publikationsserver der Hochschule Bonn-Rhein-Sieg