45 research outputs found

    Some Objects Are More Equal Than Others: Measuring and Predicting Importance

    We observe that everyday images contain dozens of objects, and that humans, in describing these images, give different priority to these objects. We argue that a goal of visual recognition is therefore not only to detect and classify objects but also to associate with each a level of priority, which we call 'importance'. We propose a definition of importance and show how it may be estimated reliably from data harvested from human observers. We conclude by showing that a first-order estimate of importance may be computed from a number of simple image region measurements and does not require access to image meaning.
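
    As an illustration of the kind of "first-order estimate" described above, a minimal sketch follows: a least-squares fit of human importance scores to simple geometric region measurements. The choice of features (region area, centrality) and of a linear model are assumptions made for illustration, not details taken from the paper.

    ```python
    import numpy as np

    def region_features(mask, image_shape):
        """Simple geometric measurements of a binary region mask."""
        h, w = image_shape
        ys, xs = np.nonzero(mask)
        area = xs.size / (h * w)                          # normalized size
        cy, cx = ys.mean() / h, xs.mean() / w             # normalized centroid
        centrality = 1.0 - np.hypot(cy - 0.5, cx - 0.5)   # closeness to center
        return np.array([area, centrality])

    def fit_importance(features, human_scores):
        """Least-squares fit of human importance scores to region features."""
        X = np.column_stack([features, np.ones(len(features))])  # add bias term
        coef, *_ = np.linalg.lstsq(X, human_scores, rcond=None)
        return coef  # score a new region with features @ coef[:-1] + coef[-1]
    ```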

    View and Illumination Invariant Object Classification Based on 3D Color Histogram Using Convolutional Neural Networks

    Object classification is an important step in visual recognition and semantic analysis of visual content. In this paper, we propose a method for classifying objects that is invariant to illumination color, illumination direction, and viewpoint, based on a 3D color histogram. The 3D color histogram of an image is represented as a 2D image to capture the color composition while preserving the neighborhood information of color bins, realizing the visual cues necessary for object classification. The ability of a convolutional neural network (CNN) to learn invariant visual patterns is then exploited for object classification. The efficacy of the proposed method is demonstrated on the Amsterdam Library of Object Images (ALOI) dataset, captured under various illumination conditions and angles of view.
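
    A minimal sketch of the input representation described above may help: a 3D RGB histogram unfolded into a 2D "histogram image" by tiling its slices along one channel, so that neighboring color bins stay adjacent. The bin count and the tiling layout are illustrative assumptions; the paper may arrange the bins differently.

    ```python
    import numpy as np

    def histogram_image(rgb, bins=8):
        """Turn an (H, W, 3) uint8 image into a (bins, bins*bins) histogram image."""
        pixels = rgb.reshape(-1, 3).astype(float)
        hist, _ = np.histogramdd(pixels, bins=(bins,) * 3, range=((0, 256),) * 3)
        hist /= hist.sum()  # normalize so the representation is size-independent
        # Tile the blue-channel slices side by side, preserving bin neighborhoods.
        return np.concatenate([hist[:, :, b] for b in range(bins)], axis=1)
    ```

    The resulting 2D array can then be fed to a CNN as a single-channel input, like any grayscale image.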

    Measures and Limits of Models of Fixation Selection

    Models of fixation selection are a central tool in the quest to understand how the human mind selects relevant information. Using this tool to evaluate competing claims often requires comparing different models' relative performance in predicting eye movements. However, studies use a wide variety of performance measures with markedly different properties, which makes comparison difficult. We make three main contributions to this line of research. First, we argue for a set of desirable properties, review commonly used measures, and conclude that no single measure unites all desirable properties. However, the area under the ROC curve (a classification measure) and the KL-divergence (a distance measure between probability distributions) combine many desirable properties and allow a meaningful comparison of critical model performance. We give an analytical proof of the linearity of the ROC measure with respect to averaging over subjects, and demonstrate an appropriate correction of entropy-based measures like KL-divergence for small sample sizes in the context of eye-tracking data. Second, we provide a lower bound and an upper bound on these measures, based on image-independent properties of fixation data and on between-subject consistency, respectively. These bounds provide a reference frame for judging the predictive power of a model of fixation selection. We provide open-source Python code to compute the reference frame. Third, we show that the upper, between-subject consistency bound holds only for models that predict averages of subject populations. Departing from this, we show that incorporating subject-specific viewing behavior can generate predictions that surpass that upper bound. Taken together, these findings lay out the information required for a well-founded judgment of the quality of any model of fixation selection, and this information should therefore be reported whenever a new model is introduced.
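
    For readers unfamiliar with the two recommended measures, here is a rough sketch, under the assumption that the model's output is a saliency map and fixations are pixel coordinates: AUC scores how well the map separates fixated from randomly sampled locations, and KL-divergence compares an empirical fixation density against the map normalized to a distribution. This illustrates the general measures, not the authors' published code (ties in the rank computation are ignored for brevity, and the small-sample correction is omitted).

    ```python
    import numpy as np

    def auc(saliency, fix_rows, fix_cols, n_neg=10_000, seed=0):
        """ROC area for saliency values at fixated vs. random locations."""
        rng = np.random.default_rng(seed)
        pos = saliency[fix_rows, fix_cols]
        neg = saliency[rng.integers(0, saliency.shape[0], n_neg),
                       rng.integers(0, saliency.shape[1], n_neg)]
        scores = np.concatenate([pos, neg])
        ranks = np.empty(scores.size)
        ranks[np.argsort(scores)] = np.arange(scores.size)   # 0-based ranks
        u = ranks[:pos.size].sum() - pos.size * (pos.size - 1) / 2
        return u / (pos.size * neg.size)                     # P(pos > neg)

    def kl_divergence(fix_density, model_density, eps=1e-12):
        """KL(p || q) between empirical fixation density and model density."""
        p = fix_density / fix_density.sum()
        q = model_density / model_density.sum()
        return float(np.sum(p * np.log((p + eps) / (q + eps))))
    ```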

    Oculomotor Evidence for Top-Down Control following the Initial Saccade

    The goal of the current study was to investigate how salience-driven and goal-driven processes unfold during visual search over multiple eye movements. Eye movements were recorded while observers searched for a target, which was located on (Experiment 1) or defined as (Experiment 2) a specific orientation singleton. This singleton could be the most, medium, or least salient element in the display. Results were analyzed as a function of response time, separately for initial and second eye movements. Irrespective of the search task, initial saccades elicited shortly after the onset of the search display were primarily salience-driven, whereas initial saccades elicited after approximately 250 ms were completely unaffected by salience. Initial saccades were increasingly guided in line with task requirements as response times increased. Second saccades were completely unaffected by salience and were consistently goal-driven, irrespective of response time. These results suggest that stimulus salience affects the visual system only briefly after a visual image enters the brain and has no effect thereafter.

    Integrating Mechanisms of Visual Guidance in Naturalistic Language Production

    Situated language production requires the integration of visual attention and linguistic processing. Previous work has not conclusively disentangled the roles of perceptual scene information and structural sentence information in guiding visual attention. In this paper, we present an eye-tracking study demonstrating that three types of guidance, perceptual, conceptual, and structural, interact to control visual attention. In a cued language production experiment, we manipulate perceptual guidance (scene clutter) and conceptual guidance (cue animacy), and measure structural guidance (syntactic complexity of the utterance). Analysis of the time course of language production, before and during speech, reveals that all three forms of guidance affect the complexity of visual responses, quantified in terms of the entropy of attentional landscapes and the turbulence of scan patterns, especially during speech. We find that perceptual and conceptual guidance mediate the distribution of attention in the scene, whereas structural guidance closely relates to scan-pattern complexity. Furthermore, the eye-voice spans of the cued object and its perceptual competitor are similar, with their latency mediated by both perceptual and structural guidance. These results rule out a strict interpretation of structural guidance as the single dominant form of visual guidance in situated language production. Rather, the phase of the task and the associated demands of cross-modal cognitive processing determine the mechanisms that guide attention.
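
    As a concrete illustration of one of the dependent measures named above, the following sketch computes the entropy of an "attentional landscape", here assumed to be a fixation map smoothed into a probability distribution with a Gaussian kernel; the smoothing step and its width are assumptions about how such landscapes are typically constructed.

    ```python
    import numpy as np
    from scipy.ndimage import gaussian_filter

    def landscape_entropy(fixation_counts, sigma=25):
        """Shannon entropy (bits) of a smoothed, normalized fixation map."""
        landscape = gaussian_filter(fixation_counts.astype(float), sigma)
        p = landscape / landscape.sum()
        p = p[p > 0]                      # 0 * log(0) is taken as 0
        return float(-(p * np.log2(p)).sum())
    ```

    Higher entropy indicates attention spread more evenly across the scene; lower entropy indicates attention concentrated on a few regions.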

    The contributions of image content and behavioral relevancy to overt attention

    During free viewing of natural scenes, eye movements are guided by bottom-up factors inherent to the stimulus, as well as top-down factors inherent to the observer. The question of how these two sources of information interact and contribute to fixation behavior has recently received much attention. Here, a battery of 15 visual stimulus features was used to quantify the contribution of stimulus properties during free viewing of four categories of images (Natural, Urban, Fractal, and Pink Noise). Behaviorally relevant information was estimated in the form of topographical interestingness maps by asking an independent set of subjects to click on the image regions they subjectively found most interesting. Using a Bayesian scheme, we computed saliency functions that described the probability of a given feature being fixated. For the stimulus features, the precise shape of the saliency functions depended strongly on image category, and overall the saliency associated with these features was generally weak. When testing multiple features jointly, a linear additive integration model of individual saliencies performed satisfactorily. We found that the saliency associated with interesting locations was much higher than that of any low-level image feature or any pair-wise combination thereof. Furthermore, the low-level image features were maximally salient at those locations that already had high interestingness ratings. Temporal analysis showed that regions with high interestingness ratings were fixated as early as the third fixation following stimulus onset. Paralleling these findings, fixation durations depended mainly on interestingness ratings and to a lesser extent on the low-level image features. Our results suggest that both low- and high-level sources of information play a significant role during exploration of complex scenes, with behaviorally relevant information being more effective than stimulus features.
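
    The Bayesian scheme mentioned above can be sketched as follows: the saliency function for a feature is the posterior p(fixated | feature value), obtained via Bayes' rule from histograms of the feature at fixated versus all image locations. The bin count and the crude estimate of the prior are illustrative choices, not taken from the paper.

    ```python
    import numpy as np

    def saliency_function(feat_at_fixations, feat_everywhere, bins=50):
        """p(fixated | feature) = p(feature | fixated) * p(fixated) / p(feature)."""
        edges = np.histogram_bin_edges(feat_everywhere, bins=bins)
        p_f_given_fix, _ = np.histogram(feat_at_fixations, edges, density=True)
        p_f, _ = np.histogram(feat_everywhere, edges, density=True)
        prior = feat_at_fixations.size / feat_everywhere.size  # crude p(fixated)
        posterior = np.divide(p_f_given_fix * prior, p_f,
                              out=np.zeros_like(p_f), where=p_f > 0)
        return edges, posterior  # one saliency value per feature bin
    ```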

    Scenes, saliency maps and scanpaths

    The aim of this chapter is to review some of the key research investigating how people look at pictures. In particular, my goal is to provide theoretical background for those who are new to the field, while also explaining some of the relevant methods and analyses. I begin by introducing eye movements in the context of natural scene perception. As in other complex tasks, eye movements provide a measure of attention and information processing over time, and they tell us how the foveated visual system determines what to prioritise. I then describe some of the many measures which have been derived to summarize where people look in complex images. These include global measures, analyses based on regions of interest, and comparisons based on heat maps. A particularly popular approach for explaining fixation locations is the saliency map approach, and the first half of the chapter is mostly devoted to this topic. A large number of papers and models are built on this approach, but it is also worth spending time on it because the methods involved have been used across a wide range of applications. The saliency map approach is based on the facts that the visual system has topographic maps of visual features, that contrast within these features seems to be represented and prioritized, and that a central representation can be used to control attention and eye movements. This approach and its underlying principles have led to an increase in the number of researchers using complex natural scenes as stimuli. It is therefore important that those new to the field are familiar with saliency maps, their usage, and their pitfalls. I describe the original implementation of this approach (Itti & Koch, 2000), which applies spatial filtering at different levels of coarseness and combines the results in an attempt to identify the regions which stand out from their background. Evaluating this model requires comparing fixation locations to model predictions. Several different experimental and comparison methods have been used, but most recent research shows that bottom-up guidance is rather limited in terms of predicting real eye movements. The second part of the chapter is largely concerned with measuring eye movement scanpaths. Scanpaths are the sequential patterns of fixations and saccades made when looking at something for a period of time. They show regularities which may reflect top-down attention, and some have attempted to link these to memory and an individual’s mental model of what they are looking at. While not all researchers will be testing hypotheses about scanpaths, an understanding of the underlying methods and theory will benefit all. I describe the theories behind analyzing eye movements in this way, and various methods which have been used to represent and compare scanpaths. These methods allow one to quantify the similarity between two viewing patterns, and this similarity is linked to both the image and the observer. The last part of the chapter describes some applications of eye movements in image viewing. The methods discussed can be applied to complex images, and therefore these experiments can tell us about perception in art and marketing, as well as about machine vision.
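
    To make the center-surround idea behind the Itti & Koch model concrete, here is a toy single-channel, single-scale-pair sketch, assuming intensity as the feature: the image is blurred at a fine and a coarse scale, and their absolute difference highlights regions that differ from their surroundings. The full model additionally uses color and orientation channels, several scale pairs, and a normalization and combination stage.

    ```python
    import numpy as np
    from scipy.ndimage import gaussian_filter

    def center_surround_saliency(gray, center_sigma=2, surround_sigma=16):
        """Toy conspicuity map from one center-surround scale pair."""
        img = gray.astype(float)
        center = gaussian_filter(img, center_sigma)      # fine-scale response
        surround = gaussian_filter(img, surround_sigma)  # coarse-scale response
        conspicuity = np.abs(center - surround)          # "stands out" measure
        return conspicuity / (conspicuity.max() or 1.0)  # normalize to [0, 1]
    ```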

    Saliency-Based Applications

    No full text