Search CORE

134,601 research outputs found

Exploiting visual saliency for assessing the impact of car commercials upon viewers

Author: Díaz de María Fernando
Fernández Martínez Fernando
Fernández Torres Miguel Ángel
Garcia Faura Alvaro
González Díaz Iván
Hernández García Alejandro
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/08/2018
Field of study

Content based video indexing and retrieval (CBVIR) is a lively area of research which focuses on automating the indexing, retrieval and management of videos. This area has a wide spectrum of promising applications where assessing the impact of audiovisual productions emerges as a particularly interesting and motivating one. In this paper we present a computational model capable to predict the impact (i.e. positive or negative) upon viewers of car advertisements videos by using a set of visual saliency descriptors. Visual saliency provides information about parts of the image perceived as most important, which are instinctively targeted by humans when looking at a picture or watching a video. For this reason we propose to exploit visual information, introducing it as a new feature which reflects high-level semantics objectively, to improve the video impact categorization results. The suggested salience descriptors are inspired by the mechanisms that underlie the attentional abilities of the human visual system and organized into seven distinct families according to different measurements over the identified salient areas in the video frames, namely population, size, location, geometry, orientation, movement and photographic composition. Proposed approach starts by computing saliency maps for all the video frames, where two different visual saliency detection frameworks have been considered and evaluated: the popular graph based visual saliency (GBVS) algorithm, and a state-of-the-art DNN-based approach.This work has been partially supported by the National Grants RTC-2016-5305-7 and TEC2014-53390-P of the Spanish Ministry of Economy and Competitiveness.Publicad

Universidad Carlos III de Madrid e-Archivo

Object Detection Through Exploration With A Foveated Visual Field

Author: A Borji
A Lewis
A Torralba
B Alexe
BR Beutter
BW Tatler
C Bradley
C Morvan
CA Curcio
CA Curcio
CA Curcio
CH Lampert
CJ Ludwig
DG Lowe
DM Dacey
DM Levi
Emre Akbas
GJ Zelinsky
GL Malcolm
H Larochelle
H Strasburger
H Yamamoto
I Kokkinos
J Elder
J Freeman
J Hosang
J Najemnik
J Najemnik
J Rovamo
JH Elder
JM Findlay
JM Findlay
K Koehler
L Itti
L Itti
L Zhaoping
L Zhaoping
LW Renninger
MB Neider
MF Land
Miguel P. Eckstein
MJ Choi
MP Eckstein
MP Eckstein
MP Eckstein
MP Eckstein
MP Eckstein
ND Bruce
NJ Butko
NJ Marshall
P Azzopardi
P Kontschieder
P Verghese
P Viola
PF Felzenszwalb
R Rosenholtz
S Ren
S Zhang
SC Mack
T Malisiewicz
T Wertheim
TJ Preston
W Zhang
Wolfgang Einhäuser
X Chen
Z Li
ZP Li
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/10/2017
Field of study

We present a foveated object detector (FOD) as a biologically-inspired alternative to the sliding window (SW) approach which is the dominant method of search in computer vision object detection. Similar to the human visual system, the FOD has higher resolution at the fovea and lower resolution at the visual periphery. Consequently, more computational resources are allocated at the fovea and relatively fewer at the periphery. The FOD processes the entire scene, uses retino-specific object detection classifiers to guide eye movements, aligns its fovea with regions of interest in the input image and integrates observations across multiple fixations. Our approach combines modern object detectors from computer vision with a recent model of peripheral pooling regions found at the V1 layer of the human visual system. We assessed various eye movement strategies on the PASCAL VOC 2007 dataset and show that the FOD performs on par with the SW detector while bringing significant computational cost savings.Comment: An extended version of this manuscript was published in PLOS Computational Biology (October 2017) at https://doi.org/10.1371/journal.pcbi.100574

arXiv.org e-Print Archive

CiteSeerX

Directory of Open Access Journals

The reentry hypothesis: The putative interaction of the frontal eye field, ventrolateral prefrontal cortex, and areas V4, IT for attention and eye movement

Author: Hamker Fred H.
Publication venue
Publication date: 01/01/2005
Field of study

Attention is known to play a key role in perception, including action selection, object recognition and memory. Despite findings revealing competitive interactions among cell populations, attention remains difficult to explain. The central purpose of this paper is to link up a large number of findings in a single computational approach. Our simulation results suggest that attention can be well explained on a network level involving many areas of the brain. We argue that attention is an emergent phenomenon that arises from reentry and competitive interactions. We hypothesize that guided visual search requires the usage of an object-specific template in prefrontal cortex to sensitize V4 and IT cells whose preferred stimuli match the target template. This induces a feature-specific bias and provides guidance for eye movements. Prior to an eye movement, a spatially organized reentry from occulomotor centers, specifically the movement cells of the frontal eye field, occurs and modulates the gain of V4 and IT cells. The processes involved are elucidated by quantitatively comparing the time course of simulated neural activity with experimental data. Using visual search tasks as an example, we provide clear and empirically testable predictions for the participation of IT, V4 and the frontal eye field in attention. Finally, we explain a possible physiological mechanism that can lead to non-flat search slopes as the result of a slow, parallel discrimination process

CiteSeerX

Caltech Authors

A computational approach to the covert and overt deployment of spatial attention

Author: Alexandre Frédéric
Fix Jérémy
Rougier Nicolas P.
Publication venue
Publication date: 26/09/2008
Field of study

Popular computational models of visual attention tend to neglect the influence of saccadic eye movements whereas it has been shown that the primates perform on average three of them per seconds and that the neural substrate for the deployment of attention and the execution of an eye movement might considerably overlap. Here we propose a computational model in which the deployment of attention with or without a subsequent eye movement emerges from local, distributed and numerical computations

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

Probabilistic modeling of eye movement data during conjunction search via feature-based attention

Author: Koch Christof
Rutishauser Ueli
Publication venue: 'Association for Research in Vision and Ophthalmology (ARVO)'
Publication date: 01/04/2007
Field of study

Where the eyes fixate during search is not random; rather, gaze reflects the combination of information about the target and the visual input. It is not clear, however, what information about a target is used to bias the underlying neuronal responses. We here engage subjects in a variety of simple conjunction search tasks while tracking their eye movements. We derive a generative model that reproduces these eye movements and calculate the conditional probabilities that observers fixate, given the target, on or near an item in the display sharing a specific feature with the target. We use these probabilities to infer which features were biased by top-down attention: Color seems to be the dominant stimulus dimension for guiding search, followed by object size, and lastly orientation. We use the number of fixations it took to find the target as a measure of task difficulty. We find that only a model that biases multiple feature dimensions in a hierarchical manner can account for the data. Contrary to common assumptions, memory plays almost no role in search performance. Our model can be fit to average data of multiple subjects or to individual subjects. Small variations of a few key parameters account well for the intersubject differences. The model is compatible with neurophysiological findings of V4 and frontal eye fields (FEF) neurons and predicts the gain modulation of these cells

Caltech Authors

A computer vision model for visual-object-based attention and eye movements

Author: Backer
Bonmassar
Chambers
Craighero
Duncan
Fang Wang
Herman Martins Gomes
Hoffman
Horowitz
Itti
Juan
Kelley
LaBerge
Lee
McPeek
Pashler
Posner
Pylyshyn
Rensink
Rizzolatti
Robert Fisher
Scholl
Sela
Serences
Sun
Thompson
Tipper
Tsotsos
Walther
Wright
Yaoru Sun
Publication venue: 'Elsevier BV'
Publication date: 01/11/2008
Field of study

This is the post-print version of the final paper published in Computer Vision and Image Understanding. The published article is available from the link below. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. Copyright @ 2008 Elsevier B.V.This paper presents a new computational framework for modelling visual-object-based attention and attention-driven eye movements within an integrated system in a biologically inspired approach. Attention operates at multiple levels of visual selection by space, feature, object and group depending on the nature of targets and visual tasks. Attentional shifts and gaze shifts are constructed upon their common process circuits and control mechanisms but also separated from their different function roles, working together to fulfil flexible visual selection tasks in complicated visual environments. The framework integrates the important aspects of human visual attention and eye movements resulting in sophisticated performance in complicated natural scenes. The proposed approach aims at exploring a useful visual selection system for computer vision, especially for usage in cluttered natural visual environments.National Natural Science of Founda- tion of Chin

Brunel University Research Archive

Slowness and Sparseness Lead to Place, Head-Direction, and Spatial-View Cells

Author: Franzius Mathias
Sprekeler Henning
Wiskott Prof. Dr. Laurenz
Publication venue
Publication date: 01/08/2007
Field of study

We present a model for the self-organized formation of place cells, head-direction cells, and spatial-view cells in the hippocampal formation based on unsupervised learning on quasi-natural visual stimuli. The model comprises a hierarchy of Slow Feature Analysis (SFA) nodes, which were recently shown to reproduce many properties of complex cells in the early visual system. The system extracts a distributed grid-like representation of position and orientation, which is transcoded into a localized place-field, head-direction, or view representation, by sparse coding. The type of cells that develops depends solely on the relevant input statistics, i.e., the movement pattern of the simulated animal. The numerical simulations are complemented by a mathematical analysis that allows us to accurately predict the output of the top SFA laye

CogPrints Cognitive Sciences Eprint Archive

A Computational Model of Spatial Memory Anticipation during Visual Search

Author: Fix Jérémy
Rougier Nicolas
Vitay Julien
Publication venue
Publication date: 30/09/2006
Field of study

Some visual search tasks require to memorize the location of stimuli that have been previously scanned. Considerations about the eye movements raise the question of how we are able to maintain a coherent memory, despite the frequent drastically changes in the perception. In this article, we present a computational model that is able to anticipate the consequences of the eye movements on the visual perception in order to update a spatial memor

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

Oskar Bordeaux