777 research outputs found

Temporal unpredictability detection of real-time video sequence

Author: Liu Yang
Publication date: 01/01/2008

Imperial Users onl

Spiral - Imperial College Digital Repository

Humanoid visual attention and gaze control

Author: van de Weem J.M.W
Publication date: 01/01/2011

Deep learning investigation for chess player attention prediction using eye-tracking and game data

Author: Crowley James
Guntz Thomas
Louedec Justin Le
Vaufreydaz Dominique
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 17/04/2019

This article reports on an investigation of the use of convolutional neural networks to predict the visual attention of chess players. The visual attention model described in this article has been created to generate saliency maps that capture hierarchical and spatial features of the chessboard, in order to predict the fixation probability for individual pixels. Using a skip-layer autoencoder architecture with a unified decoder, we are able to use multiscale features to predict the saliency of parts of the board at different scales, showing multiple relations between pieces. We have used scan path and fixation data from players engaged in solving chess problems to compute 6600 saliency maps associated with the corresponding chess piece configurations. This corpus is supplemented with synthetically generated data from actual games gathered from an online chess platform. Experiments conducted using both scan paths from chess players and the CAT2000 saliency dataset of natural images highlight several results. Deep features, pretrained on natural images, were found to be helpful in training visual attention prediction for chess. The proposed neural network architecture is able to generate meaningful saliency maps on unseen chess configurations with good scores on standard metrics. This work provides a baseline for future work on visual attention prediction in similar contexts.
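
As a rough illustration of how fixation data can be turned into a saliency map, the sketch below places a Gaussian at each fixation and normalizes the result so that it reads as a per-pixel fixation probability. This is a common way to build ground-truth saliency maps and is only an assumption here; the abstract does not state how the 6600 maps were computed. The function name `fixation_saliency_map`, the grid size, and the `sigma` value are hypothetical.

```python
import math

def fixation_saliency_map(fixations, height, width, sigma=1.5):
    """Sum an isotropic Gaussian at each (row, col) fixation,
    then normalize so that all cells sum to 1."""
    grid = [[0.0] * width for _ in range(height)]
    for fr, fc in fixations:
        for r in range(height):
            for c in range(width):
                d2 = (r - fr) ** 2 + (c - fc) ** 2
                grid[r][c] += math.exp(-d2 / (2.0 * sigma ** 2))
    total = sum(sum(row) for row in grid)
    if total == 0.0:
        # No fixations: return the all-zero map unchanged.
        return grid
    return [[v / total for v in row] for row in grid]

# Hypothetical fixations on an 8x8 board, given as (row, col) cells.
saliency = fixation_saliency_map([(2, 3), (2, 4), (6, 1)], height=8, width=8)
```

Cells near clustered fixations receive the highest probability, which is the kind of target map a saliency network would be trained to predict.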

arXiv.org e-Print Archive

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server

Towards High-Speed Vision for Attention and Navigation of Autonomous City Explorer (ACE)

Author: Kolja Kü
Martin Buss
Tianguang Zhang
Tingting Xu
Publication venue: 'IntechOpen'
Publication date: 01/11/2008

Local Energy Variability as a Generic Measure of Bottom-Up Salience

Author: Anton Garcia-Diaz
Raquel Dosil
Xose M. Pardo
Xose R. Fdez-Vidal
Publication venue: 'IntechOpen'
Publication date: 01/11/2008

Predicting human eye fixations via an LSTM-Based saliency attentive model

Author: Baraldi L.
Cornia M.
Cucchiara R.
Serra G.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2018

Data-driven saliency has recently gained a lot of attention thanks to the use of convolutional neural networks for predicting gaze fixations. In this paper, we go beyond standard approaches to saliency prediction, in which gaze maps are computed with a feed-forward network, and present a novel model which can predict accurate saliency maps by incorporating neural attentive mechanisms. The core of our solution is a convolutional long short-term memory that focuses on the most salient regions of the input image to iteratively refine the predicted saliency map. In addition, to tackle the center bias typical of human eye fixations, our model can learn a set of prior maps generated with Gaussian functions. We show, through an extensive evaluation, that the proposed architecture outperforms the current state of the art on public saliency prediction datasets. We further study the contribution of each key component to demonstrate their robustness in different scenarios.
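
The center-bias prior mentioned in this abstract can be pictured with a minimal sketch: a 2D Gaussian map over normalized image coordinates, multiplied into a saliency map. In the paper the Gaussian parameters are learned; here they are fixed by hand, and the element-wise combination, the function names `gaussian_prior` and `apply_prior`, and the default values are all assumptions made for illustration.

```python
import math

def gaussian_prior(height, width, mu_y=0.5, mu_x=0.5,
                   sigma_y=0.25, sigma_x=0.25):
    """Build a 2D Gaussian map over normalized [0, 1] coordinates.
    Means and standard deviations are fixed here (assumed values);
    a learned prior would update them during training."""
    prior = []
    for r in range(height):
        y = (r + 0.5) / height  # cell center, normalized
        row = []
        for c in range(width):
            x = (c + 0.5) / width
            exponent = ((y - mu_y) ** 2 / (2.0 * sigma_y ** 2)
                        + (x - mu_x) ** 2 / (2.0 * sigma_x ** 2))
            row.append(math.exp(-exponent))
        prior.append(row)
    return prior

def apply_prior(saliency, prior):
    """Weight a saliency map element-wise by a prior map."""
    return [[s * p for s, p in zip(s_row, p_row)]
            for s_row, p_row in zip(saliency, prior)]

# A uniform map becomes center-weighted once the prior is applied.
uniform = [[1.0] * 6 for _ in range(6)]
biased = apply_prior(uniform, gaussian_prior(6, 6))
```

Using several such maps with different means and variances would let a model express more than one spatial bias, which is the role the abstract assigns to its set of prior maps.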

arXiv.org e-Print Archive

Thermo-visual feature fusion for object tracking using multiple spatiogram trackers

Author: Alan Smeaton
C. Yang
Ciarán Ó Conaire
D. Comaniciu
G. Fumera
M. Spengler
Noel E. O’Connor
P. Pérez
R.E. Bellman
R.T. Collins
V. Comaniciu
W. Abd-Almageed
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 11/03/2007

In this paper, we propose a framework that can efficiently combine features for robust tracking based on fusing the outputs of multiple spatiogram trackers. This is achieved without the exponential increase in storage and processing that other multimodal tracking approaches suffer from. The framework allows the features to be split arbitrarily between the trackers, and provides the flexibility to add, remove or dynamically weight features. We derive a mean-shift type algorithm for the framework that allows efficient object tracking with very low computational overhead. We especially target the fusion of thermal infrared and visible spectrum features as the most useful features for automated surveillance applications. Results are shown on multimodal video sequences, clearly illustrating the benefits of combining multiple features using our framework.