Search CORE

11 research outputs found

Simple vs complex temporal recurrences for video saliency prediction

Author: Giró-i-Nieto Xavier
Linardos Panagiotis
McGuinness Kevin
Mohedano Eva
Nieto Juan Jose
O'Connor Noel E.
Publication venue: 'British Machine Vision Association and Society for Pattern Recognition'
Publication date: 01/01/2019
Field of study

This paper investigates modifying an existing neural network architecture for static saliency prediction using two types of recurrences that integrate information from the temporal domain. The first modification is the addition of a ConvLSTM within the architecture, while the second is a conceptually simple exponential moving average of an internal convolutional state. We use weights pre-trained on the SALICON dataset and fine-tune our model on DHF1K. Our results show that both modifications achieve state-of-the-art results and produce similar saliency maps. Source code is available at https://git.io/fjPiB

arXiv.org e-Print Archive

UPCommons. Portal del coneixement obert de la UPC

Irish Universities

DCU Online Research Access Service

Towards End-to-end Video-based Eye-Tracking

Author: E Chong
O Chapelle
PK Mital
Q Huang
S Hochreiter
S Park
T Fischer
Y Cheng
Y Sugano
Z Li
Publication venue
Publication date: 01/01/2020
Field of study

Estimating eye-gaze from images alone is a challenging task, in large parts due to un-observable person-specific factors. Achieving high accuracy typically requires labeled data from test users which may not be attainable in real applications. We observe that there exists a strong relationship between what users are looking at and the appearance of the user's eyes. In response to this understanding, we propose a novel dataset and accompanying method which aims to explicitly learn these semantic and temporal relationships. Our video dataset consists of time-synchronized screen recordings, user-facing camera views, and eye gaze data, which allows for new benchmarks in temporal gaze tracking as well as label-free refinement of gaze. Importantly, we demonstrate that the fusion of information from visual stimuli as well as eye images can lead towards achieving performance similar to literature-reported figures acquired through supervised personalization. Our final method yields significant performance improvements on our proposed EVE dataset, with up to a 28 percent improvement in Point-of-Gaze estimates (resulting in 2.49 degrees in angular error), paving the path towards high-accuracy screen-based eye tracking purely from webcam sensors. The dataset and reference source code are available at https://ait.ethz.ch/projects/2020/EVEComment: Accepted at ECCV 202

arXiv.org e-Print Archive

Repository for Publications and Research Data

Crossref

Simple vs complex temporal recurrences for video saliency prediction

Author: Giró Nieto Xavier
Linardos Panagiotis
McGuinness Kevin
Mohedano Eva
Nieto Juan Jose
O'Connor Noel
Publication venue
Publication date: 01/01/2019
Field of study