Summarizing Videos with Attention

A Graves; D Potapov; K Zhang; L dos Santos Belo; M Fei; M Gygli; Mayu Otani; O Russakovsky; RJ Williams; S Hochreiter; SEF De Avila; V Argyriou; Y Yuan

slides

Summarizing Videos with Attention

Authors: A Graves
D Potapov
K Zhang
L dos Santos Belo
M Fei
M Gygli
Mayu Otani
O Russakovsky
RJ Williams
S Hochreiter
SEF De Avila
V Argyriou
Y Yuan
Publication date: 21 February 2019
Publisher
Doi

Abstract

In this work we propose a novel method for supervised, keyshots based video summarization by applying a conceptually simple and computationally efficient soft, self-attention mechanism. Current state of the art methods leverage bi-directional recurrent networks such as BiLSTM combined with attention. These networks are complex to implement and computationally demanding compared to fully connected networks. To that end we propose a simple, self-attention based network for video summarization which performs the entire sequence to sequence transformation in a single feed forward pass and single backward pass during training. Our method sets a new state of the art results on two benchmarks TvSum and SumMe, commonly used in this domain.Comment: Presented at ACCV2018 AIU2018 worksho

Similar works

Full text

Available Versions

Crossref

Last time updated on 10/08/2021

Durham Research Online

oai:durham-repository.worktrib...

Last time updated on 14/08/2023