Tennis video abstraction from audio and visual cues

Abstract

We propose a context-based model of video abstraction that exploits both audio and visual features, applied to tennis TV programs. It can automatically produce different types of summaries of a given video depending on the user's constraints or preferences. We first designed an efficient and accurate temporal segmentation of the video into segments homogeneous with respect to camera motion. We introduce original visual descriptors related to the dominant and residual image motions. The different summary types are obtained by specifying adapted classification criteria, involving audio features, to select the relevant segments to be included in the video abstract. The proposed scheme has been validated on 22 hours of tennis videos.
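As a rough illustration of the selection step described above, the following sketch greedily picks segments whose (hypothetical) motion or audio cues satisfy a classification criterion, up to a summary duration budget. The `Segment` fields, the threshold, and the selection rule are all assumptions for illustration, not the paper's actual descriptors or classifier.

```python
from dataclasses import dataclass

@dataclass
class Segment:
    start: float            # segment start time, in seconds
    end: float              # segment end time, in seconds
    dominant_motion: float  # hypothetical camera-motion magnitude
    residual_motion: float  # hypothetical player/ball motion energy
    applause: bool          # hypothetical audio cue (e.g., crowd applause)

    @property
    def duration(self) -> float:
        return self.end - self.start

def summarize(segments, max_duration, min_residual=0.5):
    """Greedy selection sketch: keep segments whose residual motion
    exceeds a threshold or that carry an applause cue, stopping once
    the duration budget would be exceeded."""
    summary, total = [], 0.0
    for seg in segments:
        if seg.residual_motion >= min_residual or seg.applause:
            if total + seg.duration > max_duration:
                break
            summary.append(seg)
            total += seg.duration
    return summary
```

In practice the classification criteria would be tuned per summary type (e.g., highlights vs. full-match condensation), which is what allows one segmentation to yield several different abstracts.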
