Search CORE

5 research outputs found

Self-supervised Video Representation Learning by Uncovering Spatio-temporal Statistics

Author: Bao Linchao
He Shengfeng
Jiao Jianbo
Liu Wei
Liu Yun-hui
Wang Jiangliu
Publication venue
Publication date: 31/08/2020
Field of study

This paper proposes a novel pretext task to address the self-supervised video representation learning problem. Specifically, given an unlabeled video clip, we compute a series of spatio-temporal statistical summaries, such as the spatial location and dominant direction of the largest motion, the spatial location and dominant color of the largest color diversity along the temporal axis, etc. Then a neural network is built and trained to yield the statistical summaries given the video frames as inputs. In order to alleviate the learning difficulty, we employ several spatial partitioning patterns to encode rough spatial locations instead of exact spatial Cartesian coordinates. Our approach is inspired by the observation that human visual system is sensitive to rapidly changing contents in the visual field, and only needs impressions about rough spatial locations to understand the visual contents. To validate the effectiveness of the proposed approach, we conduct extensive experiments with four 3D backbone networks, i.e., C3D, 3D-ResNet, R(2+1)D and S3D-G. The results show that our approach outperforms the existing approaches across these backbone networks on four downstream video analysis tasks including action recognition, video retrieval, dynamic scene recognition, and action similarity labeling. The source code is publicly available at: https://github.com/laura-wang/video_repres_sts.Comment: Accepted by TPAMI. An extension of our previous work at arXiv:1904.0359

arXiv.org e-Print Archive

University of Birmingham Research Portal

Institutional Knowledge at Singapore Management University

Oxford University Research Archive

Self-supervised learning of pose embeddings from spatiotemporal relations in videos

Author: Dencker Tobias
Ommer Björn
Sümer Ömer
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 14/01/2023
Field of study

OPUS Augsburg