Search CORE

712 research outputs found

Lucid Data Dreaming for Video Object Segmentation

Author: Benenson Rodrigo
Brox Thomas
Ilg Eddy
Khoreva Anna
Schiele Bernt
Publication venue
Publication date: 01/01/2019
Field of study

Convolutional networks reach top quality in pixel-level video object segmentation but require a large amount of training data (1k~100k) to deliver such results. We propose a new training strategy which achieves state-of-the-art results across three evaluation datasets while using 20x~1000x less annotated data than competing methods. Our approach is suitable for both single and multiple object segmentation. Instead of using large training sets hoping to generalize across domains, we generate in-domain training data using the provided annotation on the first frame of each video to synthesize ("lucid dream") plausible future video frames. In-domain per-video training data allows us to train high quality appearance- and motion-based models, as well as tune the post-processing stage. This approach allows to reach competitive results even when training from only a single annotated frame, without ImageNet pre-training. Our results indicate that using a larger training set is not automatically better, and that for the video object segmentation task a smaller training set that is closer to the target domain is more effective. This changes the mindset regarding how many training samples and general "objectness" knowledge are required for the video object segmentation task.Comment: Accepted in International Journal of Computer Vision (IJCV

arXiv.org e-Print Archive

MPG.PuRe

Recommended from our members

Exploring Defocus Matting: Nonparametric Acceleration, Super-Resolution, and Off-Center Matting

Author: Avidan Shai
Freeman William T.
Joshi Neel
Matusik Wojciech
Pfister Hanspeter
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 19/05/2010
Field of study

Defocus matting is a fully automatic and passive method for pulling mattes from video captured with coaxial cameras that have different depths of field and planes of focus. Nonparametric sampling can accelerate the video-matting process from minutes to seconds per frame. In addition, a super-resolution technique efficiently bridges the gap between mattes from high-resolution video cameras and those from low-resolution cameras. Off-center matting pulls mattes for an external high-resolution camera that doesn't share the same center of projection as the low-resolution cameras used to capture the defocus matting data.Engineering and Applied Science

Harvard University - DASH

A Survey on Video-based Graphics and Video Visualization

Author: Xianghua Xie
Publication venue: EUROGRAPHICS
Publication date: 01/01/2011
Field of study

Cronfa at Swansea University

The World of Fast Moving Objects

Author: Kotera Jan
Matas Jiri
Novotny Lukas
Rozumnyi Denys
Sroubek Filip
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 23/11/2016
Field of study

The notion of a Fast Moving Object (FMO), i.e. an object that moves over a distance exceeding its size within the exposure time, is introduced. FMOs may, and typically do, rotate with high angular speed. FMOs are very common in sports videos, but are not rare elsewhere. In a single frame, such objects are often barely visible and appear as semi-transparent streaks. A method for the detection and tracking of FMOs is proposed. The method consists of three distinct algorithms, which form an efficient localization pipeline that operates successfully in a broad range of conditions. We show that it is possible to recover the appearance of the object and its axis of rotation, despite its blurred appearance. The proposed method is evaluated on a new annotated dataset. The results show that existing trackers are inadequate for the problem of FMO localization and a new approach is required. Two applications of localization, temporal super-resolution and highlighting, are presented

arXiv.org e-Print Archive

Crossref