How to Train Your Dragon: Tamed Warping Network for Semantic Video Segmentation

Chen, Yifeng; Cui, Jiabao; Feng, Junyi; Huang, Fuxian; Li, Songyuan; Li, Xi

Authors: Yifeng Chen, Jiabao Cui, Junyi Feng, Fuxian Huang, Songyuan Li, Xi Li
Publication date: 20 July 2020
Abstract

Real-time semantic segmentation of high-resolution video is challenging because of its strict speed requirements. Recent approaches exploit inter-frame continuity to reduce redundant computation: they warp feature maps across adjacent frames, which greatly speeds up inference. However, their accuracy drops significantly owing to imprecise motion estimation and error accumulation. In this paper, we propose a simple and effective correction stage placed right after the warping stage, forming a framework named Tamed Warping Network (TWNet) that aims to improve the accuracy and robustness of warping-based models. Experimental results on the Cityscapes dataset show that, with the correction, accuracy (mIoU) increases significantly from 67.3% to 71.6%, while speed decreases only slightly from 65.5 FPS to 61.8 FPS. For non-rigid categories such as "human" and "object", the IoU improvements exceed 18 percentage points.
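
The warp-then-correct idea in the abstract can be illustrated with a minimal sketch. This is not the TWNet implementation: the abstract does not specify how motion is estimated or how the correction is computed, so the function names `warp_features` and `apply_correction`, the nearest-neighbour sampling, and the additive residual are all assumptions made for illustration.

```python
def warp_features(prev_feat, flow_x, flow_y):
    """Backward-warp a feature map stored as nested lists [C][H][W].

    Assumption: the value at (y, x) in the new frame is read from the previous
    frame at (y + flow_y[y][x], x + flow_x[y][x]), rounded to the nearest
    pixel and clamped to the image border.
    """
    channels = len(prev_feat)
    height = len(prev_feat[0])
    width = len(prev_feat[0][0])
    warped = [[[0.0] * width for _ in range(height)] for _ in range(channels)]
    for y in range(height):
        for x in range(width):
            src_x = min(max(int(round(x + flow_x[y][x])), 0), width - 1)
            src_y = min(max(int(round(y + flow_y[y][x])), 0), height - 1)
            for c in range(channels):
                warped[c][y][x] = prev_feat[c][src_y][src_x]
    return warped


def apply_correction(warped_feat, residual):
    """Hypothetical correction stage: add a residual of the same shape.

    In a real model the residual would be predicted from the current frame;
    here it is simply passed in.
    """
    return [
        [[w + r for w, r in zip(w_row, r_row)] for w_row, r_row in zip(w_ch, r_ch)]
        for w_ch, r_ch in zip(warped_feat, residual)
    ]


# One channel, 2 x 3 feature map; every pixel samples from its left neighbour.
prev_feat = [[[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]]]
flow_x = [[-1, -1, -1], [-1, -1, -1]]
flow_y = [[0, 0, 0], [0, 0, 0]]
warped = warp_features(prev_feat, flow_x, flow_y)
residual = [[[0.5, 0.5, 0.5], [0.5, 0.5, 0.5]]]
corrected = apply_correction(warped, residual)
```

The warp only reuses features from the previous frame, so any error in the flow is copied forward into later frames; a per-frame correction gives the model a chance to repair it before it accumulates.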

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2005.01344

Last time updated on 11/05/2020