Research on Deep Learning-based Fractional Interpolation in Video Coding

Abstract

Motion-compensated prediction is one of the essential methods for reducing temporal redundancy in inter coding. Its goal is to predict the current frame from a list of reference frames. Recent video coding standards commonly use interpolation filters to obtain sub-pixel samples when the best-matching block is located at a fractional position in the reference frame. However, fixed filters cannot adapt to the wide variety of natural video content. Inspired by the success of Convolutional Neural Networks (CNNs) in super-resolution, we propose CNN-based fractional interpolation for the luminance (luma) and chrominance (chroma) components in motion-compensated prediction to improve coding efficiency. Moreover, two syntax elements, indicating the interpolation methods for the luma and chroma components respectively, are added to the bin string and encoded by CABAC in regular mode. As a result, our proposal achieves 2.9%, 0.3%, and 0.6% BD-rate reductions for Y, U, and V, respectively, under the low-delay P configuration.
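To illustrate the fixed filtering that the proposed CNN replaces, the sketch below applies HEVC's 8-tap half-pel luma interpolation filter (coefficients sum to 64) along one row of integer pixels. This is a minimal NumPy illustration of the conventional baseline, not the authors' network or codec integration; the function name is ours.

```python
import numpy as np

# HEVC's fixed 8-tap half-sample luma interpolation filter (gain = 64).
# The abstract's point: one fixed filter like this cannot adapt to varied
# content, motivating a learned CNN interpolator instead.
HALF_PEL = np.array([-1, 4, -11, 40, 40, -11, 4, -1], dtype=np.int64)

def half_pel_interpolate(row: np.ndarray) -> np.ndarray:
    """Interpolate half-sample positions along one row of integer pixels."""
    # np.convolve flips the kernel, so reverse it to get correlation.
    out = np.convolve(row.astype(np.int64), HALF_PEL[::-1], mode="valid")
    # Round, normalize by the filter gain (divide by 64), clip to 8 bits.
    return np.clip((out + 32) >> 6, 0, 255)

# In a flat region every half-pel sample must reproduce the same value.
row = np.full(16, 100, dtype=np.int64)
print(half_pel_interpolate(row))  # nine half-pel samples, all equal to 100
```

A CNN-based interpolator would replace `half_pel_interpolate` with a trained network mapping integer-pel patches to fractional-position samples, and the two added syntax elements would signal, per component, whether the fixed or the learned filter is used.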