Lossy Light Field Compression Using Modern Deep Learning and Domain Randomization Techniques

Svetozar Jeliazkov Valtchev

Authors: Svetozar Jeliazkov Valtchev
Publication date: 7 December 2022

Abstract

Lossy data compression is a type of information encoding that uses approximations to trade accuracy for smaller file sizes. The transmission and storage of images is a typical example in the modern digital world. However, the reconstructed images often suffer from degradation and display visible artifacts. Convolutional Neural Networks have garnered much attention in all corners of Computer Vision, including the tasks of image compression and artifact reduction. We study how lossy compression can be extended to higher-dimensional images with varying viewpoints, known as light fields. Domain Randomization is explored in detail and used to generate the largest light field dataset we are aware of, which serves as training data. We formulate the task of compression within the framework of neural networks and calculate a quantization tensor for the 4-D Discrete Cosine Transform coefficients of the light fields. In order to accurately train the network, a high-degree approximation to the rounding operation is introduced. In addition, we present a multi-resolution convolution-based light field enhancer, producing average gains of 0.854 dB in Peak Signal-to-Noise Ratio and 0.0338 in Structural Similarity Index Measure over the base model, across a wide range of bitrates.
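
The role of a smooth rounding approximation in training a quantizer can be illustrated with a short sketch. The abstract does not state which approximation the thesis uses, so the truncated Fourier series below is an assumption chosen for illustration, and the names `soft_round`, `quantize`, and `dequantize` are hypothetical. The coefficients are a plain list standing in for flattened 4-D DCT coefficients, and `steps` stands in for the matching entries of the quantization tensor.

```python
import math


def soft_round(x, terms=10):
    """Smooth stand-in for round(x) (an assumed form, not the thesis's own).

    Uses the identity round(x) = x - saw(x), where saw is the period-1
    sawtooth, and replaces saw with its Fourier series truncated to
    `terms` sine terms. The result is differentiable everywhere, so
    gradients can pass through the quantization step during training.
    """
    saw = sum(
        (-1) ** (k + 1) * math.sin(2 * math.pi * k * x) / (math.pi * k)
        for k in range(1, terms + 1)
    )
    return x - saw


def quantize(coeffs, steps, rounder=round):
    """Divide each coefficient by its step size, then round."""
    return [rounder(c / q) for c, q in zip(coeffs, steps)]


def dequantize(levels, steps):
    """Rescale quantized levels back to approximate coefficient values."""
    return [level * q for level, q in zip(levels, steps)]


# Hypothetical coefficients and step sizes, for illustration only.
coeffs = [12.3, -7.9, 0.4]
steps = [2.0, 4.0, 1.0]

hard_levels = quantize(coeffs, steps)                      # exact rounding
soft_levels = quantize(coeffs, steps, rounder=soft_round)  # smooth surrogate
reconstructed = dequantize(hard_levels, steps)
```

In a setup like this, the smooth surrogate would be used while training, since exact rounding has zero gradient almost everywhere, and exact rounding would be used when actually encoding. Raising `terms` brings the surrogate closer to true rounding away from half-integers, at the cost of steeper gradients near them.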

Available Versions

YorkSpace

oai:yorkspace.library.yorku.ca...

Last time updated on 19/12/2022