Transform and entropy models are the two core components of deep image compression neural networks. Most existing learning-based image compression methods employ convolution-based transforms, which lack the ability to model long-range dependencies, primarily due to the limited receptive field of the convolution operation. To address this limitation, we propose a
Transformer-based nonlinear transform. This transform efficiently captures both local and global information from the input image, leading to a more decorrelated latent representation.
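As a rough illustration, the sketch below shows one way a Transformer-based analysis transform could combine strided convolutions (local detail) with self-attention (global context). The module names, dimensions, and conv/attention mix are illustrative assumptions, not the paper's exact architecture:

```python
# Minimal sketch of a Transformer-based analysis transform (encoder side).
# Names and dimensions are illustrative assumptions, not the paper's exact design.
import torch
import torch.nn as nn

class TransformerAnalysisBlock(nn.Module):
    def __init__(self, dim=192, heads=8):
        super().__init__()
        self.norm1 = nn.LayerNorm(dim)
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.norm2 = nn.LayerNorm(dim)
        self.mlp = nn.Sequential(nn.Linear(dim, 4 * dim), nn.GELU(), nn.Linear(4 * dim, dim))

    def forward(self, x):                      # x: (B, C, H, W) feature map
        b, c, h, w = x.shape
        t = x.flatten(2).transpose(1, 2)       # (B, H*W, C): every position can attend
        q = self.norm1(t)                      # to every other, so the receptive field
        t = t + self.attn(q, q, q, need_weights=False)[0]  # is global, unlike a conv
        t = t + self.mlp(self.norm2(t))
        return t.transpose(1, 2).reshape(b, c, h, w)

class AnalysisTransform(nn.Module):
    """Downsampling convs capture local structure; attention blocks add global context."""
    def __init__(self, dim=192):
        super().__init__()
        self.down = nn.Sequential(
            nn.Conv2d(3, dim, 5, stride=2, padding=2), nn.GELU(),
            nn.Conv2d(dim, dim, 5, stride=2, padding=2), nn.GELU(),
        )
        self.blocks = nn.Sequential(TransformerAnalysisBlock(dim), TransformerAnalysisBlock(dim))

    def forward(self, img):                    # img: (B, 3, H, W) -> latent y
        return self.blocks(self.down(img))
```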
In addition, we introduce a novel entropy model that incorporates two different hyperpriors to model cross-channel and spatial dependencies of the latent representation.
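A hedged sketch of such a dual-hyperprior entropy model follows. The abstract does not specify how the two branches are factorized or fused, so the split into a spatial branch (strided convs) and a channel branch (1x1 convs), and the concatenation-based fusion, are assumptions; quantization and entropy coding are omitted:

```python
# Sketch of an entropy model with two hyperpriors: one branch for spatial
# dependencies, one for cross-channel dependencies. Fusion by concatenation
# is an assumption, not necessarily the paper's design.
import torch
import torch.nn as nn

class DualHyperpriorEntropyModel(nn.Module):
    def __init__(self, dim=192, hyper_dim=128):
        super().__init__()
        # Spatial hyperprior: strided convs summarize local neighborhoods of y.
        self.h_a_spatial = nn.Sequential(
            nn.Conv2d(dim, hyper_dim, 3, stride=2, padding=1), nn.GELU(),
            nn.Conv2d(hyper_dim, hyper_dim, 3, stride=2, padding=1))
        self.h_s_spatial = nn.Sequential(
            nn.ConvTranspose2d(hyper_dim, hyper_dim, 4, stride=2, padding=1), nn.GELU(),
            nn.ConvTranspose2d(hyper_dim, dim, 4, stride=2, padding=1))
        # Channel hyperprior: 1x1 convs mix information across channels only.
        self.h_a_channel = nn.Sequential(
            nn.Conv2d(dim, hyper_dim, 1), nn.GELU(), nn.Conv2d(hyper_dim, hyper_dim, 1))
        self.h_s_channel = nn.Sequential(
            nn.Conv2d(hyper_dim, hyper_dim, 1), nn.GELU(), nn.Conv2d(hyper_dim, dim, 1))
        # Fuse both hyper-decoded features into mean/scale of a conditional Gaussian.
        self.params = nn.Conv2d(2 * dim, 2 * dim, 1)

    def forward(self, y):                      # y: (B, dim, H, W), H and W divisible by 4
        z_sp = self.h_a_spatial(y)             # spatial side information
        z_ch = self.h_a_channel(y)             # cross-channel side information
        ctx = torch.cat([self.h_s_spatial(z_sp), self.h_s_channel(z_ch)], dim=1)
        mean, scale = self.params(ctx).chunk(2, dim=1)
        return mean, scale                     # parameters of p(y | z_sp, z_ch)
```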
To further improve the entropy model, we add a global context that leverages
distant relationships to predict the current latent more accurately. This
global context employs a causal attention mechanism to extract long-range
information in a content-dependent manner.
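The sketch below illustrates the causal-attention idea behind this global context: each latent position attends only to previously decoded positions, so long-range, content-dependent information can inform its prediction. The raster-scan ordering and single attention layer are simplifying assumptions:

```python
# Causal (masked) attention over flattened latent positions: position i may
# attend only to positions j <= i, mimicking the decoding order. Raster-scan
# ordering and a single layer are simplifications, not the paper's exact model.
import torch
import torch.nn as nn

class CausalGlobalContext(nn.Module):
    def __init__(self, dim=192, heads=8):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.proj = nn.Linear(dim, dim)

    def forward(self, y_hat):                  # y_hat: (B, C, H, W) latents decoded so far
        b, c, h, w = y_hat.shape
        t = y_hat.flatten(2).transpose(1, 2)   # (B, H*W, C), raster-scan token order
        n = t.size(1)
        # Boolean upper-triangular mask: True blocks attention to future positions.
        mask = torch.triu(torch.ones(n, n, dtype=torch.bool, device=t.device), diagonal=1)
        ctx, _ = self.attn(t, t, t, attn_mask=mask)
        ctx = self.proj(ctx)                   # global context feature per position
        return ctx.transpose(1, 2).reshape(b, c, h, w)
```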
Our experiments show that the proposed framework outperforms state-of-the-art methods in rate-distortion performance.

Comment: Accepted to IEEE 22nd International Conference on Machine Learning and Applications 2023 (ICMLA) - Selected for Oral Presentation