Deep Neural Networks for No-Reference and Full-Reference Image Quality Assessment

Author: Bosse Sebastian
Maniry Dominique
Müller Klaus-Robert
Samek Wojciech
Wiegand Thomas
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 07/12/2017

We present a deep neural network-based approach to image quality assessment (IQA). The network is trained end-to-end and comprises ten convolutional layers and five pooling layers for feature extraction, and two fully connected layers for regression, which makes it significantly deeper than related IQA models. Unique features of the proposed architecture are that: 1) with slight adaptations it can be used in a no-reference (NR) as well as in a full-reference (FR) IQA setting and 2) it allows for joint learning of local quality and local weights, i.e., the relative importance of local quality to the global quality estimate, in a unified framework. Our approach is purely data-driven and does not rely on hand-crafted features or other types of prior domain knowledge about the human visual system or image statistics. We evaluate the proposed approach on the LIVE, CSIQ, and TID2013 databases as well as the LIVE In the Wild Image Quality Challenge Database and show superior performance to state-of-the-art NR and FR IQA methods. Finally, cross-database evaluation shows a high ability to generalize between different databases, indicating a high robustness of the learned features.
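The joint use of local quality and local weights described in this abstract can be illustrated with a weighted average over image patches. The following is a minimal sketch, not the authors' implementation: the function name `aggregate_patch_quality`, the ReLU-plus-epsilon treatment of the raw weights, and the example numbers are all assumptions made for illustration, and in the actual model both inputs would be produced by the network rather than supplied by hand.

```python
import numpy as np

def aggregate_patch_quality(local_quality, local_weight_raw, eps=1e-6):
    """Combine per-patch quality scores into one global score.

    local_quality:    per-patch quality estimates (hypothetical network output)
    local_weight_raw: per-patch raw importance values (hypothetical network output)
    eps:              small constant keeping every weight strictly positive
                      (an assumption of this sketch)
    """
    q = np.asarray(local_quality, dtype=float)
    # Clip negative raw weights to zero, then add eps so the sum is never zero.
    w = np.maximum(np.asarray(local_weight_raw, dtype=float), 0.0) + eps
    # Global quality is the weight-normalised average of the local scores.
    return float(np.sum(w * q) / np.sum(w))

# Made-up example: three patches with different local quality scores.
patch_quality = [30.0, 50.0, 70.0]
uniform = aggregate_patch_quality(patch_quality, [1.0, 1.0, 1.0])
emphasised = aggregate_patch_quality(patch_quality, [0.0, 0.0, 5.0])
```

With equal weights the result is the plain mean of the patch scores; when one patch dominates the weights, the global estimate moves towards that patch's local quality.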

arXiv.org e-Print Archive

Fraunhofer-ePrints

MPG.PuRe

Semantic Perceptual Image Compression using Deep Convolution Networks

Author: DiLillo Antonella
Garber Solomon
Moran Nick
Prakash Aaditya
Storer James
Publication date: 29/03/2017

It has long been considered a significant problem to improve the visual quality of lossy image and video compression. Recent advances in computing power, together with the availability of large training data sets, have increased interest in the application of deep learning CNNs to address image recognition and image processing tasks. Here, we present a powerful CNN tailored to the specific task of semantic image understanding to achieve higher visual quality in lossy compression. A modest increase in complexity is added to the encoder, which allows a standard, off-the-shelf JPEG decoder to be used. While JPEG encoding may be optimized for generic images, the process is ultimately unaware of the specific content of the image to be compressed. Our technique makes JPEG content-aware by designing and training a model to identify multiple semantic regions in a given image. Unlike object detection techniques, our model does not require labeling of object positions and is able to identify objects in a single pass. We present a new CNN architecture directed specifically to image compression. By adding a complete set of features for every class, and then taking a threshold over the sum of all feature activations, it generates a map that highlights semantically-salient regions so that they can be encoded at a higher quality than background regions. Experiments are presented on the Kodak PhotoCD dataset and the MIT Saliency Benchmark dataset, in which our algorithm achieves higher visual quality for the same compressed size. Comment: Accepted to Data Compression Conference, 11 pages, 5 figures.
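The map-generation step in this abstract (summing feature activations over all classes, then thresholding) can be sketched as follows. This is an illustration under stated assumptions rather than the authors' code: the function names, the array layout `(num_classes, H, W)`, the threshold value, and the two quality levels are hypothetical, and the abstract does not say how the map is translated into encoder settings.

```python
import numpy as np

def semantic_saliency_map(class_activations, threshold):
    """Return a boolean (H, W) mask of semantically salient pixels.

    class_activations: array of shape (num_classes, H, W), one activation
                       map per class (layout assumed for this sketch)
    threshold:         cut-off applied to the summed activations (assumed value)
    """
    acts = np.asarray(class_activations, dtype=float)
    summed = acts.sum(axis=0)   # sum of all class activations per pixel
    return summed > threshold   # keep only strongly activated pixels

def quality_map(mask, salient_quality=90, background_quality=50):
    """Assign a hypothetical per-pixel quality level from the saliency mask."""
    return np.where(mask, salient_quality, background_quality)

# Made-up activations: two classes respond in the top-left 2x2 block,
# and one class responds weakly in the bottom-right corner.
acts = np.zeros((2, 4, 4))
acts[0, :2, :2] = 0.6
acts[1, :2, :2] = 0.5
acts[1, 3, 3] = 0.3
mask = semantic_saliency_map(acts, threshold=1.0)
qmap = quality_map(mask)
```

Summing before thresholding lets several moderately active classes jointly mark a region as salient, while an isolated weak response stays in the background.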

arXiv.org e-Print Archive

Crossref

Reduced-Reference Video Quality Metric Using Spatial Information in Salient Regions

Author: Abdul Rahman Farah Diyana
Agrafiotis Dimitris
Khalifa Othman O.
Zhang Fan
Publication venue: 'Universitas Ahmad Dahlan'
Publication date: 01/06/2018

In multimedia transmission, it is important to rely on an objective quality metric which accurately represents the subjective quality of processed images and video sequences. Maintaining acceptable Quality of Experience in video transmission requires the ability to measure the quality of the video seen at the receiver end. Reduced-reference metrics make use of side-information that is transmitted to the receiver for estimating the quality of the received sequence with low complexity. This attribute enables real-time assessment and detection of visual degradation caused by transmission and compression errors. A novel reduced-reference video quality metric, the Spatial Information in Salient Regions Reduced Reference Metric, is proposed. The proposed approach makes use of spatial activity to estimate the received sequence distortion after concealment. The statistical elements analysed in this work are based on extracted edges and their luminance distributions. Results highlight that the proposed edge dissimilarity measure has a good correlation with DMOS scores from the LIVE Video Database.
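The reduced-reference idea in this abstract (send a small edge-based statistic as side-information, then compare it with the same statistic computed at the receiver) can be sketched as below. This is not the proposed metric: as a stand-in, the sketch uses the standard deviation of the Sobel gradient magnitude inside a region mask, in the style of the ITU-T P.910 spatial information measure, and all function names, the mask, and the test frames are assumptions made for illustration.

```python
import numpy as np

def sobel_magnitude(luma):
    """Sobel gradient magnitude of a 2-D luminance frame (edge-padded)."""
    p = np.pad(np.asarray(luma, dtype=float), 1, mode="edge")
    # Horizontal gradient: weighted right column minus weighted left column.
    gx = (p[:-2, 2:] + 2 * p[1:-1, 2:] + p[2:, 2:]) \
       - (p[:-2, :-2] + 2 * p[1:-1, :-2] + p[2:, :-2])
    # Vertical gradient: weighted bottom row minus weighted top row.
    gy = (p[2:, :-2] + 2 * p[2:, 1:-1] + p[2:, 2:]) \
       - (p[:-2, :-2] + 2 * p[:-2, 1:-1] + p[:-2, 2:])
    return np.hypot(gx, gy)

def spatial_information(luma, mask):
    """Spread of edge strength inside the (hypothetical) salient region."""
    return float(np.std(sobel_magnitude(luma)[mask]))

def si_difference(reference_si, received_luma, mask):
    """Compare transmitted side-information with the received frame's value."""
    return abs(reference_si - spatial_information(received_luma, mask))

# Made-up frames: the reference has a vertical step edge, while the
# received frame has lost it (for example after heavy concealment).
reference = np.zeros((8, 8))
reference[:, 4:] = 100.0
received = np.full((8, 8), 50.0)
mask = np.ones((8, 8), dtype=bool)   # whole frame treated as salient here

ref_si = spatial_information(reference, mask)   # the only value "transmitted"
same = si_difference(ref_si, reference, mask)
degraded = si_difference(ref_si, received, mask)
```

Only the scalar `ref_si` needs to reach the receiver, which is what keeps a reduced-reference scheme cheap compared with sending the full reference frame.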

Journal of Education and Learning (EduLearn)

TELKOMNIKA (Telecommunication Computing Electronics and Control)

UAD Journal Management System