Search CORE

6,203 research outputs found

The Unreasonable Effectiveness of Deep Features as a Perceptual Metric

Author: Efros Alexei A.
Isola Phillip
Shechtman Eli
Wang Oliver
Zhang Richard
Publication venue
Publication date: 10/04/2018
Field of study

While it is nearly effortless for humans to quickly assess the perceptual similarity between two images, the underlying processes are thought to be quite complex. Despite this, the most widely used perceptual metrics today, such as PSNR and SSIM, are simple, shallow functions, and fail to account for many nuances of human perception. Recently, the deep learning community has found that features of the VGG network trained on ImageNet classification has been remarkably useful as a training loss for image synthesis. But how perceptual are these so-called "perceptual losses"? What elements are critical for their success? To answer these questions, we introduce a new dataset of human perceptual similarity judgments. We systematically evaluate deep features across different architectures and tasks and compare them with classic metrics. We find that deep features outperform all previous metrics by large margins on our dataset. More surprisingly, this result is not restricted to ImageNet-trained VGG features, but holds across different deep architectures and levels of supervision (supervised, self-supervised, or even unsupervised). Our results suggest that perceptual similarity is an emergent property shared across deep visual representations.Comment: Accepted to CVPR 2018; Code and data available at https://www.github.com/richzhang/PerceptualSimilarit

arXiv.org e-Print Archive

Crossref

A Reduced Reference Image Quality Measure Using Bessel K Forms Model for Tetrolet Coefficients

Author: Abdelouahad Abdelkaher Ait
Aboutajdine Driss
Cherifi Hocine
Hassouni Mohammed El
Publication venue
Publication date: 01/11/2011
Field of study

In this paper, we introduce a Reduced Reference Image Quality Assessment (RRIQA) measure based on the natural image statistic approach. A new adaptive transform called "Tetrolet" is applied to both reference and distorted images. To model the marginal distribution of tetrolet coefficients Bessel K Forms (BKF) density is proposed. Estimating the parameters of this distribution allows to summarize the reference image with a small amount of side information. Five distortion measures based on the BKF parameters of the original and processed image are used to predict quality scores. A comparison between these measures is presented showing a good consistency with human judgment

arXiv.org e-Print Archive

HAL-uB