8 research outputs found

    Geometry-based spherical JND modeling for 360° display

    360° videos have received widespread attention due to the realistic and immersive experiences they offer users. To date, accurately modeling user perception on a 360° display remains a challenging issue. In this paper, we exploit the visual characteristics of 360° projection and display and extend the popular just noticeable difference (JND) model to a spherical JND (SJND). First, we propose a quantitative 2D-JND model by jointly considering spatial contrast sensitivity, luminance adaptation and the texture masking effect. In particular, our model introduces an entropy-based region classification and uses different parameters for different types of regions for better modeling performance. Second, we extend our 2D-JND model to SJND by jointly exploiting latitude projection and field of view during 360° display. With this operation, SJND reflects both the characteristics of the human visual system and the 360° display. Third, our SJND model is more consistent with user perception in subjective tests and also tolerates more distortion at lower bit rates during 360° video compression. To further examine the effectiveness of our SJND model, we embed it in Versatile Video Coding (VVC) compression. Compared with the state of the art, our SJND-VVC framework significantly reduces the bit rate with negligible loss in visual quality.
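
    The abstract does not give the exact formulas, but the general shape of such a model can be sketched as below. Everything here is a generic illustration: the luminance term follows the classic Chou-Li piecewise form, the texture term, constants, overlap factor and the 1/cos(latitude) row weighting are placeholders rather than the parameters proposed in the paper.

        # Hedged sketch: a pixel-domain 2D-JND map (luminance adaptation + texture
        # masking) extended to an equirectangular "spherical" map by a latitude weight.
        import numpy as np

        def luminance_adaptation(bg_lum):
            """Piecewise luminance-adaptation threshold (Chou-Li style), bg_lum in [0, 255]."""
            return np.where(
                bg_lum <= 127,
                17.0 * (1.0 - np.sqrt(bg_lum / 127.0)) + 3.0,
                3.0 / 128.0 * (bg_lum - 127.0) + 3.0,
            )

        def texture_masking(grad_mag, region_weight):
            """Masking grows with local gradient magnitude; the weight would be chosen
            per region class (smooth / edge / texture), e.g. from an entropy-based test."""
            return region_weight * grad_mag

        def jnd_2d(bg_lum, grad_mag, region_weight=0.12):
            la = luminance_adaptation(bg_lum)
            tm = texture_masking(grad_mag, region_weight)
            overlap = 0.3  # nonlinear additivity: combined threshold is less than the sum
            return la + tm - overlap * np.minimum(la, tm)

        def spherical_jnd(jnd_map):
            """Scale each row of an equirectangular JND map by 1/cos(latitude):
            rows near the poles are oversampled, so they tolerate more distortion."""
            h, _ = jnd_map.shape
            phi = (np.arange(h) + 0.5) / h * np.pi - np.pi / 2.0  # latitude per row
            weight = 1.0 / np.maximum(np.cos(phi), 1e-3)
            return jnd_map * weight[:, None]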

    A Novel Macroblock Level Rate Control Method for Stereo Video Coding

    To compress stereo video effectively, this paper proposes a novel macroblock (MB) level rate control method based on binocular perception. A binocular just noticeable difference (BJND) model based on parallax matching is first used to describe binocular perception. The proposed rate control method then operates in stereo video coding at four levels, namely, the view level, group-of-pictures (GOP) level, frame level, and MB level. At the view level, different proportions of the bitrate are allocated to the left and right views of the stereo video according to a pre-determined statistical rate allocation proportion. At the GOP level, the total number of bits allocated to each GOP is computed and the initial quantization parameter of each GOP is set. At the frame level, the target bits allocated to each frame are computed. At the MB level, a visual perception factor, measured by the BJND value of the MB, is used to adjust the MB-level bit allocation, so that the rate control results are in line with human visual characteristics. Experimental results show that the proposed method controls the bitrate more accurately and achieves better subjective quality for stereo video than other methods.
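
    As a rough illustration of the MB-level step only, the sketch below spreads a frame's bit budget over macroblocks using a BJND-derived weight. The inverse-weighting rule, the alpha parameter and the function name are assumptions for illustration, not the paper's formula, and the view/GOP/frame levels are assumed to have already fixed the frame budget.

        # Hedged sketch: BJND-weighted macroblock bit allocation.
        def allocate_mb_bits(frame_budget, bjnd_values, alpha=1.0):
            """Give more bits to macroblocks with low BJND (binocular distortion is
            easily noticed there) and fewer bits where BJND is high."""
            weights = [1.0 / (1.0 + alpha * b) for b in bjnd_values]  # illustrative rule
            total = sum(weights)
            return [frame_budget * w / total for w in weights]

        # Example: 4 macroblocks sharing an 8000-bit frame budget.
        print(allocate_mb_bits(8000, bjnd_values=[0.5, 2.0, 0.1, 1.0]))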

    Visual Saliency in Video Compression and Transmission

    This dissertation explores the concept of visual saliency, a measure of the propensity for drawing visual attention, and presents various novel methods for utilizing visual saliency in video compression and transmission. Specifically, a computationally efficient method for visual saliency estimation in digital images and videos is developed, which approximates one of the most well-known visual saliency models. In the context of video compression, a saliency-aware video coding method is proposed within a region-of-interest (ROI) video coding paradigm. The proposed video coding method attempts to reduce attention-grabbing coding artifacts and keep viewers’ attention in areas where the quality is highest. The method allows visual saliency to increase in high-quality parts of the frame and to decrease in non-ROI parts. Using this approach, the proposed method is able to achieve the same subjective quality as competing state-of-the-art methods at a lower bit rate. In the context of video transmission, a novel saliency-cognizant error concealment method is presented for ROI-based video streaming in which regions with higher visual saliency are protected more heavily than low-saliency regions. In the proposed error concealment method, a low-saliency prior is added to the error concealment process as a regularization term, which serves two purposes. First, it provides additional side information for the decoder to identify the correct replacement blocks for concealment. Second, in the event that a perfectly matched block cannot be unambiguously identified, the low-saliency prior reduces viewers’ visual attention on the loss-stricken regions, resulting in higher overall subjective quality. During the course of this research, an eye-tracking dataset for several standard video sequences was created and made publicly available. This dataset can be utilized to test saliency models for video and to evaluate various perceptually motivated algorithms for video processing and video quality assessment.
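
    The low-saliency prior can be pictured as a penalty term in the candidate-selection cost during concealment. The sketch below is a hypothetical illustration only: the function name, the lam weight and the linear cost combination are not taken from the dissertation.

        # Hedged sketch: pick a replacement block by boundary-matching error plus a
        # penalty on the saliency the candidate would introduce in the lost region.
        import numpy as np

        def conceal_block(candidates, boundary_errors, candidate_saliency, lam=0.5):
            """candidates: candidate replacement blocks (arrays)
            boundary_errors: matching cost against the surviving neighbor pixels
            candidate_saliency: mean saliency each candidate would add
            lam: weight of the low-saliency prior (illustrative value)."""
            costs = np.asarray(boundary_errors) + lam * np.asarray(candidate_saliency)
            return candidates[int(np.argmin(costs))]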

    No-reference image and video quality assessment: a classification and review of recent approaches


    Image Quality Evaluation in Lossy Compressed Images

    This research focuses on the quantification of image quality in lossy compressed images, exploring the impact of digital artefacts and scene characteristics upon image quality evaluation. A subjective paired comparison test was implemented to assess the perceived quality of JPEG 2000 against baseline JPEG over a range of different scene types. Interval scales were generated for both algorithms, which indicated a subjective preference for JPEG 2000, particularly at low bit rates, and these were confirmed by an objective distortion measure. The subjective results did not follow this trend for some scenes, however, and both algorithms were found to be scene dependent as a result of the artefacts produced at high compression rates. The scene dependencies were explored from the interval scale results, which allowed scenes to be grouped according to their susceptibilities to each of the algorithms. Groupings were correlated with scene measures applied in a linked study. A pilot study was undertaken to explore perceptibility thresholds of JPEG 2000 compression for the same set of images. This work was developed into a further experiment investigating the thresholds of perceptibility and acceptability of higher-resolution JPEG 2000 compressed images. A set of images was captured using a professional-level full-frame Digital Single Lens Reflex camera, using a raw workflow and a carefully controlled image-processing pipeline. The scenes were quantified using a set of simple scene metrics to classify them as average, higher than average, or lower than average for a number of scene properties known to affect image compression and perceived image quality; these were used to make a final selection of test images. Image fidelity was investigated using the method of constant stimuli to quantify perceptibility thresholds and just noticeable differences (JNDs) of perceptibility. Thresholds and JNDs of acceptability were also quantified to explore suprathreshold quality evaluation. The relationships between the two thresholds were examined and correlated with the results from the scene measures, to identify more and less susceptible scenes. It was found that the level of, and difference between, the two thresholds were indicators of scene dependency and could be predicted by certain types of scene characteristics. A third study implemented the soft-copy quality ruler as an alternative psychophysical method, by matching the quality of compressed images to a set of images varying in a single attribute, separated by known JND increments of quality. The imaging chain and image-processing workflow were evaluated using objective measures of tone reproduction and spatial frequency response. An alternative approach to the creation of ruler images was implemented and tested, and the resulting quality rulers were used to evaluate a subset of the images from the previous study. The quality ruler was found to be successful in identifying scene susceptibilities and observer sensitivity. The fourth investigation explored the implementation of four different image quality metrics: the Modular Image Difference Metric, the Structural Similarity Metric, the Multi-Scale Structural Similarity Metric, and the Weighted Structural Similarity Metric. The metrics were tested against the subjective results, and all were found to correlate linearly with perceived image quality.
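
    For readers unfamiliar with the method of constant stimuli, the sketch below shows one common way to recover a perceptibility threshold and a JND by fitting a cumulative-Gaussian psychometric function. The data points and the 50%/75% reading convention are illustrative assumptions, not the study's actual measurements.

        # Hedged sketch: threshold and JND from method-of-constant-stimuli data.
        import numpy as np
        from scipy.optimize import curve_fit
        from scipy.stats import norm

        def psychometric(x, mu, sigma):
            """Cumulative-Gaussian psychometric function."""
            return norm.cdf(x, loc=mu, scale=sigma)

        # Compression levels and proportion of trials with a reported visible
        # difference (made-up numbers for illustration).
        levels = np.array([0.5, 1.0, 1.5, 2.0, 2.5, 3.0])
        p_detect = np.array([0.05, 0.15, 0.40, 0.70, 0.90, 0.98])

        (mu, sigma), _ = curve_fit(psychometric, levels, p_detect, p0=[1.7, 0.5])
        threshold = mu                        # 50% point: perceptibility threshold
        jnd = norm.ppf(0.75, mu, sigma) - mu  # distance from the 50% to the 75% point
        print(f"threshold ~ {threshold:.2f}, JND ~ {jnd:.2f}")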

    SSIM-Inspired Quality Assessment, Compression, and Processing for Visual Communications

    Objective Image and Video Quality Assessment (I/VQA) measures predict image/video quality as perceived by human beings, the ultimate consumers of visual data. Existing research in the area is mainly limited to benchmarking and monitoring of visual data. The use of I/VQA measures in the design and optimization of image/video processing algorithms and systems is more desirable, challenging and fruitful, but has not been well explored. Among the recently proposed objective I/VQA approaches, the structural similarity (SSIM) index and its variants have emerged as promising measures that show superior performance compared with the widely used mean squared error (MSE) and are computationally simple compared with other state-of-the-art perceptual quality measures. In addition, SSIM has a number of desirable mathematical properties for optimization tasks. The goal of this research is to break the tradition of using MSE as the optimization criterion for image and video processing algorithms. We tackle several important problems in visual communication applications by exploiting SSIM-inspired design and optimization to achieve significantly better performance. Firstly, the original SSIM is a full-reference IQA (FR-IQA) measure that requires access to the original reference image, making it impractical in many visual communication applications. We propose a general-purpose reduced-reference IQA (RR-IQA) method that can estimate SSIM with high accuracy with the help of a small number of RR features extracted from the original image. Furthermore, we introduce and demonstrate the novel idea of partially repairing an image using RR features. Secondly, image processing algorithms such as image denoising and image super-resolution are required at various stages of visual communication systems, from image acquisition to image display at the receiver. We incorporate SSIM into the framework of sparse signal representation and non-local means methods and demonstrate improved performance in image denoising and super-resolution. Thirdly, we incorporate SSIM into the framework of perceptual video compression. We propose an SSIM-based rate-distortion optimization scheme and an SSIM-inspired divisive normalization method that transforms the DCT-domain frame residuals into a perceptually uniform space. Both approaches demonstrate the potential to largely improve the rate-distortion performance of state-of-the-art video codecs. Finally, in real-world visual communications, it is a common experience that end users receive video with significantly time-varying quality due to variations in video content/complexity, codec configuration, and network conditions. How human visual quality of experience (QoE) changes with such time-varying video quality is not yet well understood. We propose a quality adaptation model that is asymmetrically tuned to increasing and decreasing quality. The model improves upon the direct SSIM approach in predicting the subjective perceptual experience of time-varying video quality.
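
    The core idea of SSIM-based rate-distortion optimization can be pictured as a mode-decision loop that scores candidates with J = (1 - SSIM) + lambda * R instead of an MSE-based cost. The code below is a minimal illustration using scikit-image's SSIM as a stand-in metric; the lambda value, mode set and function name are assumptions, not the scheme proposed in the thesis.

        # Hedged sketch: SSIM-driven mode decision. Blocks are assumed grayscale
        # and at least 7x7 (the default SSIM window size).
        from skimage.metrics import structural_similarity as ssim

        def choose_mode(original_block, candidate_recons, candidate_rates, lam=10.0):
            """Pick the coding mode minimizing J = (1 - SSIM) + lambda * rate."""
            best_mode, best_cost = None, float("inf")
            for mode, (recon, rate) in enumerate(zip(candidate_recons, candidate_rates)):
                d = 1.0 - ssim(original_block, recon, data_range=255)  # SSIM-based distortion
                cost = d + lam * rate
                if cost < best_cost:
                    best_mode, best_cost = mode, cost
            return best_mode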

    Activity in area V3A predicts positions of moving objects

    No description supplied.

    Engineering Data Compendium. Human Perception and Performance, Volume 1

    The concept underlying the Engineering Data Compendium was the product of an R and D program (the Integrated Perceptual Information for Designers project) aimed at facilitating the application of basic research findings in human performance to the design of military crew systems. The principal objective was to develop a workable strategy for: (1) identifying and distilling information of potential value to system design from the existing research literature, and (2) presenting this technical information in a way that would aid its accessibility, interpretability, and applicability by system designers. The present four volumes of the Engineering Data Compendium represent the first implementation of this strategy. This is Volume 1, which contains sections on Visual Acquisition of Information, Auditory Acquisition of Information, and Acquisition of Information by Other Senses.