49 research outputs found
Quality comparison of the HEVC and VP9 encoders performance
This paper reports a comparison between two recent video codecs, namely the HEVC and the VP9, using High Definition Video Sequences encoded with different bit rates. A subjective test for the evaluation of the provided Quality of Experience is reported. The video sequences were shown to a panel of subjects on a High Definition LED display and the subjective tests were performed using a Single Stimulus Methodology. The results shown that the HEVC encoder provides a better visual quality on low bit rates than the VP9. Similar performance was obtained for visually lossless conditions, although the HEVC requires lower bit rates to reach that level. Moreover, the correlation of the subjective evaluation and three tested objective metrics (PSNR, SSIM, and FSIM) revealed a good representation of the subjective results, particularly the SSIM and the FSIM metrics.info:eu-repo/semantics/publishedVersio
User generated HDR gaming video streaming : dataset, codec comparison and challenges
Gaming video streaming services have grown tremendously in the past few
years, with higher resolutions, higher frame rates and HDR gaming videos
getting increasingly adopted among the gaming community. Since gaming content
as such is different from non-gaming content, it is imperative to evaluate the
performance of the existing encoders to help understand the bandwidth
requirements of such services, as well as further improve the compression
efficiency of such encoders. Towards this end, we present in this paper
GamingHDRVideoSET, a dataset consisting of eighteen 10-bit UHD-HDR gaming
videos and encoded video sequences using four different codecs, together with
their objective evaluation results. The dataset is available online at [to be
added after paper acceptance]. Additionally, the paper discusses the codec
compression efficiency of most widely used practical encoders, i.e., x264
(H.264/AVC), x265 (H.265/HEVC) and libvpx (VP9), as well the recently proposed
encoder libaom (AV1), on 10-bit, UHD-HDR content gaming content. Our results
show that the latest compression standard AV1 results in the best compression
efficiency, followed by HEVC, H.264, and VP9.Comment: 14 pages, 8 figures, submitted to IEEE journa
Video Compression and Optimization Technologies - Review
The use of video streaming is constantly increasing. High-resolution video requires resources on both the sender and the receiver side. There are many compression techniques that can be utilized to compress the video and simultaneously maintain quality. The main goal of this paper is to provide an overview of video streaming and QoE. This paper describes the basic concepts and discusses existing methodologies to measure QoE. Subjective, objective, and video compression technologies are discussed. This review paper gathers the codec implementation developed by MPEG, Google, and Apple. This paper outlines the challenges and future research directions that should be considered in the measurement and assessment of quality of experience for video services
Quality of Experience in Immersive Video Technologies
Over the last decades, several technological revolutions have impacted the television industry, such as the shifts from black & white to color and from standard to high-definition. Nevertheless, further considerable improvements can still be achieved to provide a better multimedia experience, for example with ultra-high-definition, high dynamic range & wide color gamut, or 3D. These so-called immersive technologies aim at providing better, more realistic, and emotionally stronger experiences. To measure quality of experience (QoE), subjective evaluation is the ultimate means since it relies on a pool of human subjects. However, reliable and meaningful results can only be obtained if experiments are properly designed and conducted following a strict methodology. In this thesis, we build a rigorous framework for subjective evaluation of new types of image and video content. We propose different procedures and analysis tools for measuring QoE in immersive technologies. As immersive technologies capture more information than conventional technologies, they have the ability to provide more details, enhanced depth perception, as well as better color, contrast, and brightness. To measure the impact of immersive technologies on the viewersâ QoE, we apply the proposed framework for designing experiments and analyzing collected subjectsâ ratings. We also analyze eye movements to study human visual attention during immersive content playback. Since immersive content carries more information than conventional content, efficient compression algorithms are needed for storage and transmission using existing infrastructures. To determine the required bandwidth for high-quality transmission of immersive content, we use the proposed framework to conduct meticulous evaluations of recent image and video codecs in the context of immersive technologies. Subjective evaluation is time consuming, expensive, and is not always feasible. Consequently, researchers have developed objective metrics to automatically predict quality. To measure the performance of objective metrics in assessing immersive content quality, we perform several in-depth benchmarks of state-of-the-art and commonly used objective metrics. For this aim, we use ground truth quality scores, which are collected under our subjective evaluation framework. To improve QoE, we propose different systems for stereoscopic and autostereoscopic 3D displays in particular. The proposed systems can help reducing the artifacts generated at the visualization stage, which impact picture quality, depth quality, and visual comfort. To demonstrate the effectiveness of these systems, we use the proposed framework to measure viewersâ preference between these systems and standard 2D & 3D modes. In summary, this thesis tackles the problems of measuring, predicting, and improving QoE in immersive technologies. To address these problems, we build a rigorous framework and we apply it through several in-depth investigations. We put essential concepts of multimedia QoE under this framework. These concepts not only are of fundamental nature, but also have shown their impact in very practical applications. In particular, the JPEG, MPEG, and VCEG standardization bodies have adopted these concepts to select technologies that were proposed for standardization and to validate the resulting standards in terms of compression efficiency
Comparison of HDR quality metrics in Per-Clip Lagrangian multiplier optimisation with AV1
The complexity of modern codecs along with the increased need of delivering
high-quality videos at low bitrates has reinforced the idea of a per-clip
tailoring of parameters for optimised rate-distortion performance. While the
objective quality metrics used for Standard Dynamic Range (SDR) videos have
been well studied, the transitioning of consumer displays to support High
Dynamic Range (HDR) videos, poses a new challenge to rate-distortion
optimisation. In this paper, we review the popular HDR metrics DeltaE100
(DE100), PSNRL100, wPSNR, and HDR-VQM. We measure the impact of employing these
metrics in per-clip direct search optimisation of the rate-distortion Lagrange
multiplier in AV1. We report, on 35 HDR videos, average Bjontegaard Delta Rate
(BD-Rate) gains of 4.675%, 2.226%, and 7.253% in terms of DE100, PSNRL100, and
HDR-VQM. We also show that the inclusion of chroma in the quality metrics has a
significant impact on optimisation, which can only be partially addressed by
the use of chroma offsets.Comment: Accepted version for ICME 2023 Special Session, "Optimised Media
Delivery
Deep Video Precoding
Several groups worldwide are currently investigating how deep learning may advance the state-of-the-art in image and video coding. An open question is how to make deep neural networks work in conjunction with existing (and upcoming) video codecs, such as MPEG H.264/AVC, H.265/HEVC, VVC, Google VP9 and AOMedia AV1, AV2, as well as existing container and transport formats, without imposing any changes at the client side. Such compatibility is a crucial aspect when it comes to practical deployment, especially when considering the fact that the video content industry and hardware manufacturers are expected to remain committed to supporting these standards for the foreseeable future. We propose to use deep neural networks as precoders for current and future video codecs and adaptive video streaming systems. In our current design, the core precoding component comprises a cascaded structure of downscaling neural networks that operates during video encoding, prior to transmission. This is coupled with a precoding mode selection algorithm for each independently-decodable stream segment, which adjusts the downscaling factor according to scene characteristics, the utilized encoder, and the desired bitrate and encoding configuration. Our framework is compatible with all current and future codec and transport standards, as our deep precoding network structure is trained in conjunction with linear upscaling filters (e.g., the bilinear filter), which are supported by all web video players. Extensive evaluation on FHD (1080p) and UHD (2160p) content and with widely-used H.264/AVC, H.265/HEVC and VP9 encoders, as well as a preliminary evaluation with the current test model of VVC (v.6.2rc1), shows that coupling such standards with the proposed deep video precoding allows for 8% to 52% rate reduction under encoding configurations and bitrates suitable for video-on-demand adaptive streaming systems. The use of precoding can also lead to encoding complexity reduction, which is essential for cost-effective cloud deployment of complex encoders like H.265/HEVC, VP9 and VVC, especially when considering the prominence of high-resolution adaptive video streaming