107 research outputs found

    Perceptually-Driven Video Coding with the Daala Video Codec

    Full text link
    The Daala project is a royalty-free video codec that attempts to compete with the best patent-encumbered codecs. Part of our strategy is to replace core tools of traditional video codecs with alternative approaches, many of them designed to take perceptual aspects into account, rather than optimizing for simple metrics like PSNR. This paper documents some of our experiences with these tools, which ones worked and which did not. We evaluate which tools are easy to integrate into a more traditional codec design, and show results in the context of the codec being developed by the Alliance for Open Media.Comment: 19 pages, Proceedings of SPIE Workshop on Applications of Digital Image Processing (ADIP), 201

    A comprehensive video codec comparison

    Get PDF
    In this paper, we compare the video codecs AV1 (version 1.0.0-2242 from August 2019), HEVC (HM and x265), AVC (x264), the exploration software JEM which is based on HEVC, and the VVC (successor of HEVC) test model VTM (version 4.0 from February 2019) under two fair and balanced configurations: All Intra for the assessment of intra coding and Maximum Coding Efficiency with all codecs being tuned for their best coding efficiency settings. VTM achieves the highest coding efficiency in both configurations, followed by JEM and AV1. The worst coding efficiency is achieved by x264 and x265, even in the placebo preset for highest coding efficiency. AV1 gained a lot in terms of coding efficiency compared to previous versions and now outperforms HM by 24% BD-Rate gains. VTM gains 5% over AV1 in terms of BD-Rates. By reporting separate numbers for JVET and AOM test sequences, it is ensured that no bias in the test sequences exists. When comparing only intra coding tools, it is observed that the complexity increases exponentially for linearly increasing coding efficiency

    Analisis Perbandingan Teknik Video Codec H.264/AVC, H.265/HEVC, VP9 dan AV1

    Get PDF
    Video applications consume more energy on the Internet and can be accessed by electronic devices, due to an increase in the consumption of high-resolution and high-quality video content, presenting serious issues to delivery infrastructure that needs higher video compression technologies. The focus of this paper is to evaluate the quality of the most current codec, AV1, to its predecessor codec. The comparison was made experimentally at two video resolutions (1080p and 720p) by sampling video frames with various CRF/CQP values and testing several parameters analyses such as encoding duration, compression ratio, bit rate, Mean Square Error (MSE), and Peak Signal to Noise Ratio (PSNR). The AV1 codec is very great in terms of quality and file size, even though it is slower in terms of compression speed. The H.265/HEVC codec, on the other side, beats the other codec in terms of compression ratio. In conclusion, the H.265/HEVC codec is suggested as a material for obtaining a well compressed video with small file size and a short time.Video applications consume more energy on the Internet and can be accessed by electronic devices, due to an increase in the consumption of high-resolution and high-quality video content, presenting serious issues to delivery infrastructure that needs higher video compression technologies. The focus of this paper is to evaluate the quality of the most current codec, AV1, to its predecessor codec. The comparison was made experimentally at two video resolutions (1080p and 720p) by sampling video frames with various CRF/CQP values and testing several parameters analyses such as encoding duration, compression ratio, bit rate, Mean Square Error (MSE), and Peak Signal to Noise Ratio (PSNR). The AV1 codec is very great in terms of quality and file size, even though it is slower in terms of compression speed. The H.265/HEVC codec, on the other side, beats the other codec in terms of compression ratio. In conclusion, the H.265/HEVC codec is suggested as a material for obtaining a well compressed video with small file size and a short time

    Deep perceptual preprocessing for video coding

    Get PDF
    We introduce the concept of rate-aware deep perceptual preprocessing (DPP) for video encoding. DPP makes a single pass over each input frame in order to enhance its visual quality when the video is to be compressed with any codec at any bitrate. The resulting bitstreams can be decoded and displayed at the client side without any post-processing component. DPP comprises a convolutional neural network that is trained via a composite set of loss functions that incorporates: (i) a perceptual loss based on a trained no-reference image quality assessment model, (ii) a reference-based fidelity loss expressing L1 and structural similarity aspects, (iii) a motion-based rate loss via block-based transform, quantization and entropy estimates that converts the essential components of standard hybrid video encoder designs into a trainable framework. Extensive testing using multiple quality metrics and AVC, AV1 and VVC encoders shows that DPP+encoder reduces, on average, the bitrate of the corresponding encoder by 11%. This marks the first time a server-side neural processing component achieves such savings over the state-of-the-art in video coding
    corecore