22 research outputs found

    Efficient Motion Estimation and Mode Decision Algorithms for Advanced Video Coding

    Get PDF
    H.264/AVC video compression standard achieved significant improvements in coding efficiency, but the computational complexity of the H.264/AVC encoder is drastically high. The main complexity of encoder comes from variable block size motion estimation (ME) and rate-distortion optimized (RDO) mode decision methods. This dissertation proposes three different methods to reduce computation of motion estimation. Firstly, the computation of each distortion measure is reduced by proposing a novel two step edge based partial distortion search (TS-EPDS) algorithm. In this algorithm, the entire macroblock is divided into different sub-blocks and the calculation order of partial distortion is determined based on the edge strength of the sub-blocks. Secondly, we have developed an early termination algorithm that features an adaptive threshold based on the statistical characteristics of rate-distortion (RD) cost regarding current block and previously processed blocks and modes. Thirdly, this dissertation presents a novel adaptive search area selection method by utilizing the information of the previously computed motion vector differences (MVDs). In H.264/AVC intra coding, DC mode is used to predict regions with no unified direction and the predicted pixel values are same and thus smooth varying regions are not well de-correlated. This dissertation proposes an improved DC prediction (IDCP) mode based on the distance between the predicted and reference pixels. On the other hand, using the nine prediction modes in intra 4x4 and 8x8 block units needs a lot of overhead bits. In order to reduce the number of overhead bits, an intra mode bit rate reduction method is suggested. This dissertation also proposes an enhanced algorithm to estimate the most probable mode (MPM) of each block. The MPM is derived from the prediction mode direction of neighboring blocks which have different weights according to their positions. This dissertation also suggests a fast enhanced cost function for mode decision of intra encoder. The enhanced cost function uses sum of absolute Hadamard-transformed differences (SATD) and mean absolute deviation of the residual block to estimate distortion part of the cost function. A threshold based large coefficients count is also used for estimating the bit-rate part

    Fast Motion Estimation Algorithms for Block-Based Video Coding Encoders

    Get PDF
    The objective of my research is reducing the complexity of video coding standards in real-time scalable and multi-view applications

    Spatial Prediction in the H.264/AVC FRExt Coder and its Optimization

    Get PDF
    The chapter presents a review of the fast spatial prediction strategy that were designed for the Intra coding mode of the video coding standard H.264/AVC. At the end, the author presents an effective strategy based on belief propagation message passing

    Mode decision for the H.264/AVC video coding standard

    Get PDF
    H.264/AVC video coding standard gives us a very promising future for the field of video broadcasting and communication because of its high coding efficiency compared with other older video coding standards. However, high coding efficiency also carries high computational complexity. Fast motion estimation and fast mode decision are two very useful techniques which can significantly reduce computational complexity. This thesis focuses on the field of fast mode decision. The goal of this thesis is that for very similar RD performance compared with H.264/AVC video coding standard, we aim to find new fast mode decision techniques which can afford significant time savings. [Continues.

    Fast motion estimation algorithms for block-based video coding encoders

    Get PDF
    The objective of my research is reducing the complexity of video coding standards in real-time scalable and multi-view applications.EThOS - Electronic Theses Online ServiceGBUnited Kingdo

    Complexity adaptation in video encoders for power limited platforms

    Get PDF
    With the emergence of video services on power limited platforms, it is necessary to consider both performance-centric and constraint-centric signal processing techniques. Traditionally, video applications have a bandwidth or computational resources constraint or both. The recent H.264/AVC video compression standard offers significantly improved efficiency and flexibility compared to previous standards, which leads to less emphasis on bandwidth. However, its high computational complexity is a problem for codecs running on power limited plat- forms. Therefore, a technique that integrates both complexity and bandwidth issues in a single framework should be considered. In this thesis we investigate complexity adaptation of a video coder which focuses on managing computational complexity and provides significant complexity savings when applied to recent standards. It consists of three sub functions specially designed for reducing complexity and a framework for using these sub functions; Variable Block Size (VBS) partitioning, fast motion estimation, skip macroblock detection, and complexity adaptation framework. Firstly, the VBS partitioning algorithm based on the Walsh Hadamard Transform (WHT) is presented. The key idea is to segment regions of an image as edges or flat regions based on the fact that prediction errors are mainly affected by edges. Secondly, a fast motion estimation algorithm called Fast Walsh Boundary Search (FWBS) is presented on the VBS partitioned images. Its results outperform other commonly used fast algorithms. Thirdly, a skip macroblock detection algorithm is proposed for use prior to motion estimation by estimating the Discrete Cosine Transform (DCT) coefficients after quantisation. A new orthogonal transform called the S-transform is presented for predicting Integer DCT coefficients from Walsh Hadamard Transform coefficients. Complexity saving is achieved by deciding which macroblocks need to be processed and which can be skipped without processing. Simulation results show that the proposed algorithm achieves significant complexity savings with a negligible loss in rate-distortion performance. Finally, a complexity adaptation framework which combines all three techniques mentioned above is proposed for maximizing the perceptual quality of coded video on a complexity constrained platform

    Fast Intra-frame Coding Algorithm for HEVC Based on TCM and Machine Learning

    Get PDF
    High Efficiency Video Coding (HEVC) is the latest video coding standard. Compared with the previous standard H.264/AVC, it can reduce the bit-rate by around 50% while maintaining the same perceptual quality. This performance gain on compression is achieved mainly by supporting larger Coding Unit (CU) size and more prediction modes. However, since the encoder needs to traverse all possible choices to mine out the best way of encoding data, this large flexibility on block size and prediction modes has caused a tremendous increase in encoding time. In HEVC, intra-frame coding is an important basis, and it is widely used in all configurations. Therefore, fast algorithms are always required to alleviate the computational complexity of HEVC intra-frame coding. In this thesis, a fast intra-frame coding algorithm based on machine learning is proposed to predict CU decisions. Hence the computational complexity can be significantly reduced with negligible loss in the coding efficiency. Machine learning models like Bayes decision, Support Vector Machine (SVM) are used as decision makers while the Laplacian Transparent Composite Model (LPTCM) is selected as a feature extraction tool. In the main version of the proposed algorithm, a set of features named with Summation of Binarized Outlier Coefficients (SBOC) is extracted to train SVM models. An online training structure and a performance control method are introduced to enhance the robustness of decision makers. When applied on All Intra Main (AIM) full test and compared with HM 16.3, the main version of the proposed algorithm can achieve, on average, 48% time reduction with 0.78% BD-rate increase. Through adjusting parameter settings, the algorithm can change the trade-off between encoding time and coding efficiency, which can generate a performance curve to meet different requirements. By testing different methods on the same machine, the performance of proposed method has outperformed all CU decision based HEVC fast intra-frame algorithms in the benchmarks

    Fast Intra-frame Coding Algorithm for HEVC Based on TCM and Machine Learning

    Get PDF
    High Efficiency Video Coding (HEVC) is the latest video coding standard. Compared with the previous standard H.264/AVC, it can reduce the bit-rate by around 50% while maintaining the same perceptual quality. This performance gain on compression is achieved mainly by supporting larger Coding Unit (CU) size and more prediction modes. However, since the encoder needs to traverse all possible choices to mine out the best way of encoding data, this large flexibility on block size and prediction modes has caused a tremendous increase in encoding time. In HEVC, intra-frame coding is an important basis, and it is widely used in all configurations. Therefore, fast algorithms are always required to alleviate the computational complexity of HEVC intra-frame coding. In this thesis, a fast intra-frame coding algorithm based on machine learning is proposed to predict CU decisions. Hence the computational complexity can be significantly reduced with negligible loss in the coding efficiency. Machine learning models like Bayes decision, Support Vector Machine (SVM) are used as decision makers while the Laplacian Transparent Composite Model (LPTCM) is selected as a feature extraction tool. In the main version of the proposed algorithm, a set of features named with Summation of Binarized Outlier Coefficients (SBOC) is extracted to train SVM models. An online training structure and a performance control method are introduced to enhance the robustness of decision makers. When applied on All Intra Main (AIM) full test and compared with HM 16.3, the main version of the proposed algorithm can achieve, on average, 48% time reduction with 0.78% BD-rate increase. Through adjusting parameter settings, the algorithm can change the trade-off between encoding time and coding efficiency, which can generate a performance curve to meet different requirements. By testing different methods on the same machine, the performance of proposed method has outperformed all CU decision based HEVC fast intra-frame algorithms in the benchmarks

    Fast Algorithms for HEVC Rate-Distortion Optimization

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ (๋ฐ•์‚ฌ)-- ์„œ์šธ๋Œ€ํ•™๊ต ๋Œ€ํ•™์› : ์ „๊ธฐยท์ปดํ“จํ„ฐ๊ณตํ•™๋ถ€, 2013. 8. ์ดํ˜์žฌ.๋””์ง€ํ„ธ ์˜์ƒ ๊ธฐ๊ธฐ์˜ ๋ฐœ์ „๊ณผ ๋”๋ถˆ์–ด, ๊ณ ํ™”์งˆ ์˜์ƒ์— ๋Œ€ํ•œ ์ˆ˜์š” ๋˜ํ•œ ํ•จ๊ป˜ ์ฆ๊ฐ€ํ•˜๊ณ  ์žˆ๋‹ค. ์ตœ๊ทผ์˜ ์Šค๋งˆํŠธํฐ๊ณผ ํƒœ๋ธ”๋ฆฟ PC์˜ ๊ธ‰์†์ ์ธ ์„ฑ์žฅ์€ ์ด๋Ÿฌํ•œ ์ถ”์„ธ๋ฅผ ๊ฐ€์†ํ™” ์‹œํ‚ค๊ณ  ์žˆ๋‹ค. ์ด๋Ÿฌํ•œ ๋ณ€ํ™”์— ๋งž์ถ”์–ด, ๊ณ ํ™”์งˆ ์˜์ƒ ์••์ถ•์„ ์œ„ํ•œ ์ƒˆ๋กœ์šด ์˜์ƒ ์••์ถ• ๊ธฐ์ˆ ์˜ ํ‘œ์ค€ํ™”๊ฐ€ ISO/IEC MPEG๊ณผ ITU-T/VCEG์˜ ๊ณต๋™์˜ ํŒ€์œผ๋กœ ์ง„ํ–‰๋˜์–ด ์™”๋‹ค. HEVC๋Š” H.264/AVC์˜ ๋’ค๋ฅผ ์ž‡๋Š” ์ฐจ์„ธ๋Œ€ ์˜์ƒ ์••์ถ• ํ‘œ์ค€ ๊ธฐ์ˆ ๋กœ์„œ, 2013๋…„ 1์›” FDIS (Final Draft International Standard)๊ฐ€ ์ž‘์„ฑ๋˜๋ฉด์„œ, ํ‘œ์ค€ํ™” ๊ณผ์ •์ด ์™„๋ฃŒ๋˜์—ˆ๋‹ค. HEVC๋Š” H.264/AVC ๋Œ€๋น„ ๊ฐ™์€ ํ™”์งˆ์˜ ์˜์ƒ์„ ์ ˆ๋ฐ˜์˜ ๋น„ํŠธ๋Ÿ‰์œผ๋กœ ์••์ถ•ํ•˜๋Š” ๊ฒƒ์„ ๋ชฉํ‘œ๋กœ ํ•˜์˜€์œผ๋ฉฐ, ์ด๋Ÿฐ ๋ชฉํ‘œ๋ฅผ ๋‹ฌ์„ฑํ•˜๊ธฐ ์œ„ํ•ด, ์ƒˆ๋กœ์šด ๊ธฐ์ˆ ๋“ค์ด ์ œ์•ˆ๋˜์—ˆ๋‹ค. ํŠนํžˆ, ๋ณต์žกํ•œ block ๊ตฌ์กฐ์™€ ํฌ๊ฒŒ ๋Š˜์–ด๋‚œ mode์˜ ์ˆ˜๋Š” ์˜์ƒ ์••์ถ•์˜ ํšจ์œจ์„ ํ–ฅ์ƒ์‹œํ‚ค๋Š” ๋ฐ์— ํฌ๊ฒŒ ๊ธฐ์—ฌ๋ฅผ ํ•˜์˜€๊ณ , ์ด๋Š” ์ตœ์ ์˜ mode๋ฅผ ๊ฒฐ์ •ํ•˜๋Š” RDO (Rate-Distortion Optimization)๊ฐ€ ๋”์šฑ ์ค‘์š”ํ•œ ์—ญํ• ์„ ํ•˜๋„๋ก ๋งŒ๋“ค์—ˆ๋‹ค. ๊ทธ๋Ÿฌ๋‚˜, ๋ณต์žกํ•ด์ง„ block ๊ตฌ์กฐ๋Š” RDO์˜ ์—ฐ์‚ฐ๋Ÿ‰ ๋˜ํ•œ ํฌ๊ฒŒ ์ฆ๊ฐ€์‹œ์ผฐ๋‹ค. ์ด๋Ÿฌํ•œ ์ด์œ ๋กœ, H.264/AVC์™€ ๋‹ฌ๋ฆฌ HEVC์—์„œ๋Š” RDO์˜ ์—ฐ์‚ฐ๋Ÿ‰์„ ์ค„์ด๋ฉด์„œ ์••์ถ• ํšจ์œจ์„ ์œ ์ง€ํ•˜๋Š” ๊ฒƒ์ด ์ค‘์š”ํ•œ ์ด์Šˆ๊ฐ€ ๋˜์—ˆ๋‹ค. ๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š”, H.264/AVC์™€ HEVC์—์„œ์˜ RDO์— ์˜ํ•œ RD ์ €ํ•˜์˜ ์ฐจ์ด๋ฅผ ์‹คํ—˜ ๊ฒฐ๊ณผ๋ฅผ ํ†ตํ•ด ์ œ์‹œํ•˜์—ฌ ๋ฌธ์ œ๋ฅผ ์ •์˜ํ•˜๊ณ , RDO์˜ ์—ฐ์‚ฐ๋Ÿ‰์„ ์ค„์ด๋Š” ์•Œ๊ณ ๋ฆฌ์ฆ˜๋“ค์„ ์„ธ ๊ฐ€์ง€ ์—ฐ๊ตฌ ๋ฐฉํ–ฅ์„ ํ†ตํ•ด ์ œ์•ˆํ•˜์˜€๋‹ค. ์ฒซ ๋ฒˆ์งธ ๋ฐฉํ–ฅ์˜ ์—ฐ๊ตฌ์—์„œ๋Š” RDO์˜ ๊ณผ์ •์„ ๊ตฌ์„ฑํ•˜๋Š” Transform, Quantization, Inverse Quantization, Inverse Transform ๊ทธ๋ฆฌ๊ณ  Entropy Coder ๋“ฑ์˜ ์ผ๋ จ์˜ ๊ณผ์ •์˜ ์—ฐ์‚ฐ์„ ๋‹จ์ˆœํ™”ํ•˜๋Š” ์•Œ๊ณ ๋ฆฌ์ฆ˜๋“ค์ด ์ œ์•ˆ๋˜์—ˆ๋‹ค. ์ด๋Ÿฌํ•œ ์•Œ๊ณ ๋ฆฌ์ฆ˜์€ ๊ธฐ๋ณธ์ ์œผ๋กœ H.264/AVC์—์„œ ์ด๋ฃจ์–ด์ง„ ์—ฐ๊ตฌ๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ํ•˜์˜€๊ณ , ๊ธฐ์กด ์•Œ๊ณ ๋ฆฌ์ฆ˜์˜ ํ•œ๊ณ„ ๋˜ํ•œ ๋ถ„์„๋˜์–ด ์„ฑ๋Šฅ์„ ํ–ฅ์ƒ์‹œ์ผฐ๋‹ค. ๋” ๋‚˜์•„๊ฐ€์„œ๋Š”, ์ข€ ๋” ๊ณต๊ฒฉ์ ์œผ๋กœ RDO์˜ ์—ฐ์‚ฐ๋Ÿ‰์„ ์ค„์ผ ์ˆ˜ ์žˆ๋Š” ์ƒˆ๋กœ์šด ๋ฐฉ๋ฒ•์„ ์ œ์•ˆํ•˜์˜€๋‹ค. ๋‘ ๋ฒˆ์งธ ๋ฐฉํ–ฅ์˜ ์—ฐ๊ตฌ์—์„œ๋Š” Zero Block detection์ด๋ผ๋Š” ๊ธฐ์ˆ ์„ ๊ธฐ๋ฐ˜์œผ๋กœ, HEVC์— ์ ํ•ฉํ•˜๊ฒŒ RDO์˜ ์—ฐ์‚ฐ์„ ์ค„์ด๋Š” ๋ฐฉ๋ฒ•์„ ์ œ์•ˆํ•˜์˜€๋‹ค. H.264/AVC์—์„œ ์ œ์•ˆ๋˜์—ˆ๋˜ ์•Œ๊ณ ๋ฆฌ์ฆ˜๋“ค์€ HEVC์—์„œ์˜ Zero Block์„ ํŠน์ง•์„ ์ œ๋Œ€๋กœ ๋ฐ˜์˜ํ•˜์ง€ ๋ชปํ•˜๊ธฐ ๋•Œ๋ฌธ์—, ๋‹จ์ˆœ ์ˆ˜์ •์„ ํ†ตํ•ด HEVC์— ์ ์šฉํ•  ๊ฒฝ์šฐ ๊ธฐ๋Œ€ํ•œ ๋งŒํผ์˜ ์„ฑ๋Šฅ์„ ์–ป์„ ์ˆ˜ ์—†๋‹ค. ์ด๋Ÿฌํ•œ ํ•œ๊ณ„์ ์„ ํ•ด๊ฒฐํ•˜์—ฌ HEVC์— ์ ํ•ฉํ•œ ํšจ์œจ์ ์ธ Zero Block detection ์•Œ๊ณ ๋ฆฌ์ฆ˜์ด ์ œ์‹œ๋˜์—ˆ๋‹ค. ์„ธ ๋ฒˆ์งธ ๋ฐฉํ–ฅ์˜ ์—ฐ๊ตฌ์—์„œ๋Š”, SATD ๊ธฐ๋ฐ˜์˜ RDO๋ฅผ ํ™œ์šฉํ•˜์—ฌ, SSE ๊ธฐ๋ฐ˜์˜ RDO์˜ ์—ฐ์‚ฐ๋Ÿ‰์„ ์ค„์ด๋Š” ๋ฐฉ๋ฒ•์„ ์ œ์•ˆํ•˜์˜€๋‹ค. SATD ๊ธฐ๋ฐ˜์˜ RDO์™€ SSE ๊ธฐ๋ฐ˜์˜ RDO์˜ ์ฐจ์ด์  ๋ถ„์„๊ณผ ์‹คํ—˜ ๊ฒฐ๊ณผ ๋ฐ”ํƒ•์œผ๋กœ ํšจ์œจ์ ์œผ๋กœ SATD ๊ธฐ๋ฐ˜์˜ RDO์„ ํ™œ์šฉํ•˜๋Š” ๋ฐฉ๋ฒ•์ด ์ œ์‹œ๋˜์—ˆ๋‹ค. ์ด๋ ‡๊ฒŒ ์ œ์•ˆ๋œ ์•Œ๊ณ ๋ฆฌ์ฆ˜๋“ค์€ HEVC์˜ reference software์ธ HM์— ๊ตฌํ˜„๋˜์–ด, RDO์˜ ์—ฐ์‚ฐ๋Ÿ‰์„ ํฌ๊ฒŒ ์ค„์ด๋ฉด์„œ๋„, RD ์ €ํ•˜๊ฐ€ ํฌ๊ฒŒ ์ฆ๊ฐ€ํ•˜์ง€ ์•Š๋Š” ์‹คํ—˜ ๊ฒฐ๊ณผ๋ฅผ ๋ณด์ด๊ณ  ์žˆ๋‹ค.์ดˆ๋ก iii ๋ชฉ์ฐจ v ํ‘œ ๋ชฉ์ฐจ viii ๊ทธ๋ฆผ ๋ชฉ์ฐจ x ์ œ 1 ์žฅ ์„œ๋ก  1 1.1 ์—ฐ๊ตฌ ๋ฐฐ๊ฒฝ 1 1.2 ์—ฐ๊ตฌ ๋‚ด์šฉ 3 1.3 ๋…ผ๋ฌธ ๊ตฌ์„ฑ 6 ์ œ 2 ์žฅ ๋ฐฐ๊ฒฝ์ง€์‹๊ณผ ์ด์ „ ์—ฐ๊ตฌ 7 2.1 ๋ฐฐ๊ฒฝ์ง€์‹ 7 2.2 ์ด์ „ ์—ฐ๊ตฌ 15 ์ œ 3 ์žฅ Simplified RDO 19 3.1 Simplified SSE 19 3.2 Simplified CABAC 24 3.2.1 CABAC์˜ ๊ตฌ์กฐ 24 3.2.2 Various Complexity CABAC 25 3.2.2.1 High-Complexity CABAC 25 3.2.2.2 Medium-Complexity CABAC 26 3.2.2.3 Low-Complexity CABAC 22 3.2.2.4 Evaluation of Various Complexity CABAC 29 3.2.3 Low-Complexity CABAC for HEVC 30 3.3 Advanced Simplified SSE & CABAC 37 3.3.1 Threshold Algorithm 37 3.3.2 Simplified SSE & CABAC without Transform 41 3.4 Evaluation 48 ์ œ 4 ์žฅ Zero Block Detection 51 4.1 Extension of H.264/AVC Zero Block Detection for HEVC 51 4.1.1 Characteristics of the zero blocks in HEVC 51 4.1.2 ZB detection by an extension of the H.264/AVC algorithm 54 4.2 Zreo Block Detection for HEVC 59 4.2.1 GZB Detection for 16x16 and 32x32 transforms 59 4.2.2 Relaxed conditions for PZB detection 62 4.2.3 Further complexity reduction with SAD(or SATD) test 65 4.2.4 Proposed ZB detection for HEVC 72 4.3 Evaluation 74 ์ œ 5 ์žฅ SATD based RDO EVALUATION 84 5.1 Difference between SSE based RDO and SATD based RDO 84 5.2 SATD based RDO Evaluation for HEVC 88 5.3 Evaluation 93 ์ œ 6 ์žฅ ๊ฒฐ๋ก  95Docto
    corecore