65 research outputs found

    Efficient algorithms for scalable video coding

    Get PDF
    A scalable video bitstream specifically designed for the needs of various client terminals, network conditions, and user demands is much desired in current and future video transmission and storage systems. The scalable extension of the H.264/AVC standard (SVC) has been developed to satisfy the new challenges posed by heterogeneous environments, as it permits a single video stream to be decoded fully or partially with variable quality, resolution, and frame rate in order to adapt to a specific application. This thesis presents novel improved algorithms for SVC, including: 1) a fast inter-frame and inter-layer coding mode selection algorithm based on motion activity; 2) a hierarchical fast mode selection algorithm; 3) a two-part Rate Distortion (RD) model targeting the properties of different prediction modes for the SVC rate control scheme; and 4) an optimised Mean Absolute Difference (MAD) prediction model. The proposed fast inter-frame and inter-layer mode selection algorithm is based on the empirical observation that a macroblock (MB) with slow movement is more likely to be best matched by one in the same resolution layer. However, for a macroblock with fast movement, motion estimation between layers is required. Simulation results show that the algorithm can reduce the encoding time by up to 40%, with negligible degradation in RD performance. The proposed hierarchical fast mode selection scheme comprises four levels and makes full use of inter-layer, temporal and spatial correlation aswell as the texture information of each macroblock. Overall, the new technique demonstrates the same coding performance in terms of picture quality and compression ratio as that of the SVC standard, yet produces a saving in encoding time of up to 84%. Compared with state-of-the-art SVC fast mode selection algorithms, the proposed algorithm achieves a superior computational time reduction under very similar RD performance conditions. The existing SVC rate distortion model cannot accurately represent the RD properties of the prediction modes, because it is influenced by the use of inter-layer prediction. A separate RD model for inter-layer prediction coding in the enhancement layer(s) is therefore introduced. Overall, the proposed algorithms improve the average PSNR by up to 0.34dB or produce an average saving in bit rate of up to 7.78%. Furthermore, the control accuracy is maintained to within 0.07% on average. As aMADprediction error always exists and cannot be avoided, an optimisedMADprediction model for the spatial enhancement layers is proposed that considers the MAD from previous temporal frames and previous spatial frames together, to achieve a more accurateMADprediction. Simulation results indicate that the proposedMADprediction model reduces the MAD prediction error by up to 79% compared with the JVT-W043 implementation

    MIMO無線伝送に適したスケーラブルビデオコーディングに関する研究

    Get PDF
    Because of the COVID-19 pandemic, a new normal has taken over. It affects the higher demand for using video traffic. H.264/SVC is the video compression standard with several advantages compared with the previous standard, such as a smaller storage space and scalability of video quality depending on network quality. The H.264/SVC bitstream includes one base layer (BL), the most important layer, and one or more enhancement layers (EL) which can be leveraged to optimize the video scalability depending on the network condition and user preferences. The method of transmission is powerful as the video coding method. The transmission of the good video quality will not be effective without a suitable transmission method. In this thesis, we study and research the H.264 scalable video coding transmission with IEEE 802.11ac standard MIMO wireless transmission. We focus on the suitable transmission method for H.264/SVC in a different environment. We divide the research focusing on two issues: 1. With the difference channel environment: The suitable H.264/SVC transmission technique in IEEE 802.11ac with the specific quantization parameter of video encoding was proposed. This aim is to compare three techniques in IEEE 802.11ac: STBC, SISO, and MIMO. In this focus, only the accuracy of the video was considered to measure the efficiency of the transmission technique. This part proposed to utilize STBC to improve the quality of H.264/SVC video transmission. We have shown the performance of H.264/SVC video transmission with three multiple antenna techniques. The results show that STBC is the best technique for H.264/SVC transmission under a low-quality channel environment. The best result shows that STBC in channel model D can improve the PSNR by 67 percent and 76 percent compared with SISO and MIMO, respectively, at low SNR of 20 dB. Due to STBC transmitting multiple copies of data, it can increase data reliability. We proved that STBC is the most suitable multiple antenna technique to improve the quality and realizability of video transmission in both PSNR and bit error rate (BER). 2. With the different transmission distance: H.264/SVC video transmission on MIMO with RSSI feedback was proposed. This aim to proposes the allocation of packetization in the transmission packet and the compromising of quantization parameter encoding both vary on the channel efficiency. This part proposed a MIMO transmission system for H.264 scalable video coding that does not require full CSI feedback. Instead of the CSI feedback, we have used the RSSI and table of encoding rules obtained via link simulation in MATLAB. The encoding rule takes the form of the encoding ratio between the base and enhancement layer, which was done by adjusting the quantization parameter. This proposed system has been shown to improve the PSNR by at least 16 dB and increase the effective distance of 6 meters above compared with the conventional method.九州工業大学博士学位論文 学位記番号:情工博甲第372号 学位授与年月日:令和4年12月27日1 Introduction|2 Video Transmission System Overview|3 H.264/SVC Video Transmission by IEEE 802.11ac Techniques|4 H.264/SVC Video Transmission on MIMO with RSSI Feedback|5 Conclusion and Future Work九州工業大学令和4年

    Compressed-domain transcoding of H.264/AVC and SVC video streams

    Get PDF

    R-D Optimal Scalable Video Coding Using Soft Decision Quantization

    Get PDF
    In this thesis, we study the concept of scalable video coding as implemented in the extension to the H.264 video coding standard. Specifically, for the spatial and quality scalability scenarios, we propose an optimization algorithm based on the Soft Decision Quantization (SDQ) concept, which aims at jointly optimizing all layers being encoded. The performance of the algorithm was assessed in an SVC implementation. Experimental results show, that the proposed method significantly improves the coding efficiency when compared to an unmodified SVC encoder

    A parallel H.264/SVC encoder for high definition video conferencing

    Get PDF
    In this paper we present a video encoder specially developed and configured for high definition (HD) video conferencing. This video encoder brings together the following three requirements: H.264/Scalable Video Coding (SVC), parallel encoding on multicore platforms, and parallel-friendly rate control. With the first requirement, a minimum quality of service to every end-user receiver over Internet Protocol networks is guaranteed. With the second one, real-time execution is accomplished and, for this purpose, slice-level parallelism, for the main encoding loop, and block-level parallelism, for the upsampling and interpolation filtering processes, are combined. With the third one, a proper HD video content delivery under certain bit rate and end-to-end delay constraints is ensured. The experimental results prove that the proposed H.264/SVC video encoder is able to operate in real time over a wide range of target bit rates at the expense of reasonable losses in rate-distortion efficiency due to the frame partitioning into slices

    Low-complexity scalable and multiview video coding

    Get PDF

    Motion Scalability for Video Coding with Flexible Spatio-Temporal Decompositions

    Get PDF
    PhDThe research presented in this thesis aims to extend the scalability range of the wavelet-based video coding systems in order to achieve fully scalable coding with a wide range of available decoding points. Since the temporal redundancy regularly comprises the main portion of the global video sequence redundancy, the techniques that can be generally termed motion decorrelation techniques have a central role in the overall compression performance. For this reason the scalable motion modelling and coding are of utmost importance, and specifically, in this thesis possible solutions are identified and analysed. The main contributions of the presented research are grouped into two interrelated and complementary topics. Firstly a flexible motion model with rateoptimised estimation technique is introduced. The proposed motion model is based on tree structures and allows high adaptability needed for layered motion coding. The flexible structure for motion compensation allows for optimisation at different stages of the adaptive spatio-temporal decomposition, which is crucial for scalable coding that targets decoding on different resolutions. By utilising an adaptive choice of wavelet filterbank, the model enables high compression based on efficient mode selection. Secondly, solutions for scalable motion modelling and coding are developed. These solutions are based on precision limiting of motion vectors and creation of a layered motion structure that describes hierarchically coded motion. The solution based on precision limiting relies on layered bit-plane coding of motion vector values. The second solution builds on recently established techniques that impose scalability on a motion structure. The new approach is based on two major improvements: the evaluation of distortion in temporal Subbands and motion search in temporal subbands that finds the optimal motion vectors for layered motion structure. Exhaustive tests on the rate-distortion performance in demanding scalable video coding scenarios show benefits of application of both developed flexible motion model and various solutions for scalable motion coding
    corecore