Search CORE

65 research outputs found

Efficient algorithms for scalable video coding

Author: Lu Xin (Researcher in Computer science)
Publication venue
Publication date
Field of study

A scalable video bitstream specifically designed for the needs of various client terminals, network conditions, and user demands is much desired in current and future video transmission and storage systems. The scalable extension of the H.264/AVC standard (SVC) has been developed to satisfy the new challenges posed by heterogeneous environments, as it permits a single video stream to be decoded fully or partially with variable quality, resolution, and frame rate in order to adapt to a specific application. This thesis presents novel improved algorithms for SVC, including: 1) a fast inter-frame and inter-layer coding mode selection algorithm based on motion activity; 2) a hierarchical fast mode selection algorithm; 3) a two-part Rate Distortion (RD) model targeting the properties of different prediction modes for the SVC rate control scheme; and 4) an optimised Mean Absolute Difference (MAD) prediction model. The proposed fast inter-frame and inter-layer mode selection algorithm is based on the empirical observation that a macroblock (MB) with slow movement is more likely to be best matched by one in the same resolution layer. However, for a macroblock with fast movement, motion estimation between layers is required. Simulation results show that the algorithm can reduce the encoding time by up to 40%, with negligible degradation in RD performance. The proposed hierarchical fast mode selection scheme comprises four levels and makes full use of inter-layer, temporal and spatial correlation aswell as the texture information of each macroblock. Overall, the new technique demonstrates the same coding performance in terms of picture quality and compression ratio as that of the SVC standard, yet produces a saving in encoding time of up to 84%. Compared with state-of-the-art SVC fast mode selection algorithms, the proposed algorithm achieves a superior computational time reduction under very similar RD performance conditions. The existing SVC rate distortion model cannot accurately represent the RD properties of the prediction modes, because it is influenced by the use of inter-layer prediction. A separate RD model for inter-layer prediction coding in the enhancement layer(s) is therefore introduced. Overall, the proposed algorithms improve the average PSNR by up to 0.34dB or produce an average saving in bit rate of up to 7.78%. Furthermore, the control accuracy is maintained to within 0.07% on average. As aMADprediction error always exists and cannot be avoided, an optimisedMADprediction model for the spatial enhancement layers is proposed that considers the MAD from previous temporal frames and previous spatial frames together, to achieve a more accurateMADprediction. Simulation results indicate that the proposedMADprediction model reduces the MAD prediction error by up to 79% compared with the JVT-W043 implementation

Warwick Research Archives Portal Repository

A novel early SKIP mode detection method for coarse grain quality in H.264/SVC

Author: A Eleftheriadis
B Lee
B Lee
C Kannangara
C Kim
C Park
D Wu
G Bjontegaard
G Sullivan
H Li
H Lin
H Lin
H Schwarz
Hao Zhang
I Choi
J Lee
L Shen
L Shen
M Wien
S Jung
S Na
T Katayama
T Katayama
T Zhao
T Zhao
Xiaoyu Zhu
Xuan He
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Recommended from our members

Scalable and network aware video coding for advanced communications over heterogeneous networks

Author: Muhammad Sanusi
Publication venue: Brunel University School of Engineering and Design PhD Theses
Publication date: 01/01/2013
Field of study

This thesis was submitted for the degree of Doctor of Philosophy and was awarded by Brunel UniversityThis work addresses the issues concerned with the provision of scalable video services over heterogeneous networks particularly with regards to dynamic adaptation and user’s acceptable quality of service. In order to provide and sustain an adaptive and network friendly multimedia communication service, a suite of techniques that achieved automatic scalability and adaptation are developed. These techniques are evaluated objectively and subjectively to assess the Quality of Service (QoS) provided to diverse users with variable constraints and dynamic resources. The research ensured the consideration of various levels of user acceptable QoS The techniques are further evaluated with view to establish their performance against state of the art scalable and non-scalable techniques. To further improve the adaptability of the designed techniques, several experiments and real time simulations are conducted with the aim of determining the optimum performance with various coding parameters and scenarios. The coding parameters and scenarios are evaluated and analyzed to determine their performance using various types of video content and formats. Several algorithms are developed to provide a dynamic adaptation of coding tools and parameters to specific video content type, format and bandwidth of transmission. Due to the nature of heterogeneous networks where channel conditions, terminals, users capabilities and preferences etc are unpredictably changing, hence limiting the adaptability of a specific technique adopted, a Dynamic Scalability Decision Making Algorithm (SADMA) is developed. The algorithm autonomously selects one of the designed scalability techniques basing its decision on the monitored and reported channel conditions. Experiments were conducted using a purpose-built heterogeneous network simulator and the network-aware selection of the scalability techniques is based on real time simulation results. A technique with a minimum delay, low bit-rate, low frame rate and low quality is adopted as a reactive measure to a predicted bad channel condition. If the use of the techniques is not favoured due to deteriorating channel conditions reported, a reduced layered stream or base layer is used. If the network status does not allow the use of the base layer, then the stream uses parameter identifiers with high efficiency to improve the scalability and adaptation of the video service. To further improve the flexibility and efficiency of the algorithm, a dynamic de-blocking filter and lambda value selection are analyzed and introduced in the algorithm. Various methods, interfaces and algorithms are defined for transcoding from one technique to another and extracting sub-streams when the network conditions do not allow for the transmission of the entire bit-stream

Brunel University Research Archive

MIMO無線伝送に適したスケーラブルビデオコーディングに関する研究

Author: Manthana Tiawongsuwan
Publication venue: 尾知, 博
Publication date: 17/02/2023
Field of study

Because of the COVID-19 pandemic, a new normal has taken over. It affects the higher demand for using video traffic. H.264/SVC is the video compression standard with several advantages compared with the previous standard, such as a smaller storage space and scalability of video quality depending on network quality. The H.264/SVC bitstream includes one base layer (BL), the most important layer, and one or more enhancement layers (EL) which can be leveraged to optimize the video scalability depending on the network condition and user preferences. The method of transmission is powerful as the video coding method. The transmission of the good video quality will not be effective without a suitable transmission method. In this thesis, we study and research the H.264 scalable video coding transmission with IEEE 802.11ac standard MIMO wireless transmission. We focus on the suitable transmission method for H.264/SVC in a different environment. We divide the research focusing on two issues: 1. With the difference channel environment: The suitable H.264/SVC transmission technique in IEEE 802.11ac with the specific quantization parameter of video encoding was proposed. This aim is to compare three techniques in IEEE 802.11ac: STBC, SISO, and MIMO. In this focus, only the accuracy of the video was considered to measure the efficiency of the transmission technique. This part proposed to utilize STBC to improve the quality of H.264/SVC video transmission. We have shown the performance of H.264/SVC video transmission with three multiple antenna techniques. The results show that STBC is the best technique for H.264/SVC transmission under a low-quality channel environment. The best result shows that STBC in channel model D can improve the PSNR by 67 percent and 76 percent compared with SISO and MIMO, respectively, at low SNR of 20 dB. Due to STBC transmitting multiple copies of data, it can increase data reliability. We proved that STBC is the most suitable multiple antenna technique to improve the quality and realizability of video transmission in both PSNR and bit error rate (BER). 2. With the different transmission distance: H.264/SVC video transmission on MIMO with RSSI feedback was proposed. This aim to proposes the allocation of packetization in the transmission packet and the compromising of quantization parameter encoding both vary on the channel efficiency. This part proposed a MIMO transmission system for H.264 scalable video coding that does not require full CSI feedback. Instead of the CSI feedback, we have used the RSSI and table of encoding rules obtained via link simulation in MATLAB. The encoding rule takes the form of the encoding ratio between the base and enhancement layer, which was done by adjusting the quantization parameter. This proposed system has been shown to improve the PSNR by at least 16 dB and increase the effective distance of 6 meters above compared with the conventional method.九州工業大学博士学位論文学位記番号：情工博甲第372号学位授与年月日：令和4年12月27日1 Introduction|2 Video Transmission System Overview|3 H.264/SVC Video Transmission by IEEE 802.11ac Techniques|4 H.264/SVC Video Transmission on MIMO with RSSI Feedback|5 Conclusion and Future Work九州工業大学令和4年

Kyutacar : Kyushu Institute of Technology Academic Repository

Compressed-domain transcoding of H.264/AVC and SVC video streams

Author: De Cock Jan
Publication venue: Ghent University. Faculty of Engineering
Publication date: 01/01/2009
Field of study

Ghent University Academic Bibliography

R-D Optimal Scalable Video Coding Using Soft Decision Quantization

Author: Hebel Krzysztof Michal
Publication venue: 'University of Waterloo'
Publication date: 17/11/2010
Field of study

In this thesis, we study the concept of scalable video coding as implemented in the extension to the H.264 video coding standard. Specifically, for the spatial and quality scalability scenarios, we propose an optimization algorithm based on the Soft Decision Quantization (SDQ) concept, which aims at jointly optimizing all layers being encoded. The performance of the algorithm was assessed in an SVC implementation. Experimental results show, that the proposed method significantly improves the coding efficiency when compared to an unmodified SVC encoder

University of Waterloo's Institutional Repository

A parallel H.264/SVC encoder for high definition video conferencing

Author: Mayer Tobias
Sanz-Rodríguez Sergio
Schierl Tobias
Álvarez-Mesa Mauricio
Publication venue
Publication date: 01/01/2015
Field of study

In this paper we present a video encoder specially developed and configured for high definition (HD) video conferencing. This video encoder brings together the following three requirements: H.264/Scalable Video Coding (SVC), parallel encoding on multicore platforms, and parallel-friendly rate control. With the first requirement, a minimum quality of service to every end-user receiver over Internet Protocol networks is guaranteed. With the second one, real-time execution is accomplished and, for this purpose, slice-level parallelism, for the main encoding loop, and block-level parallelism, for the upsampling and interpolation filtering processes, are combined. With the third one, a proper HD video content delivery under certain bit rate and end-to-end delay constraints is ensured. The experimental results prove that the proposed H.264/SVC video encoder is able to operate in real time over a wide range of target bit rates at the expense of reasonable losses in rate-distortion efficiency due to the frame partitioning into slices

DepositOnce

Low-complexity scalable and multiview video coding

Author: Van Leuven Sebastiaan
Publication venue: Ghent University, Department of Electronics and information systems
Publication date: 01/01/2013
Field of study

Ghent University Academic Bibliography

Motion Scalability for Video Coding with Flexible Spatio-Temporal Decompositions

Author: Mrak Marta
Publication venue
Publication date: 01/01/2007
Field of study

PhDThe research presented in this thesis aims to extend the scalability range of the wavelet-based video coding systems in order to achieve fully scalable coding with a wide range of available decoding points. Since the temporal redundancy regularly comprises the main portion of the global video sequence redundancy, the techniques that can be generally termed motion decorrelation techniques have a central role in the overall compression performance. For this reason the scalable motion modelling and coding are of utmost importance, and specifically, in this thesis possible solutions are identified and analysed. The main contributions of the presented research are grouped into two interrelated and complementary topics. Firstly a flexible motion model with rateoptimised estimation technique is introduced. The proposed motion model is based on tree structures and allows high adaptability needed for layered motion coding. The flexible structure for motion compensation allows for optimisation at different stages of the adaptive spatio-temporal decomposition, which is crucial for scalable coding that targets decoding on different resolutions. By utilising an adaptive choice of wavelet filterbank, the model enables high compression based on efficient mode selection. Secondly, solutions for scalable motion modelling and coding are developed. These solutions are based on precision limiting of motion vectors and creation of a layered motion structure that describes hierarchically coded motion. The solution based on precision limiting relies on layered bit-plane coding of motion vector values. The second solution builds on recently established techniques that impose scalability on a motion structure. The new approach is based on two major improvements: the evaluation of distortion in temporal Subbands and motion search in temporal subbands that finds the optimal motion vectors for layered motion structure. Exhaustive tests on the rate-distortion performance in demanding scalable video coding scenarios show benefits of application of both developed flexible motion model and various solutions for scalable motion coding

Queen Mary Research Online

OpenGrey Repository