39 research outputs found

    Efficient Coding Tree Unit (CTU) Decision Method for Scalable High-Efficiency Video Coding (SHVC) Encoder

    Get PDF
    High-efficiency video coding (HEVC or H.265) is the latest video compression standard developed by the joint collaborative team on video coding (JCT-VC), finalized in 2013. HEVC can achieve an average bit rate decrease of 50% in comparison with H.264/AVC while still maintaining video quality. To upgrade the HEVC used in heterogeneous access networks, the JVT-VC has been approved scalable extension of HEVC (SHVC) in July 2014. The SHVC can achieve the highest coding efficiency but requires a very high computational complexity such that its real-time application is limited. To reduce the encoding complexity of SHVC, in this chapter, we employ the temporal-spatial and inter-layer correlations between base layer (BL) and enhancement layer (EL) to predict the best quadtree of coding tree unit (CTU) for quality SHVC. Due to exist a high correlation between layers, we utilize the coded information from the CTU quadtree in BL, including inter-layer intra/residual prediction and inter-layer motion parameter prediction, to predict the CTU quadtree in EL. Therefore, we develop an efficient CTU decision method by combing temporal-spatial searching order algorithm (TSSOA) in BL and a fast inter-layer searching algorithm (FILSA) in EL to speed up the encoding process of SHVC. The simulation results show that the proposed efficient CTU decision method can achieve an average time improving ratio (TIR) about 52–78% and 47–69% for low delay (LD) and random access (RA) configurations, respectively. It is clear that the proposed method can efficiently reduce the computational complexity of SHVC encoder with negligible loss of coding efficiency with various types of video sequences

    Efficient high-resolution video compression scheme using background and foreground layers

    Get PDF
    Video coding using dynamic background frame achieves better compression compared to the traditional techniques by encoding background and foreground separately. This process reduces coding bits for the overall frame significantly; however, encoding background still requires many bits that can be compressed further for achieving better coding efficiency. The cuboid coding framework has been proven to be one of the most effective methods of image compression which exploits homogeneous pixel correlation within a frame and has better alignment with object boundary compared to traditional block-based coding. In a video sequence, the cuboid-based frame partitioning varies with the changes of the foreground. However, since the background remains static for a group of pictures, the cuboid coding exploits better spatial pixel homogeneity. In this work, the impact of cuboid coding on the background frame for high-resolution videos (Ultra-High-Definition (UHD) and 360-degree videos) is investigated using the multilayer framework of SHVC. After the cuboid partitioning, the method of coarse frame generation has been improved with a novel idea by keeping human-visual sensitive information. Unlike the traditional SHVC scheme, in the proposed method, cuboid coded background and the foreground are encoded in separate layers in an implicit manner. Simulation results show that the proposed video coding method achieves an average BD-Rate reduction of 26.69% and BD-PSNR gain of 1.51 dB against SHVC with significant encoding time reduction for both UHD and 360 videos. It also achieves an average of 13.88% BD-Rate reduction and 0.78 dB BD-PSNR gain compared to the existing relevant method proposed by X. Hoang Van. © 2013 IEEE

    EMB: Efficient Multimedia Broadcast in Multi-tier Mobile Networks

    Get PDF
    Multimedia broadcast and multicast services (MBMS) in mobile networks has been widely addressed, however an investigation of such a technology in emerging, multi-tier, scenarios is still lacking. Notably, user clustering and resource allocation are extremely challenging in multi-tier networks, and imperative to maximize system capacity and improve quality of user-experience (QoE) in MBMS. Thus, in this paper we propose a clustering and resource allocation approach, named EMB, which specifically addresses heterogeneous networks and accounts for the fact that multimedia content is adaptively encoded into scalable layers depending on the QoE requirements and channel conditions of the heterogeneous users. Importantly, we prove that our clustering algorithm yields Pareto efficient broadcasting areas, multimedia encoding parameters, and re- source allocation, in a way that is also fair to the users. Fur- thermore, numerical results obtained under realistic conditions and using real-world video content, show that the proposed EMB results in lower churn count (i.e., higher number of served users), higher throughput, and increased QoE, while using fewer network resources

    Fast Mode Assignment for Quality Scalable Extension of the High Efficiency Video Coding (HEVC) Standard: A Bayesian Approach

    Get PDF
    ABSTRACT The new compression standard, known as the High Efficiency Video Coding (HEVC), aims at significantly improving the compression efficiency compared to previous standards. There has been significant interest in developing a scalable version of this standard. As expected, the HEVC scalable video version, which is called SHVC, increases the complexity of the codec compared to the non-scalable counterpart. In this paper, we propose an adaptive fast mode assigning method based on a Bayesian classifier that reduces SHVC's coding complexity by up to 68.55%, while maintaining the overall quality and bit-rates

    Análise do HEVC escalável : desempenho e controlo de débito

    Get PDF
    Mestrado em Engenharia Eletrónica e TelecomunicaçõesEsta dissertação apresenta um estudo da norma de codificação de vídeo de alta eficiência (HEVC) e a sua extensão para vídeo escalável, SHVC. A norma de vídeo SHVC proporciona um melhor desempenho quando codifica várias camadas em simultâneo do que quando se usa o codificador HEVC numa configuração simulcast. Ambos os codificadores de referência, tanto para a camada base como para a camada superior usam o mesmo modelo de controlo de débito, modelo R-λ, que foi otimizado para o HEVC. Nenhuma otimização de alocação de débito entre camadas foi até ao momento proposto para o modelo de testes (SHM 8) para a escalabilidade do HEVC (SHVC). Derivamos um novo modelo R-λ apropriado para a camada superior e para o caso de escalabilidade espacial, que conduziu a um ganho de BD-débito de 1,81% e de BD-PSNR de 0,025 em relação ao modelo de débito-distorção existente no SHM do SHVC. Todavia, mostrou-se também nesta dissertação que o proposto modelo de R-λ não deve ser usado na camada inferior (camada base) no SHVC e por conseguinte no HEVC.This dissertation provides a study of the High Efficiency Video Coding standard (HEVC) and its scalable extension, SHVC. The SHVC provides a better performance when encoding several layers simultaneously than using an HEVC encoder in a simulcast configuration. Both reference encoders, in the base layer and in the enhancement layer use the same rate control model, R-λ model, which was optimized for HEVC. No optimal bitrate partitioning amongst layers is proposed in scalable HEVC (SHVC) test model (SHM 8). We derived a new R-λ model for the enhancement layer and for the spatial case which led to a DB-rate gain of 1.81% and DB-PSNR gain of 0.025 in relation to the rate-distortion model of SHM-SHVC. Nevertheless, we also show in this dissertation that the proposed model of R-λ should not be used neither in the base layer nor in HEVC

    Reducing the complexity of a multiview H.264/AVC and HEVC hybrid architecture

    Get PDF
    With the advent of 3D displays, an efficient encoder is required to compress the video information needed by them. Moreover, for gradual market acceptance of this new technology, it is advisable to offer backward compatibility with existing devices. Thus, a multiview H.264/Advance Video Coding (AVC) and High Efficiency Video Coding (HEVC) hybrid architecture was proposed in the standardization process of HEVC. However, it requires long encoding times due to the use of HEVC. With the aim of tackling this problem, this paper presents an algorithm that reduces the complexity of this hybrid architecture by reducing the encoding complexity of the HEVC views. By using Na < ve-Bayes classifiers, the proposed technique exploits the information gathered in the encoding of the H.264/AVC view to make decisions on the splitting of coding units in HEVC side views. Given the novelty of the proposal, the only similar work found in the literature is an unoptimized version of the algorithm presented here. Experimental results show that the proposed algorithm can achieve a good tradeoff between coding efficiency and complexity

    Layered Division Multiplexing With Multi-Radio-Frequency Channel Technologies

    Full text link
    "(c) 2015 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other users, including reprinting/ republishing this material for advertising or promotional purposes, creating new collective works for resale or redistribution to servers or lists, or reuse of any copyrighted components of this work in other works.")The advanced television system committee (ATSC) is to release the next-generation U.S. digital terrestrial television standard, known as ATSC 3.0. Layered division multiplexing (LDM) is one of the new physical layer technologies included in the standard, which enables the efficient provision of mobile and fixed services by superposing two independent signals with different power levels. ATSC 3.0 has also adopted a novel transmission technique known as channel bonding (CB), which splits the data of a service into two sub-streams that are modulated and transmitted over two radio-frequency (RF) channels. This paper investigates the potential use cases, implementation aspects, and performance advantages, for combining LDM with CB and also with the multi-RF channel technology time frequency slicing (TFS) introduced in digital video broadcasting - terrestrial second generation (DVB-T2) (as an informative annex) and digital video broadcasting - next generation handheld (DVB-NGH) which allows distributing the data of a service across two or more RF channels by means of time slicing and frequency hopping.Parts of this paper have been published in the Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, Ghent, Belgium, in 2015. This work was supported by the ICT Research and Development Program of MSIP/IITP. [R0101-15-294, Development of Service and Transmission Technology for Convergent Realistic Broadcast.]Garro Crevillén, E.; Gimenez Gandia, JJ.; Park, SI.; Gómez Barquero, D. (2016). Layered Division Multiplexing With Multi-Radio-Frequency Channel Technologies. IEEE Transactions on Broadcasting. 62(2):365-374. doi:10.1109/TBC.2015.2492474S36537462

    Advanced Layered Divsion Multiplexing Technologies for Next-Gen Broadcast

    Full text link
    Tesis por compendioDesde comienzos del siglo XXI, los sistemas de radiodifusión terrestre han sido culpados de un uso ineficiente del espectro asignado. Para aumentar la eficiencia espectral, los organismos de estandarización de TV digital comenzaron a desarrollar la evolución técnica de los sistemas de TDT de primera generación. Entre otros, uno de los objetivos principales de los sistemas de TDT de próxima generación (DVB-T2 y ATSC 3.0) es proporcionar simultáneamente servicios de TV a dispositivos móviles y fijos. El principal inconveniente de esta entrega simultánea son los diferentes requisitos de cada condición de recepción. Para abordar estas limitaciones, se han considerado diferentes técnicas de multiplexación. Mientras que DVB-T2 acomete la entrega simultánea de los dos servicios mediante TDM, ATSC 3.0 adoptó la Multiplexación por División en Capas (LDM). LDM puede superar a TDM y a FDM al aprovechar la relación de Protección de Error Desigual (UEP), ya que ambos servicios, llamados capas, utilizan todos los recursos de frecuencia y tiempo con diferentes niveles de potencia. En el lado del receptor, se distinguen dos implementaciones, de acuerdo con la capa a decodificar. Los receptores móviles solo están destinados a obtener la capa superior, conocida como Core Layer (CL). Para no aumentar su complejidad en comparación con los receptores de capa única, la capa inferior, conocida como Enhanced Layer (EL), es tratada como un ruido adicional en la decodificación. Los receptores fijos aumentan su complejidad, ya que deben realizar un proceso de Cancelación de Interferencia (SIC) sobre la CL para obtener la EL. Para limitar la complejidad adicional de los receptores fijos, las capas de LDM en ATSC 3.0 están configuradas con diferentes capacidades de corrección, pero comparten el resto de bloques de la capa física, incluido el TIL, el PP, el tamaño de FFT, y el GI. Esta disertación investiga tecnologías avanzadas para optimizar el rendimiento de LDM. Primero se propone una optimización del proceso de demapeo para las dos capas de LDM. El algoritmo propuesto logra un aumento de capacidad, al tener en cuenta la forma de la EL en el proceso de demapeo de la CL. Sin embargo, el número de distancias Euclidianas a computar puede aumentar significativamente, conduciendo no solo a receptores fijos más complejos, sino también a receptores móviles más complejos. A continuación, se determina la configuración de piloto ATSC 3.0 más adecuada para LDM. Teniendo en cuenta que las dos capas comparten el mismo PP, surge una contrapartida entre la densidad de pilotos (CL) y la redundancia sobre los datos (EL). A partir de los resultados de rendimiento, se recomienda el uso de un PP no muy denso, ya que ya han sido diseñados para hacer frente a ecos largos y altas velocidades. La amplitud piloto óptima depende del estimador de canal en los receptores (ej., se recomienda la amplitud mínima para una implementación Wiener, mientras que la máxima para una implementación FFT). También se investiga la potencial transmisión conjunta de LDM con tres tecnologías avanzadas adoptadas en ATSC 3.0: las tecnologías de agregación MultiRF, los esquemas de MISO distribuido y los de MIMO colocalizado. Se estudian los potenciales casos de uso, los aspectos de implementación del transmisor y el receptor, y las ganancias de rendimiento de las configuraciones conjuntas para las dos capas de LDM. Las restricciones adicionales de combinar LDM con las tecnologías avanzadas se consideran admisibles, ya que las mayores demandas ya están contempladas en ATSC 3.0 (ej., una segunda cadena de recepción). Se obtienen ganancias significativas en condiciones de recepción peatonal gracias a la diversidad en frecuencia proporcionada por las tecnologías MultiRF. La conjunción de LDM con esquemas de MISO proporciona ganancias de rendimiento significativas en redes SFN para la capa fija con el esquema de Alamouti.Since the beginning of the 21st century, terrestrial broadcasting systems have been blamed of an inefficient use of the allocated spectrum. To increase the spectral efficiency, digital television Standards Developing Organizations settled to develop the technical evolution of the first-generation DTT systems. Among others, a primary goal of next-generation DTT systems (DVB-T2 and ATSC 3.0) is to simultaneously provide TV services to mobile and fixed devices. The major drawback of this simultaneous delivery is the different requirement of each reception condition. To address these constraints different multiplexing techniques have been considered. While DVB-T2 fulfilled the simultaneous delivery of the two services by TDM, ATSC 3.0 adopted the LDM technology. LDM can outperform TDM and FDM by taking advantage of the UEP ratio, as both services, namely layers, utilize all the frequency and time resources with different power levels. At receiver side, two implementations are distinguished, according to the intended layer. Mobile receivers are only intended to obtain the upper layer, known as CL. In order not to increase their complexity compared to single layer receivers, the lower layer, known as EL is treated as an additional noise on the CL decoding. Fixed receivers, increase their complexity, as they should performed a SIC process on the CL for getting the EL. To limit the additional complexity of fixed receivers, the LDM layers in ATSC 3.0 are configured with different error correction capabilities, but share the rest of physical layer parameters, including the TIL, the PP, the FFT size, and the GI. This dissertation investigates advanced technologies to optimize the LDM performance. A demapping optimization for the two LDM layers is first proposed. A capacity increase is achieved by the proposed algorithm, which takes into account the underlying layer shape in the demapping process. Nevertheless, the number of Euclidean distances to be computed can be significantly increased, contributing to not only more complex fixed receivers, but also more complex mobile receivers. Next, the most suitable ATSC 3.0 pilot configuration for LDM is determined. Considering the two layers share the same PP a trade-off between pilot density (CL) and data overhead (EL) arises. From the performance results, it is recommended the use of a not very dense PP, as they have been already designed to cope with long echoes and high speeds. The optimum pilot amplitude depends on the channel estimator at receivers (e.g. the minimum amplitude is recommended for a Wiener implementation, while the maximum for a FFT implementation). The potential combination of LDM with three advanced technologies that have been adopted in ATSC 3.0 is also investigated: MultiRF technologies, distributed MISO schemes, and co-located MIMO schemes. The potential use cases, the transmitter and receiver implementations, and the performance gains of the joint configurations are studied for the two LDM layers. The additional constraints of combining LDM with the advanced technologies is considered admissible, as the greatest demands (e.g. a second receiving chain) are already contemplated in ATSC 3.0. Significant gains are found for the mobile layer at pedestrian reception conditions thanks to the frequency diversity provided by MultiRF technologies. The conjunction of LDM with distributed MISO schemes provides significant performance gains on SFNs for the fixed layer with Alamouti scheme. Last, considering the complexity in the mobile receivers and the CL performance, the recommended joint configuration is MISO in the CL and MIMO in the EL.Des de començaments del segle XXI, els sistemes de radiodifusió terrestre han sigut culpats d'un ús ineficient de l'espectre assignat. Per a augmentar l'eficiència espectral, els organismes d'estandardització de TV digital van començar a desenvolupar l'evolució tècnica dels sistemes de TDT de primera generació. Entre altres, un dels objectius principals dels sistemes de TDT de pròxima generació (DVB-T2 i el ATSC 3.0) és proporcionar simultàniament serveis de TV a dispositius mòbils i fixos. El principal inconvenient d'aquest lliurament simultani són els diferents requisits de cada condició de recepció. Per a abordar aquestes limitacions, s'han considerat diferents tècniques de multiplexació. Mentre que DVB-T2 escomet el lliurament simultani dels dos serveis mitjançant TDM, ATSC 3.0 va adoptar la Multiplexació per Divisió en Capes (LDM). LDM pot superar a TDM i a FDM en aprofitar la relació de Protecció d'Error Desigual (UEP), ja que tots dos serveis, cridats capes, utilitzen tots els recursos de freqüència i temps amb diferents nivells de potència. En el costat del receptor, es distingeixen dues implementacions, d'acord amb la capa a decodificar. Els receptors mòbils solament estan destinats a obtenir la capa superior, coneguda com Core Layer (CL). Per a no augmentar la seua complexitat en comparació amb els receptors de capa única, la capa inferior, coneguda com Enhanced Layer (EL), és tractada com un soroll addicional en la decodificació. Els receptors fixos augmenten la seua complexitat, ja que han de realitzar un procés de Cancel·lació d'Interferència (SIC) sobre la CL per a obtenir l'EL. Per a limitar la complexitat addicional dels receptors fixos, les capes de LDM en ATSC 3.0 estan configurades amb diferents capacitats de correcció, però comparteixen la resta de blocs de la capa física, inclòs el TIL, el PP, la grandària de FFT i el GI. Aquesta dissertació investiga tecnologies avançades per a optimitzar el rendiment de LDM. Primer es proposa una optimització del procés de demapeo per a les dues capes de LDM. L'algoritme proposat aconsegueix un augment de capacitat, en tenir en compte la forma de l'EL en el procés de demapeo de la CL. No obstant açò, el nombre de distàncies Euclidianes a computar pot augmentar significativament, conduint NO sols a receptors fixos més complexos, sinó també a receptors mòbils més complexos. A continuació, es determina la configuració de pilot ATSC 3.0 més adequada per a LDM. Tenint en compte que les dues capes comparteixen el mateix PP, es produeix una contrapartida entre la densitat de pilots (CL) i la redundància sobre les dades (EL). A partir dels resultats de rendiment, es recomana l'ús d'un PP no gaire dens, ja que ja han sigut dissenyats per a fer front a ecos llargs i altes velocitats. L'amplitud pilot òptima depèn de l'estimador de canal en els receptors (ex., es recomana l'amplitud mínima per a una implementació Wiener, mentre que la màxima per a una implementació FFT). També s'investiga la potencial transmissió conjunta de LDM amb tres tecnologies avançades adoptades en ATSC 3.0: les tecnologies d'agregació de MultiRF, els esquemes de MISO distribuït i els de MIMO colocalitzat. S'estudien els potencials casos d'ús, els principals aspectes d'implementació del transmissor i el receptor, i els guanys de rendiment de les configuracions conjuntes per a les dues capes de LDM. Les restriccions addicionals de combinar LDM amb les tecnologies avançades es consideren admissibles, ja que les majors demandes ja estan contemplades en ATSC 3.0 (ex., una segona cadena de recepció). S'obtenen guanys significatius per a la capa mòbil en condicions de recepció per als vianants gràcies a la diversitat en freqüència proporcionada per les tecnologies MultiRF. La conjunció de LDM amb esquemes MISO distribuïts proporciona guanys de rendiment significatius en xarxes SFN per a la capa fixa amb l'esquema d'Alamouti.Garro Crevillén, E. (2018). Advanced Layered Divsion Multiplexing Technologies for Next-Gen Broadcast [Tesis doctoral no publicada]. Universitat Politècnica de València. https://doi.org/10.4995/Thesis/10251/105559TESISCompendi
    corecore