Search CORE

51 research outputs found

Region-Based Template Matching Prediction for Intra Coding

Author: Marpe Detlev
Müller Karsten
Pfaff Jonathan
Schwarz Heiko
Venugopal Gayathri
Wiegand Thomas
Publication venue
Publication date: 01/01/2023
Field of study

Copy prediction is a renowned category of prediction techniques in video coding where the current block is predicted by copying the samples from a similar block that is present somewhere in the already decoded stream of samples. Motion-compensated prediction, intra block copy, template matching prediction etc. are examples. While the displacement information of the similar block is transmitted to the decoder in the bit-stream in the first two approaches, it is derived at the decoder in the last one by repeating the same search algorithm which was carried out at the encoder. Region-based template matching is a recently developed prediction algorithm that is an advanced form of standard template matching. In this method, the reference area is partitioned into multiple regions and the region to be searched for the similar block(s) is conveyed to the decoder in the bit-stream. Further, its final prediction signal is a linear combination of already decoded similar blocks from the given region. It was demonstrated in previous publications that region-based template matching is capable of achieving coding efficiency improvements for intra as well as inter-picture coding with considerably less decoder complexity than conventional template matching. In this paper, a theoretical justification for region-based template matching prediction subject to experimental data is presented. Additionally, the test results of the aforementioned method on the latest H.266/Versatile Video Coding (VVC) test model (version VTM-14.0) yield an average Bjøntegaard-Delta (BD) bit-rate savings of −0.75% using all intra (AI) configuration with 130% encoder run-time and 104% decoder run-time for a particular parameter selection

Institutional Repository of the Freie Universität Berlin

Deep Learning in Visual Computing and Signal Processing

Author: Danfeng Xie
Lei Zhang
Li Bai
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2017
Field of study

Crossref

Intelligent Algorithm for Enhancing MPEG-DASH QoE in eMBMS

Author: Abdullah Miran Taha
Canovas Solbes Alejandro
Jimenez Jose M.
Lloret Jaime
Publication venue: 'Macrothink Institute, Inc.'
Publication date: 31/12/2017
Field of study

[EN] Multimedia streaming is the most demanding and bandwidth hungry application in today¿s world of Internet. MPEG-DASH as a video technology standard is designed for delivering live or on-demand streams in Internet to deliver best quality content with the fewest dropouts and least possible buffering. Hybrid architecture of DASH and eMBMS has attracted a great attention from the telecommunication industry and multimedia services. It is deployed in response to the immense demand in multimedia traffic. However, handover and limited available resources of the system affected on dropping segments of the adaptive video streaming in eMBMS and it creates an adverse impact on Quality of Experience (QoE), which is creating trouble for service providers and network providers towards delivering the service. In this paper, we derive a case study in eMBMS to approach to provide test measures evaluating MPEG-DASH QoE, by defining the metrics are influenced on QoE in eMBMS such as bandwidth and packet loss then we observe the objective metrics like stalling (number, duration and place), buffer length and accumulative video time. Moreover, we build a smart algorithm to predict rate of segments are lost in multicast adaptive video streaming. The algorithm deploys an estimation decision regards how to recover the lost segments. According to the obtained results based on our proposal algorithm, rate of lost segments is highly decreased by comparing to the traditional approach of MPEG-DASH multicast and unicast for high number of users.This work has been partially supported by the Postdoctoral Scholarship Contratos Postdoctorales UPV 2014 (PAID-10-14) of the Universitat Politècnica de València , by the Programa para la Formación de Personal Investigador (FPI-2015-S2-884) of the Universitat Politècnica de València , by the Ministerio de Economía y Competitividad , through the Convocatoria 2014. Proyectos I+D - Programa Estatal de Investigación Científica y Técnica de Excelencia in the Subprograma Estatal de Generación de Conocimiento , project TIN2014-57991-C3-1-P and through the Convocatoria 2017 - Proyectos I+D+I - Programa Estatal de Investigación, Desarrollo e Innovación, convocatoria excelencia (Project TIN2017-84802-C2-1-P).Abdullah, MT.; Jimenez, JM.; Canovas Solbes, A.; Lloret, J. (2017). Intelligent Algorithm for Enhancing MPEG-DASH QoE in eMBMS. Network Protocols and Algorithms. 9(3-4):94-114. https://doi.org/10.5296/npa.v9i3-4.12573S9411493-

Crossref

RiuNet

Quality of Experience (QoE)-Aware Fast Coding Unit Size Selection for HEVC Intra-prediction

Author: Erabadda Buddhiprabha
Fernando Anil
Hewage Chaminda
Mallikarachchi Thanuja
Publication venue: 'MDPI AG'
Publication date: 11/08/2019
Field of study

The exorbitant increase in the computational complexity of modern video coding standards, such as High Efficiency Video Coding (HEVC), is a compelling challenge for resource-constrained consumer electronic devices. For instance, the brute force evaluation of all possible combinations of available coding modes and quadtree-based coding structure in HEVC to determine the optimum set of coding parameters for a given content demand a substantial amount of computational and energy resources. Thus, the resource requirements for real time operation of HEVC has become a contributing factor towards the Quality of Experience (QoE) of the end users of emerging multimedia and future internet applications. In this context, this paper proposes a content-adaptive Coding Unit (CU) size selection algorithm for HEVC intra-prediction. The proposed algorithm builds content-specific weighted Support Vector Machine (SVM) models in real time during the encoding process, to provide an early estimate of CU size for a given content, avoiding the brute force evaluation of all possible coding mode combinations in HEVC. The experimental results demonstrate an average encoding time reduction of 52.38%, with an average Bjøntegaard Delta Bit Rate (BDBR) increase of 1.19% compared to the HM16.1 reference encoder. Furthermore, the perceptual visual quality assessments conducted through Video Quality Metric (VQM) show minimal visual quality impact on the reconstructed videos of the proposed algorithm compared to state-of-the-art approaches

University of Surrey

Cardiff Metropolitan Research Repository (DSpace)

GAN-Based Differential Private Image Privacy Protection Framework for the Internet of Multimedia Things.

Author: Ding M
Liu B
Wang Y
Xue H
Yu J
Zhu S
Publication venue: 'MDPI AG'
Publication date: 29/01/2021
Field of study

With the development of the Internet of Multimedia Things (IoMT), an increasing amount of image data is collected by various multimedia devices, such as smartphones, cameras, and drones. This massive number of images are widely used in each field of IoMT, which presents substantial challenges for privacy preservation. In this paper, we propose a new image privacy protection framework in an effort to protect the sensitive personal information contained in images collected by IoMT devices. We aim to use deep neural network techniques to identify the privacy-sensitive content in images, and then protect it with the synthetic content generated by generative adversarial networks (GANs) with differential privacy (DP). Our experiment results show that the proposed framework can effectively protect users' privacy while maintaining image utility

OPUS - University of Technology Sydney

A Multiple Level-of-Detail 3D Data Transmission Approach for Low-Latency Remote Visualisation in Teleoperation Tasks

Author: Caliskanelli Ipek
Niu Hanlin
Pacheco-Gutierrez Salvador
Skilton Robert
Publication venue: 'MDPI AG'
Publication date: 13/07/2021
Field of study

From MDPI via Jisc Publications RouterHistory: accepted 2021-07-13, pub-electronic 2021-07-14Publication status: PublishedFunder: Engineering and Physical Sciences Research Council; Grant(s): EP/S03286X/1In robotic teleoperation, the knowledge of the state of the remote environment in real time is paramount. Advances in the development of highly accurate 3D cameras able to provide high-quality point clouds appear to be a feasible solution for generating live, up-to-date virtual environments. Unfortunately, the exceptional accuracy and high density of these data represent a burden for communications requiring a large bandwidth affecting setups where the local and remote systems are particularly geographically distant. This paper presents a multiple level-of-detail (LoD) compression strategy for 3D data based on tree-like codification structures capable of compressing a single data frame at multiple resolutions using dynamically configured parameters. The level of compression (resolution) of objects is prioritised based on: (i) placement on the scene; and (ii) the type of object. For the former, classical point cloud fitting and segmentation techniques are implemented; for the latter, user-defined prioritisation is considered. The results obtained are compared using a single LoD (whole-scene) compression technique previously proposed by the authors. Results showed a considerable improvement to the transmitted data size and updated frame rate while maintaining low distortion after decompression

ChesterRep

Prototype to Increase Crosswalk Safety by Integrating Computer Vision with ITS-G5 Technologies

Author: Costa Paulo
Gaspar Francisco
Guerreiro Vitor
Loureiro Paulo
Mendes Sílvio
Rabadão Carlos
Publication venue: 'MDPI AG'
Publication date: 01/01/2020
Field of study

Human errors are probably the main cause of car accidents, and this type of vehicle is one of the most dangerous forms of transport for people. The danger comes from the fact that on public roads there are simultaneously different types of actors (drivers, pedestrians or cyclists) and many objects that change their position over time, making difficult to predict their immediate movements. The intelligent transport system (ITS-G5) standard specifies the European communication technologies and protocols to assist public road users, providing them with relevant information. The scientific community is developing ITS-G5 applications for various purposes, among which is the increasing of pedestrian safety. This paper describes the developed work to implement an ITS-G5 prototype that aims at the increasing of pedestrian and driver safety in the vicinity of a pedestrian crosswalk by sending ITS-G5 decentralized environmental notification messages (DENM) to the vehicles. These messages are analyzed, and if they are relevant, they are presented to the driver through a car’s onboard infotainment system. This alert allows the driver to take safety precautions to prevent accidents. The implemented prototype was tested in a controlled environment pedestrian crosswalk. The results showed the capacity of the prototype for detecting pedestrians, suitable message sending, the reception and processing on a vehicle onboard unit (OBU) module and its presentation on the car onboard infotainment system.info:eu-repo/semantics/publishedVersio

IC-online

Deep Learning in Visual Computing and Signal Processing

Author: Danfeng Xie
Francesco Carlo Morabito
Lei Zhang
Li Bai
Publication venue
Publication date: 24/04/2020
Field of study

Deep learning is a subfield of machine learning, which aims to learn a hierarchy of features from input data. Nowadays, researchers have intensively investigated deep learning algorithms for solving challenging problems in many areas such as image classification, speech recognition, signal processing, and natural language processing. In this study, we not only review typical deep learning algorithms in computer vision and signal processing but also provide detailed information on how to apply deep learning to specific areas such as road crack detection, fault diagnosis, and human activity detection. Besides, this study also discusses the challenges of designing and training deep neural networks

CiteSeerX

Vulnerable road users and connected autonomous vehicles interaction: a survey

Author: Guerrero Ibañez Juan
Reyes Muñoz María Angélica
Publication venue: 'MDPI AG'
Publication date: 18/06/2022
Field of study

There is a group of users within the vehicular traffic ecosystem known as Vulnerable Road Users (VRUs). VRUs include pedestrians, cyclists, motorcyclists, among others. On the other hand, connected autonomous vehicles (CAVs) are a set of technologies that combines, on the one hand, communication technologies to stay always ubiquitous connected, and on the other hand, automated technologies to assist or replace the human driver during the driving process. Autonomous vehicles are being visualized as a viable alternative to solve road accidents providing a general safe environment for all the users on the road specifically to the most vulnerable. One of the problems facing autonomous vehicles is to generate mechanisms that facilitate their integration not only within the mobility environment, but also into the road society in a safe and efficient way. In this paper, we analyze and discuss how this integration can take place, reviewing the work that has been developed in recent years in each of the stages of the vehicle-human interaction, analyzing the challenges of vulnerable users and proposing solutions that contribute to solving these challenges.This work was partially funded by the Ministry of Economy, Industry, and Competitiveness of Spain under Grant: Supervision of drone fleet and optimization of commercial operations flight plans, PID2020-116377RB-C21.Peer ReviewedPostprint (published version

UPCommons. Portal del coneixement obert de la UPC