Search CORE

45 research outputs found

Efficient reverse-play algorithms for MPEG video with VCR support

Author: Chan YL
Fu CH
Siu WC
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 11/12/2014
Field of study

Centre for Multimedia Signal Processing, Department of Electronic and Information Engineering2005-2006 > Academic research: refereed > Publication in refereed journalVersion of RecordPublishe

PolyU Institutional Repository

Performance evaluation of transcoding algorithms for H.264

Author: Bull DR
Canagarajah CN
Lefol D
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/02/2006
Field of study

Explore Bristol Research

Signal processing for improved MPEG-based communication systems

Author: Eerenberg O.
Publication venue: Technische Universiteit Eindhoven
Publication date: 01/01/2015
Field of study

Repository TU/e

Pure OAI Repository

An Optimal Framework of Video Adaptation and Its Application to Rate Adaptation Transcoding

Author: Hyung-Myung Kim
Jae-Gon Kim
Shih-Fu Chang
Yong Wang
Publication venue: 'Wiley'
Publication date
Field of study

Crossref

Adaptive video delivery using semantics

Author: Steiger Olivier
Publication venue: Lausanne, EPFL
Publication date: 16/03/2005
Field of study

The diffusion of network appliances such as cellular phones, personal digital assistants and hand-held computers has created the need to personalize the way media content is delivered to the end user. Moreover, recent devices, such as digital radio receivers with graphics displays, and new applications, such as intelligent visual surveillance, require novel forms of video analysis for content adaptation and summarization. To cope with these challenges, we propose an automatic method for the extraction of semantics from video, and we present a framework that exploits these semantics in order to provide adaptive video delivery. First, an algorithm that relies on motion information to extract multiple semantic video objects is proposed. The algorithm operates in two stages. In the first stage, a statistical change detector produces the segmentation of moving objects from the background. This process is robust with regard to camera noise and does not need manual tuning along a sequence or for different sequences. In the second stage, feedbacks between an object partition and a region partition are used to track individual objects along the frames. These interactions allow us to cope with multiple, deformable objects, occlusions, splitting, appearance and disappearance of objects, and complex motion. Subsequently, semantics are used to prioritize visual data in order to improve the performance of adaptive video delivery. The idea behind this approach is to organize the content so that a particular network or device does not inhibit the main content message. Specifically, we propose two new video adaptation strategies. The first strategy combines semantic analysis with a traditional frame-based video encoder. Background simplifications resulting from this approach do not penalize overall quality at low bitrates. The second strategy uses metadata to efficiently encode the main content message. The metadata-based representation of object's shape and motion suffices to convey the meaning and action of a scene when the objects are familiar. The impact of different video adaptation strategies is then quantified with subjective experiments. We ask a panel of human observers to rate the quality of adapted video sequences on a normalized scale. From these results, we further derive an objective quality metric, the semantic peak signal-to-noise ratio (SPSNR), that accounts for different image areas and for their relevance to the observer in order to reflect the focus of attention of the human visual system. At last, we determine the adaptation strategy that provides maximum value for the end user by maximizing the SPSNR for given client resources at the time of delivery. By combining semantic video analysis and adaptive delivery, the solution presented in this dissertation permits the distribution of video in complex media environments and supports a large variety of content-based applications

Infoscience - École polytechnique fédérale de Lausanne

Recommended from our members

Scalable and network aware video coding for advanced communications over heterogeneous networks

Author: Muhammad Sanusi
Publication venue: Brunel University School of Engineering and Design PhD Theses
Publication date: 01/01/2013
Field of study

This thesis was submitted for the degree of Doctor of Philosophy and was awarded by Brunel UniversityThis work addresses the issues concerned with the provision of scalable video services over heterogeneous networks particularly with regards to dynamic adaptation and user’s acceptable quality of service. In order to provide and sustain an adaptive and network friendly multimedia communication service, a suite of techniques that achieved automatic scalability and adaptation are developed. These techniques are evaluated objectively and subjectively to assess the Quality of Service (QoS) provided to diverse users with variable constraints and dynamic resources. The research ensured the consideration of various levels of user acceptable QoS The techniques are further evaluated with view to establish their performance against state of the art scalable and non-scalable techniques. To further improve the adaptability of the designed techniques, several experiments and real time simulations are conducted with the aim of determining the optimum performance with various coding parameters and scenarios. The coding parameters and scenarios are evaluated and analyzed to determine their performance using various types of video content and formats. Several algorithms are developed to provide a dynamic adaptation of coding tools and parameters to specific video content type, format and bandwidth of transmission. Due to the nature of heterogeneous networks where channel conditions, terminals, users capabilities and preferences etc are unpredictably changing, hence limiting the adaptability of a specific technique adopted, a Dynamic Scalability Decision Making Algorithm (SADMA) is developed. The algorithm autonomously selects one of the designed scalability techniques basing its decision on the monitored and reported channel conditions. Experiments were conducted using a purpose-built heterogeneous network simulator and the network-aware selection of the scalability techniques is based on real time simulation results. A technique with a minimum delay, low bit-rate, low frame rate and low quality is adopted as a reactive measure to a predicted bad channel condition. If the use of the techniques is not favoured due to deteriorating channel conditions reported, a reduced layered stream or base layer is used. If the network status does not allow the use of the base layer, then the stream uses parameter identifiers with high efficiency to improve the scalability and adaptation of the video service. To further improve the flexibility and efficiency of the algorithm, a dynamic de-blocking filter and lambda value selection are analyzed and introduced in the algorithm. Various methods, interfaces and algorithms are defined for transcoding from one technique to another and extracting sub-streams when the network conditions do not allow for the transmission of the entire bit-stream

Brunel University Research Archive

Content Fragile Watermarking for H.264/AVC Video Authentication

Author: Ait Saadi Karima
Bouridane Ahmed
Guessoum Abderrezak
Khelifi Fouad
Publication venue: 'Informa UK Limited'
Publication date: 01/01/2017
Field of study

Discrete Cosine transform (DCT) to generate the authentication data that are treated as a fragile watermark. This watermark is embedded in the motion vectors (MVs) The advances in multimedia technologies and digital processing tools have brought with them new challenges for the source and content authentication. To ensure the integrity of the H.264/AVC video stream, we introduce an approach based on a content fragile video watermarking method using an independent authentication of each Group of Pictures (GOPs) within the video. This technique uses robust visual features extracted from the video pertaining to the set of selected macroblocs (MBs) which hold the best partition mode in a tree-structured motion compensation process. An additional security degree is offered by the proposed method through using a more secured keyed function HMAC-SHA-256 and randomly choosing candidates from already selected MBs. In here, the watermark detection and verification processes are blind, whereas the tampered frames detection is not since it needs the original frames within the tampered GOPs. The proposed scheme achieves an accurate authentication technique with a high fragility and fidelity whilst maintaining the original bitrate and the perceptual quality. Furthermore, its ability to detect the tampered frames in case of spatial, temporal and colour manipulations, is confirmed

Northumbria Research Link

Development of advanced multimedia services in P2P architectures

Author: González Cela Alberto José
Publication venue: Universitat Politècnica de Catalunya
Publication date: 09/02/2007
Field of study

La transmissió de fluxos multimèdia en temps real (streaming) s’ha convertit en un tema punter i de gran interès al món de la recerca d’Internet, especialment quan ens referim a aplicacions de transmissió d’àudio i vídeo en directe a través de xarxes peer-to-peer (P2P). Generalment, aquestes aplicacions han de fer front a molts problemes en el seu disseny i implementació deguts a la dinamicitat i heterogeneïtat que per natura caracteritzen les xarxes P2P. En aquest projecte, s’introdueixen noves característiques que les aplicacions de transmissió multimèdia P2P actuals no contemplen. Els requisits de connexió i maquinari són diferents per a la transmissió de fluxos de baixa i alta capacitat, no obstant, tots els nodes s’acostumen a considerar idèntics, cosa que no representa una aproximació gaire encertada tenint en compte un medi tan heterogeni. A més a més, amb la finalitat d’aconseguir distinció entre nodes, es fa necessari la introducció d’un mecanisme que permeti l’intercanvi de les capacitats específiques de cada node, incloent-hi les de transcodificació de fluxos. Un altre aspecte a destacar és el fet que aquestes aplicacions són difícils d’ampliar, incorporar nous serveis o modificar les dades que porten precarregades com ara la llista de canals de televisió disponibles, cosa que impossibilita garantir la disponibilitat de la font tot el temps. Per altra banda, els serveis interactius tampoc s’han desenvolupat gaire. Aquest projecte proposa el disseny i implementació d’una plataforma de difusió multimèdia P2P cooperativa i interactiva que permet superar els problemes esmentats. La plataforma integra diferents mecanismes que permeten la distribució en temps real de continguts multimèdia en diferents qualitats incloent fluxos d’alta capacitat (com per exemple HD). Aquesta plataforma és una solució novedosa basada en JXTA, DONET i ALM (Arbres Multicast a nivell d’Aplicació) que proporciona un sistema ampliable segons noves necessitats i facilita la inserció de nous serveis de valor afegit. La plataforma proposada es fonamenta en la creació d’una arquitectura de 2 capes lògiques superposades: una capa lògica JXTA, encarregada bàsicament de la senyalització i intercanvi de metadades, i una capa de transmissió basada en sockets UDP unicast. D’aquesta manera, la diferència entre la capa de transmissió i la capa física es pot veure reduïda a partir de la informació obtinguda de la capa JXTA, la qual es va actualitzant al llarg del temps

UPCommons. Portal del coneixement obert de la UPC

Recommended from our members

3D multiple description coding for error resilience over wireless networks

Author: Umar Abubakar
Publication venue: Brunel University School of Engineering and Design PhD Theses
Publication date: 01/01/2011
Field of study

This thesis was submitted for the degree of Doctor of Philosophy and awarded by Brunel University.Mobile communications has gained a growing interest from both customers and service providers alike in the last 1-2 decades. Visual information is used in many application domains such as remote health care, video –on demand, broadcasting, video surveillance etc. In order to enhance the visual effects of digital video content, the depth perception needs to be provided with the actual visual content. 3D video has earned a significant interest from the research community in recent years, due to the tremendous impact it leaves on viewers and its enhancement of the user’s quality of experience (QoE). In the near future, 3D video is likely to be used in most video applications, as it offers a greater sense of immersion and perceptual experience. When 3D video is compressed and transmitted over error prone channels, the associated packet loss leads to visual quality degradation. When a picture is lost or corrupted so severely that the concealment result is not acceptable, the receiver typically pauses video playback and waits for the next INTRA picture to resume decoding. Error propagation caused by employing predictive coding may degrade the video quality severely. There are several ways used to mitigate the effects of such transmission errors. One widely used technique in International Video Coding Standards is error resilience. The motivation behind this research work is that, existing schemes for 2D colour video compression such as MPEG, JPEG and H.263 cannot be applied to 3D video content. 3D video signals contain depth as well as colour information and are bandwidth demanding, as they require the transmission of multiple high-bandwidth 3D video streams. On the other hand, the capacity of wireless channels is limited and wireless links are prone to various types of errors caused by noise, interference, fading, handoff, error burst and network congestion. Given the maximum bit rate budget to represent the 3D scene, optimal bit-rate allocation between texture and depth information rendering distortion/losses should be minimised. To mitigate the effect of these errors on the perceptual 3D video quality, error resilience video coding needs to be investigated further to offer better quality of experience (QoE) to end users. This research work aims at enhancing the error resilience capability of compressed 3D video, when transmitted over mobile channels, using Multiple Description Coding (MDC) in order to improve better user’s quality of experience (QoE). Furthermore, this thesis examines the sensitivity of the human visual system (HVS) when employed to view 3D video scenes. The approach used in this study is to use subjective testing in order to rate people’s perception of 3D video under error free and error prone conditions through the use of a carefully designed bespoke questionnaire.Petroleum Technology Development Fund (PTDF

Brunel University Research Archive

Flexi-WVSNP-DASH: A Wireless Video Sensor Network Platform for the Internet of Things

Author
Publication venue
Publication date: 01/01/2017
Field of study

abstract: Video capture, storage, and distribution in wireless video sensor networks (WVSNs) critically depends on the resources of the nodes forming the sensor networks. In the era of big data, Internet of Things (IoT), and distributed demand and solutions, there is a need for multi-dimensional data to be part of the Sensor Network data that is easily accessible and consumable by humanity as well as machinery. Images and video are expected to become as ubiquitous as is the scalar data in traditional sensor networks. The inception of video-streaming over the Internet, heralded a relentless research for effective ways of distributing video in a scalable and cost effective way. There has been novel implementation attempts across several network layers. Due to the inherent complications of backward compatibility and need for standardization across network layers, there has been a refocused attention to address most of the video distribution over the application layer. As a result, a few video streaming solutions over the Hypertext Transfer Protocol (HTTP) have been proposed. Most notable are Apple’s HTTP Live Streaming (HLS) and the Motion Picture Experts Groups Dynamic Adaptive Streaming over HTTP (MPEG-DASH). These frameworks, do not address the typical and future WVSN use cases. A highly flexible Wireless Video Sensor Network Platform and compatible DASH (WVSNP-DASH) are introduced. The platform's goal is to usher video as a data element that can be integrated into traditional and non-Internet networks. A low cost, scalable node is built from the ground up to be fully compatible with the Internet of Things Machine to Machine (M2M) concept, as well as the ability to be easily re-targeted to new applications in a short time. Flexi-WVSNP design includes a multi-radio node, a middle-ware for sensor operation and communication, a cross platform client facing data retriever/player framework, scalable security as well as a cohesive but decoupled hardware and software design.Dissertation/ThesisDoctoral Dissertation Electrical Engineering 201

ASU Digital Repository