45 research outputs found

    Efficient reverse-play algorithms for MPEG video with VCR support

    Centre for Multimedia Signal Processing, Department of Electronic and Information Engineering; 2005-2006 > Academic research: refereed > Publication in refereed journal; Version of Record; Publishe

    Performance evaluation of transcoding algorithms for H.264


    Signal processing for improved MPEG-based communication systems


    Adaptive video delivery using semantics

    The diffusion of network appliances such as cellular phones, personal digital assistants and hand-held computers has created the need to personalize the way media content is delivered to the end user. Moreover, recent devices, such as digital radio receivers with graphics displays, and new applications, such as intelligent visual surveillance, require novel forms of video analysis for content adaptation and summarization. To cope with these challenges, we propose an automatic method for the extraction of semantics from video, and we present a framework that exploits these semantics in order to provide adaptive video delivery. First, an algorithm that relies on motion information to extract multiple semantic video objects is proposed. The algorithm operates in two stages. In the first stage, a statistical change detector segments moving objects from the background. This process is robust to camera noise and does not need manual tuning along a sequence or for different sequences. In the second stage, feedback between an object partition and a region partition is used to track individual objects along the frames. These interactions allow us to cope with multiple, deformable objects, occlusions, splitting, appearance and disappearance of objects, and complex motion. Subsequently, semantics are used to prioritize visual data in order to improve the performance of adaptive video delivery. The idea behind this approach is to organize the content so that a particular network or device does not inhibit the main content message. Specifically, we propose two new video adaptation strategies. The first strategy combines semantic analysis with a traditional frame-based video encoder. Background simplifications resulting from this approach do not penalize overall quality at low bitrates. The second strategy uses metadata to efficiently encode the main content message. The metadata-based representation of object shape and motion suffices to convey the meaning and action of a scene when the objects are familiar. The impact of different video adaptation strategies is then quantified with subjective experiments. We ask a panel of human observers to rate the quality of adapted video sequences on a normalized scale. From these results, we further derive an objective quality metric, the semantic peak signal-to-noise ratio (SPSNR), that accounts for different image areas and for their relevance to the observer in order to reflect the focus of attention of the human visual system. Finally, we determine the adaptation strategy that provides maximum value for the end user by maximizing the SPSNR for given client resources at the time of delivery. By combining semantic video analysis and adaptive delivery, the solution presented in this dissertation permits the distribution of video in complex media environments and supports a large variety of content-based applications.
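The abstract above does not give the SPSNR formula. As a rough illustration of a quality metric that "accounts for different image areas and for their relevance to the observer", the sketch below computes a PSNR over a relevance-weighted mean squared error. The function name `semantic_psnr`, the per-pixel `weights` map and the weighting scheme itself are assumptions made for illustration, not the dissertation's definition.

```python
import math

def semantic_psnr(reference, adapted, weights, peak=255.0):
    """PSNR over a relevance-weighted MSE (illustrative assumption, not the
    dissertation's SPSNR definition).

    reference, adapted: flat sequences of pixel intensities of equal length.
    weights: per-pixel relevance, e.g. higher inside semantic objects.
    """
    weighted_error = 0.0
    total_weight = 0.0
    for ref, ada, w in zip(reference, adapted, weights):
        weighted_error += w * (ref - ada) ** 2
        total_weight += w
    if total_weight == 0:
        raise ValueError("weights must not sum to zero")
    wmse = weighted_error / total_weight
    if wmse == 0:
        return math.inf  # identical images in every weighted area
    return 10.0 * math.log10(peak ** 2 / wmse)
```

With such a weighting, the same distortion scores lower when it falls inside a high-relevance object than when it falls in the background, which is the behaviour the abstract attributes to the metric.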

    Content Fragile Watermarking for H.264/AVC Video Authentication

    ... Discrete Cosine Transform (DCT) to generate the authentication data that are treated as a fragile watermark. This watermark is embedded in the motion vectors (MVs). The advances in multimedia technologies and digital processing tools have brought with them new challenges for source and content authentication. To ensure the integrity of the H.264/AVC video stream, we introduce an approach based on a content-fragile video watermarking method that authenticates each Group of Pictures (GOP) within the video independently. This technique uses robust visual features extracted from the video pertaining to the set of selected macroblocks (MBs) which hold the best partition mode in a tree-structured motion compensation process. The proposed method offers an additional degree of security by using a more secure keyed function, HMAC-SHA-256, and by randomly choosing candidates from the already selected MBs. Here, the watermark detection and verification processes are blind, whereas tampered-frame detection is not, since it needs the original frames within the tampered GOPs. The proposed scheme achieves accurate authentication with high fragility and fidelity whilst maintaining the original bitrate and the perceptual quality. Furthermore, its ability to detect tampered frames in case of spatial, temporal and colour manipulations is confirmed.
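The abstract names HMAC-SHA-256 as the keyed function and authenticates each GOP independently. A minimal sketch of that idea follows, using Python's standard `hmac` and `hashlib` modules. The helper names `gop_tag` and `verify_gop`, the 4-byte GOP index prefix and the `features` byte string (standing in for the extracted visual features) are hypothetical. Feature extraction and embedding in the motion vectors are not shown.

```python
import hashlib
import hmac

def gop_tag(key: bytes, gop_index: int, features: bytes) -> bytes:
    """Return an HMAC-SHA-256 tag over one GOP's feature bytes.

    Prefixing the GOP index (an assumption of this sketch) ties the tag to
    the GOP's position, so a tag cannot be reused for another GOP.
    """
    message = gop_index.to_bytes(4, "big") + features
    return hmac.new(key, message, hashlib.sha256).digest()

def verify_gop(key: bytes, gop_index: int, features: bytes, tag: bytes) -> bool:
    """Recompute the tag and compare it in constant time."""
    return hmac.compare_digest(gop_tag(key, gop_index, features), tag)
```

Because each tag covers a single GOP, a failed check localizes tampering to that GOP without requiring the rest of the stream to verify.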

    Development of advanced multimedia services in P2P architectures

    Real-time transmission of multimedia streams (streaming) has become a leading topic of great interest in Internet research, especially for applications that transmit live audio and video over peer-to-peer (P2P) networks. These applications generally face many design and implementation problems caused by the dynamic and heterogeneous nature of P2P networks. This project introduces new features that current P2P multimedia streaming applications do not consider. Connection and hardware requirements differ between low-capacity and high-capacity streams, yet all nodes are usually treated as identical, which is a poor approximation in such a heterogeneous environment. Moreover, to distinguish between nodes, a mechanism is needed that lets each node exchange its specific capabilities, including its stream transcoding capabilities. Another notable aspect is that these applications are hard to extend, to augment with new services, or to update in their preloaded data, such as the list of available television channels, which makes it impossible to guarantee the availability of the source at all times. In addition, interactive services have seen little development. This project proposes the design and implementation of a cooperative and interactive P2P multimedia broadcasting platform that overcomes these problems. The platform integrates several mechanisms that enable real-time distribution of multimedia content at different qualities, including high-capacity streams (for example HD). This platform is a novel solution based on JXTA, DONET and ALM (Application-Level Multicast trees) that provides a system extensible to new needs and eases the insertion of new value-added services. The proposed platform is founded on an architecture of two overlaid logical layers: a JXTA logical layer, mainly responsible for signalling and metadata exchange, and a transmission layer based on unicast UDP sockets. In this way, the gap between the transmission layer and the physical layer can be reduced using the information obtained from the JXTA layer, which is updated over time.

    Flexi-WVSNP-DASH: A Wireless Video Sensor Network Platform for the Internet of Things

    Video capture, storage, and distribution in wireless video sensor networks (WVSNs) critically depend on the resources of the nodes forming the sensor networks. In the era of big data, the Internet of Things (IoT), and distributed demand and solutions, there is a need for multi-dimensional data to be part of the sensor network data that is easily accessible and consumable by people as well as machines. Images and video are expected to become as ubiquitous as scalar data in traditional sensor networks. The inception of video streaming over the Internet heralded relentless research into effective ways of distributing video in a scalable and cost-effective way. There have been novel implementation attempts across several network layers. Due to the inherent complications of backward compatibility and the need for standardization across network layers, attention has refocused on addressing most video distribution at the application layer. As a result, a few video streaming solutions over the Hypertext Transfer Protocol (HTTP) have been proposed. Most notable are Apple's HTTP Live Streaming (HLS) and the Moving Picture Experts Group's Dynamic Adaptive Streaming over HTTP (MPEG-DASH). These frameworks do not address the typical and future WVSN use cases. A highly flexible Wireless Video Sensor Network Platform and compatible DASH (WVSNP-DASH) are introduced. The platform's goal is to usher in video as a data element that can be integrated into traditional and non-Internet networks. A low-cost, scalable node is built from the ground up to be fully compatible with the Internet of Things Machine to Machine (M2M) concept, and to be easily re-targeted to new applications in a short time. The Flexi-WVSNP design includes a multi-radio node, middleware for sensor operation and communication, a cross-platform client-facing data retriever/player framework, scalable security, and a cohesive but decoupled hardware and software design.
    Dissertation/Thesis; Doctoral Dissertation; Electrical Engineering; 201