2,746 research outputs found

    Fusion of Global and Local Motion Estimation Using Foreground Objects for Distributed Video Coding

    Get PDF
    International audienceThe side information in distributed video coding is estimated using the available decoded frames, and exploited for the decoding and reconstruction of other frames. The quality of the side information has a strong impact on the performance of distributed video coding. Here we propose a new approach that combines both global and local side information to improve coding performance. Since the background pixels in a frame are assigned to global estimation and the foreground objects to local estimation, one needs to estimate foreground objects in the side information using the backward and forward foreground objects, The background pixels are directly taken from the global side information. Specifically, elastic curves and local motion compensation are used to generate the foreground objects masks in the side information. Experimental results show that, as far as the rate-distortion performance is concerned, the proposed approach can achieve a PSNR improvement of up to 1.39 dB for a GOP size of 2, and up to 4.73 dB for larger GOP sizes, with respect to the reference DISCOVER codec. Index Terms A. ABOU-ELAILAH, F. DUFAUX, M. CAGNAZZO, and B. PESQUET-POPESCU are with the Signal and Image Processin

    Segmentation-based mesh design for motion estimation

    Get PDF
    Dans la plupart des codec vidĂ©o standard, l'estimation des mouvements entre deux images se fait gĂ©nĂ©ralement par l'algorithme de concordance des blocs ou encore BMA pour « Block Matching Algorithm ». BMA permet de reprĂ©senter l'Ă©volution du contenu des images en dĂ©composant normalement une image par blocs 2D en mouvement translationnel. Cette technique de prĂ©diction conduit habituellement Ă  de sĂ©vĂšres distorsions de 1'artefact de bloc lorsque Ie mouvement est important. De plus, la dĂ©composition systĂ©matique en blocs rĂ©guliers ne dent pas compte nullement du contenu de l'image. Certains paramĂštres associes aux blocs, mais inutiles, doivent ĂȘtre transmis; ce qui rĂ©sulte d'une augmentation de dĂ©bit de transmission. Pour paillier a ces dĂ©fauts de BMA, on considĂšre les deux objectifs importants dans Ie codage vidĂ©o, qui sont de recevoir une bonne qualitĂ© d'une part et de rĂ©duire la transmission a trĂšs bas dĂ©bit d'autre part. Dans Ie but de combiner les deux exigences quasi contradictoires, il est nĂ©cessaire d'utiliser une technique de compensation de mouvement qui donne, comme transformation, de bonnes caractĂ©ristiques subjectives et requiert uniquement, pour la transmission, l'information de mouvement. Ce mĂ©moire propose une technique de compensation de mouvement en concevant des mailles 2D triangulaires a partir d'une segmentation de l'image. La dĂ©composition des mailles est construite a partir des nƓuds repartis irrĂ©guliĂšrement Ie long des contours dans l'image. La dĂ©composition rĂ©sultant est ainsi basĂ©e sur Ie contenu de l'image. De plus, Ă©tant donnĂ© la mĂȘme mĂ©thode de sĂ©lection des nƓuds appliquĂ©e Ă  l'encodage et au dĂ©codage, la seule information requise est leurs vecteurs de mouvement et un trĂšs bas dĂ©bit de transmission peut ainsi ĂȘtre rĂ©alise. Notre approche, comparĂ©e avec BMA, amĂ©liore Ă  la fois la qualitĂ© subjective et objective avec beaucoup moins d'informations de mouvement. Dans la premier chapitre, une introduction au projet sera prĂ©sentĂ©e. Dans Ie deuxiĂšme chapitre, on analysera quelques techniques de compression dans les codec standard et, surtout, la populaire BMA et ses dĂ©fauts. Dans Ie troisiĂšme chapitre, notre algorithme propose et appelĂ© la conception active des mailles a base de segmentation, sera discute en dĂ©tail. Ensuite, les estimation et compensation de mouvement seront dĂ©crites dans Ie chapitre 4. Finalement, au chapitre 5, les rĂ©sultats de simulation et la conclusion seront prĂ©sentĂ©s.Abstract: In most video compression standards today, the generally accepted method for temporal prediction is motion compensation using block matching algorithm (BMA). BMA represents the scene content evolution with 2-D rigid translational moving blocks. This kind of predictive scheme usually leads to distortions such as block artefacts especially when the motion is important. The two most important aims in video coding are to receive a good quality on one hand and a low bit-rate on the other. This thesis proposes a motion compensation scheme using segmentation-based 2-D triangular mesh design method. The mesh is constructed by irregularly spread nodal points selected along image contour. Based on this, the generated mesh is, to a great extent, image content based. Moreover, the nodes are selected with the same method on the encoder and decoder sides, so that the only information that has to be transmitted are their motion vectors, and thus very low bit-rate can be achieved. Compared with BMA, our approach could improve subjective and objective quality with much less motion information."--RĂ©sumĂ© abrĂ©gĂ© par UM

    Semi-automatic video object segmentation for multimedia applications

    Get PDF
    A semi-automatic video object segmentation tool is presented for segmenting both still pictures and image sequences. The approach comprises both automatic segmentation algorithms and manual user interaction. The still image segmentation component is comprised of a conventional spatial segmentation algorithm (Recursive Shortest Spanning Tree (RSST)), a hierarchical segmentation representation method (Binary Partition Tree (BPT)), and user interaction. An initial segmentation partition of homogeneous regions is created using RSST. The BPT technique is then used to merge these regions and hierarchically represent the segmentation in a binary tree. The semantic objects are then manually built by selectively clicking on image regions. A video object-tracking component enables image sequence segmentation, and this subsystem is based on motion estimation, spatial segmentation, object projection, region classification, and user interaction. The motion between the previous frame and the current frame is estimated, and the previous object is then projected onto the current partition. A region classification technique is used to determine which regions in the current partition belong to the projected object. User interaction is allowed for object re-initialisation when the segmentation results become inaccurate. The combination of all these components enables offline video sequence segmentation. The results presented on standard test sequences illustrate the potential use of this system for object-based coding and representation of multimedia

    Motion compensation and very low bit rate video coding

    Get PDF
    Recently, many activities of the International Telecommunication Union (ITU) and the International Standard Organization (ISO) are leading to define new standards for very low bit-rate video coding, such as H.263 and MPEG-4 after successful applications of the international standards H.261 and MPEG-1/2 for video coding above 64kbps. However, at very low bit-rate the classic block matching based DCT video coding scheme suffers seriously from blocking artifacts which degrade the quality of reconstructed video frames considerably. To solve this problem, a new technique in which motion compensation is based on dense motion field is presented in this dissertation. Four efficient new video coding algorithms based on this new technique for very low bit-rate are proposed. (1) After studying model-based video coding algorithms, we propose an optical flow based video coding algorithm with thresh-olding techniques. A statistic model is established for distribution of intensity difference between two successive frames, and four thresholds are used to control the bit-rate and the quality of reconstructed frames. It outperforms the typical model-based techniques in terms of complexity and quality of reconstructed frames. (2) An efficient algorithm using DCT coded optical flow. It is found that dense motion fields can be modeled as the first order auto-regressive model, and efficiently compressed with DCT technique, hence achieving very low bit-rate and higher visual quality than the H.263/TMN5. (3) A region-based discrete wavelet transform video coding algorithm. This algorithm implements dense motion field and regions are segmented according to their content significance. The DWT is applied to residual images region by region, and bits are adaptively allocated to regions. It improves the visual quality and PSNR of significant regions while maintaining low bit-rate. (4) A segmentation-based video coding algorithm for stereo sequence. A correlation-feedback algorithm with Kalman filter is utilized to improve the accuracy of optical flow fields. Three criteria, which are associated with 3-D information, 2-D connectivity and motion vector fields, respectively, are defined for object segmentation. A chain code is utilized to code the shapes of the segmented objects. it can achieve very high compression ratio up to several thousands

    MPEG-4 Software Video Encoding

    Get PDF
    A Thesis submitted in fulfillment of the requirements of the degree of doctor of Philosophy in the University of LondonThis thesis presents a software model that allows a parallel decomposition of the MPEG-4 video encoder onto shared memory architectures, in order to reduce its total video encoding time. Since a video sequence consists of video objects each of which is likely to have different encoding requirements, the model incorporates a scheduler which (a) always selects the most appropriate video object for encoding and, (b) employs a mechanism for dynamically allocating video objects allocation onto the system processors, based on video object size information. Further spatial video object parallelism is exploited by applying the single program multiple data (SPMD) paradigm within the different modules of the MPEG-4 video encoder. Due to the fact that not all macroblocks have the same processing requirements, the model also introduces a data partition scheme that generates tiles with identical processing requirements. Since, macroblock data dependencies preclude data parallelism at the shape encoder the model also introduces a new mechanism that allows parallelism using a circular pipeline macroblock technique The encoding time depends partly on an encoder’s computational complexity. This thesis also addresses the problem of the motion estimation, as its complexity has a significant impact on the encoder’s complexity. In particular, two fast motion estimation algorithms have been developed for the model which reduce the computational complexity significantly. The thesis includes experimental results on a four processor shared memory platform, Origin200

    Depth-based Multi-View 3D Video Coding

    Get PDF
    • 

    corecore