Search CORE

2,054 research outputs found

High-Level Synthesis Based VLSI Architectures for Video Coding

Author: Ahmad Waqar
Publication venue: Politecnico di Torino
Publication date: 01/01/2017
Field of study

High Efficiency Video Coding (HEVC) is state-of-the-art video coding standard. Emerging applications like free-viewpoint video, 360degree video, augmented reality, 3D movies etc. require standardized extensions of HEVC. The standardized extensions of HEVC include HEVC Scalable Video Coding (SHVC), HEVC Multiview Video Coding (MV-HEVC), MV-HEVC+ Depth (3D-HEVC) and HEVC Screen Content Coding. 3D-HEVC is used for applications like view synthesis generation, free-viewpoint video. Coding and transmission of depth maps in 3D-HEVC is used for the virtual view synthesis by the algorithms like Depth Image Based Rendering (DIBR). As first step, we performed the profiling of the 3D-HEVC standard. Computational intensive parts of the standard are identified for the efficient hardware implementation. One of the computational intensive part of the 3D-HEVC, HEVC and H.264/AVC is the Interpolation Filtering used for Fractional Motion Estimation (FME). The hardware implementation of the interpolation filtering is carried out using High-Level Synthesis (HLS) tools. Xilinx Vivado Design Suite is used for the HLS implementation of the interpolation filters of HEVC and H.264/AVC. The complexity of the digital systems is greatly increased. High-Level Synthesis is the methodology which offers great benefits such as late architectural or functional changes without time consuming in rewriting of RTL-code, algorithms can be tested and evaluated early in the design cycle and development of accurate models against which the final hardware can be verified

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

PORTO Publications Open Repository TOrino

MASCOT : metadata for advanced scalable video coding tools : final report

Author: Bernard C.
Domanski M.
Heijmans H.J.A.M. (Henk)
Pesquet-Popescu B.
Schelkens P.
Smolic A.
Torres L.
Publication venue: CWI
Publication date: 01/01/2003
Field of study

The goal of the MASCOT project was to develop new video coding schemes and tools that provide both an increased coding efficiency as well as extended scalability features compared to technology that was available at the beginning of the project. Towards that goal the following tools would be used: - metadata-based coding tools; - new spatiotemporal decompositions; - new prediction schemes. Although the initial goal was to develop one single codec architecture that was able to combine all new coding tools that were foreseen when the project was formulated, it became clear that this would limit the selection of the new tools. Therefore the consortium decided to develop two codec frameworks within the project, a standard hybrid DCT-based codec and a 3D wavelet-based codec, which together are able to accommodate all tools developed during the course of the project

CWI's Institutional Repository

New visual coding exploration in MPEG: Super-MultiView and free navigation in free viewpoint TV

Author: Carballeira López Pablo
Ceulemans Beerend
Domanski Marek
García Lobo Sergio Carlos
Goorts Patrik
Grajek Tomasz
Jorissen Lode
Jung Joël
Kovács Péter Tamás
Lafruit Gauthier
Monteanu Adrian
Senoh Takanori
Tanimoto Masayuki
Wegner Krzysztof
Publication venue: 'Society for Imaging Science & Technology'
Publication date: 01/02/2016
Field of study

ISO/IEC MPEG and ITU-T VCEG have recently jointly issued a new multiview video compression standard, called 3D-HEVC, which reaches unpreceded compression performances for linear,dense camera arrangements. In view of supporting future highquality,auto-stereoscopic 3D displays and Free Navigation virtual/augmented reality applications with sparse, arbitrarily arranged camera setups, innovative depth estimation and virtual view synthesis techniques with global optimizations over all camera views should be developed. Preliminary studies in response to the MPEG-FTV (Free viewpoint TV) Call for Evidence suggest these targets are within reach, with at least 6% bitrate gains over 3DHEVC technology

DI-fusion

Archivo Digital UPM

Depth-based Multi-View 3D Video Coding

Author: Zamarin Marco
Publication venue: Technical University of Denmark
Publication date: 01/01/2013
Field of study

Online Research Database In Technology

Cross-layer Optimized Wireless Video Surveillance

Author: Ye Yun
Publication venue: DigitalCommons@University of Nebraska - Lincoln
Publication date: 31/07/2013
Field of study

A wireless video surveillance system contains three major components, the video capture and preprocessing, the video compression and transmission over wireless sensor networks (WSNs), and the video analysis at the receiving end. The coordination of different components is important for improving the end-to-end video quality, especially under the communication resource constraint. Cross-layer control proves to be an efficient measure for optimal system configuration. In this dissertation, we address the problem of implementing cross-layer optimization in the wireless video surveillance system. The thesis work is based on three research projects. In the first project, a single PTU (pan-tilt-unit) camera is used for video object tracking. The problem studied is how to improve the quality of the received video by jointly considering the coding and transmission process. The cross-layer controller determines the optimal coding and transmission parameters, according to the dynamic channel condition and the transmission delay. Multiple error concealment strategies are developed utilizing the special property of the PTU camera motion. In the second project, the binocular PTU camera is adopted for video object tracking. The presented work studied the fast disparity estimation algorithm and the 3D video transcoding over the WSN for real-time applications. The disparity/depth information is estimated in a coarse-to-fine manner using both local and global methods. The transcoding is coordinated by the cross-layer controller based on the channel condition and the data rate constraint, in order to achieve the best view synthesis quality. The third project is applied for multi-camera motion capture in remote healthcare monitoring. The challenge is the resource allocation for multiple video sequences. The presented cross-layer design incorporates the delay sensitive, content-aware video coding and transmission, and the adaptive video coding and transmission to ensure the optimal and balanced quality for the multi-view videos. In these projects, interdisciplinary study is conducted to synergize the surveillance system under the cross-layer optimization framework. Experimental results demonstrate the efficiency of the proposed schemes. The challenges of cross-layer design in existing wireless video surveillance systems are also analyzed to enlighten the future work. Adviser: Song C

DigitalCommons@University of Nebraska

Cross-layer Optimized Wireless Video Surveillance

Author: Ye Yun
Publication venue: DigitalCommons@University of Nebraska - Lincoln
Publication date: 01/01/2013
Field of study

DigitalCommons@University of Nebraska

Livrable D5.2 of the PERSEE project : 2D/3D Codec architecture

Author: Bosc Emilie
Cagnazzo Marco
Gautier Josselin
Guillemot Christine
Guillo Laurent
Le Callet Patrick
Le Meur Olivier
Morin Luce
Pesquet-Popescu Beatrice
Ricordel Vincent
Wang Junle
Publication venue: HAL CCSD
Publication date: 01/05/2011
Field of study

Livrable D5.2 du projet ANR PERSEECe rapport a été réalisé dans le cadre du projet ANR PERSEE (n° ANR-09-BLAN-0170). Exactement il correspond au livrable D5.2 du projet. Son titre : 2D/3D Codec architectur

HAL-CentraleSupelec

INRIA a CCSD electronic archive server

HAL-Rennes 1

FVV Live: A real-time free-viewpoint video system with consumer electronics hardware

Author: Arnaldo Sergio
Berjón Daniel
Cabrera Julián
Carballeira Pablo
Carmona Carlos
Corregidor Daniel
Doblado Carmen
Díaz César
García Narciso
Martín María del Mar
Morán Francisco
Publication venue
Publication date: 01/07/2020
Field of study

FVV Live is a novel end-to-end free-viewpoint video system, designed for low cost and real-time operation, based on off-the-shelf components. The system has been designed to yield high-quality free-viewpoint video using consumer-grade cameras and hardware, which enables low deployment costs and easy installation for immersive event-broadcasting or videoconferencing. The paper describes the architecture of the system, including acquisition and encoding of multiview plus depth data in several capture servers and virtual view synthesis on an edge server. All the blocks of the system have been designed to overcome the limitations imposed by hardware and network, which impact directly on the accuracy of depth data and thus on the quality of virtual view synthesis. The design of FVV Live allows for an arbitrary number of cameras and capture servers, and the results presented in this paper correspond to an implementation with nine stereo-based depth cameras. FVV Live presents low motion-to-photon and end-to-end delays, which enables seamless free-viewpoint navigation and bilateral immersive communications. Moreover, the visual quality of FVV Live has been assessed through subjective assessment with satisfactory results, and additional comparative tests show that it is preferred over state-of-the-art DIBR alternatives

arXiv.org e-Print Archive