Search CORE

2,054 research outputs found

Digital recording of performing arts: formats and conversion

Author: Barbarien Joeri
Coppens SamUGent000141139848802000264764977310338918F99AD20E-F0ED-11E1-A9DE-61C894A0A6B4
De Cock Jan001999328230801001861346F71F8358-F0ED-11E1-A9DE-61C894A0A6B4
Evens TomeditorPS010020011964888020000000360000-0002-7274-7432F83A851C-F0ED-11E1-A9DE-61C894A0A6B4
Jacobs Marc
Mannens ErikTW060019873040688010020739380000-0001-7946-4884F80364A6-F0ED-11E1-A9DE-61C894A0A6B4
Moreels DrieseditorCA208020010888600000-0002-5297-107437DCF06A-F0EE-11E1-A9DE-61C894A0A6B4
Notebaert StijnUGent801001861750002001630362053248469389F71FEFC8-F0ED-11E1-A9DE-61C894A0A6B4
Schelkens Peter
Van de Walle RikCA01CA05TW068010009619730000-0002-7491-5145F4CFF146-F0ED-11E1-A9DE-61C894A0A6B4
Publication venue: Vlaams Theater Instituut (VTI)
Publication date: 01/01/2009
Field of study

Ghent University Academic Bibliography

Archivsystem Ask23

How to Train Your Dragon: Tamed Warping Network for Semantic Video Segmentation

Author: Chen Yifeng
Cui Jiabao
Feng Junyi
Huang Fuxian
Li Songyuan
Li Xi
Publication venue
Publication date: 20/07/2020
Field of study

Real-time semantic segmentation on high-resolution videos is challenging due to the strict requirements of speed. Recent approaches have utilized the inter-frame continuity to reduce redundant computation by warping the feature maps across adjacent frames, greatly speeding up the inference phase. However, their accuracy drops significantly owing to the imprecise motion estimation and error accumulation. In this paper, we propose to introduce a simple and effective correction stage right after the warping stage to form a framework named Tamed Warping Network (TWNet), aiming to improve the accuracy and robustness of warping-based models. The experimental results on the Cityscapes dataset show that with the correction, the accuracy (mIoU) significantly increases from 67.3% to 71.6%, and the speed edges down from 65.5 FPS to 61.8 FPS. For non-rigid categories such as "human" and "object", the improvements of IoU are even higher than 18 percentage points

arXiv.org e-Print Archive

Video Compression from the Hardware Perspective

Author: Grzegorz Pastuszak
Publication venue: 'IntechOpen'
Publication date: 05/04/2012
Field of study

IntechOpen

Dynamic Body VSLAM with Semantic Constraints

Author: Chari Visesh
Krishna K. Madhava
Reddy N. Dinesh
Singhal Prateek
Publication venue
Publication date: 27/04/2015
Field of study

Image based reconstruction of urban environments is a challenging problem that deals with optimization of large number of variables, and has several sources of errors like the presence of dynamic objects. Since most large scale approaches make the assumption of observing static scenes, dynamic objects are relegated to the noise modeling section of such systems. This is an approach of convenience since the RANSAC based framework used to compute most multiview geometric quantities for static scenes naturally confine dynamic objects to the class of outlier measurements. However, reconstructing dynamic objects along with the static environment helps us get a complete picture of an urban environment. Such understanding can then be used for important robotic tasks like path planning for autonomous navigation, obstacle tracking and avoidance, and other areas. In this paper, we propose a system for robust SLAM that works in both static and dynamic environments. To overcome the challenge of dynamic objects in the scene, we propose a new model to incorporate semantic constraints into the reconstruction algorithm. While some of these constraints are based on multi-layered dense CRFs trained over appearance as well as motion cues, other proposed constraints can be expressed as additional terms in the bundle adjustment optimization process that does iterative refinement of 3D structure and camera / object motion trajectories. We show results on the challenging KITTI urban dataset for accuracy of motion segmentation and reconstruction of the trajectory and shape of moving objects relative to ground truth. We are able to show average relative error reduction by a significant amount for moving object trajectory reconstruction relative to state-of-the-art methods like VISO 2, as well as standard bundle adjustment algorithms

arXiv.org e-Print Archive

Crossref

Recent Progress in Image Deblurring

Author: Tao Dacheng
Wang Ruxin
Publication venue
Publication date: 24/09/2014
Field of study

This paper comprehensively reviews the recent development of image deblurring, including non-blind/blind, spatially invariant/variant deblurring techniques. Indeed, these techniques share the same objective of inferring a latent sharp image from one or several corresponding blurry images, while the blind deblurring techniques are also required to derive an accurate blur kernel. Considering the critical role of image restoration in modern imaging systems to provide high-quality images under complex environments such as motion, undesirable lighting conditions, and imperfect system components, image deblurring has attracted growing attention in recent years. From the viewpoint of how to handle the ill-posedness which is a crucial issue in deblurring tasks, existing methods can be grouped into five categories: Bayesian inference framework, variational methods, sparse representation-based methods, homography-based modeling, and region-based methods. In spite of achieving a certain level of development, image deblurring, especially the blind case, is limited in its success by complex application conditions which make the blur kernel hard to obtain and be spatially variant. We provide a holistic understanding and deep insight into image deblurring in this review. An analysis of the empirical evidence for representative methods, practical issues, as well as a discussion of promising future directions are also presented.Comment: 53 pages, 17 figure

arXiv.org e-Print Archive

CiteSeerX

Low energy HEVC and VVC video compression hardware

Author: Azgın Hasan
Publication venue
Publication date: 19/07/2019
Field of study

Video compression standards compress a digital video by reducing and removing redundancy in the digital video using computationally complex algorithms. As spatial and temporal resolutions of videos increase, compression efficiencies of video compression algorithms are also increasing. However, increased compression efficiency comes with increased computational complexity. Therefore, it is necessary to reduce computational complexities of video compression algorithms without reducing their visual quality in order to reduce area and energy consumption of their hardware implementations. In this thesis, we propose a novel technique for reducing amount of computations performed by HEVC intra prediction algorithm. We designed low energy, reconfigurable HEVC intra prediction hardware using the proposed technique. We also designed a low energy FPGA implementation of HEVC intra prediction algorithm using the proposed technique and DSP blocks. We propose a reconfigurable VVC intra prediction hardware architecture. We also propose an efficient VVC intra prediction hardware architecture using DSP blocks. We designed low energy VVC fractional interpolation hardware. We propose a novel approximate absolute difference technique. We designed low energy approximate absolute difference hardware using the proposed technique. We propose a novel approximate constant multiplication technique. We designed approximate constant multiplication hardware using the proposed technique. We quantified computation reductions achieved by the proposed techniques and video quality loss caused by the proposed approximation techniques. The proposed approximate absolute difference technique and approximate constant multiplication technique cause very small PSNR loss. The other proposed techniques cause no PSNR loss. We implemented the proposed hardware architectures in Verilog HDL. We mapped the Verilog RTL codes to Xilinx Virtex 6 or Xilinx Virtex 7 FPGAs and estimated their power consumptions using Xilinx XPower Analyzer tool. The proposed techniques significantly reduced power and energy consumptions of these FPGA implementation

Sabanci University Research Database

Detecting Biological Motion for Human-Robot Interaction: A Link between Perception and Action

Author: Alessandra Sciutti
Alessia Vignolo
Alessia Vignolo
Francesca Odone
Francesco Rea
Giulio Sandini
Nicoletta Noceti
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2017
Field of study

One of the fundamental skills supporting safe and comfortable interaction between humans is their capability to understand intuitively each other's actions and intentions. At the basis of this ability is a special-purpose visual processing that human brain has developed to comprehend human motion. Among the first "building blocks" enabling the bootstrapping of such visual processing is the ability to detect movements performed by biological agents in the scene, a skill mastered by human babies in the first days of their life. In this paper, we present a computational model based on the assumption that such visual ability must be based on local low-level visual motion features, which are independent of shape, such as the configuration of the body and perspective. Moreover, we implement it on the humanoid robot iCub, embedding it into a software architecture that leverages the regularities of biological motion also to control robot attention and oculomotor behaviors. In essence, we put forth a model in which the regularities of biological motion link perception and action enabling a robotic agent to follow a human-inspired sensory-motor behavior. We posit that this choice facilitates mutual understanding and goal prediction during collaboration, increasing the pleasantness and safety of the interactio

Crossref

Directory of Open Access Journals

Frontiers - Publisher Connector

Archivio istituzionale della ricerca - Università di Genova

Intra Coding Strategy for Video Error Resiliency: Behavioral Analysis

Author: Ghanbari Mohammed
Kazemi Mohammad
Shirmohammadi Shervin
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/09/2020
Field of study

One challenge in video transmission is to deal with packet loss. Since the compressed video streams are sensitive to data loss, the error resiliency of the encoded video becomes important. When video data is lost and retransmission is not possible, the missed data should be concealed. But loss concealment causes distortion in the lossy frame which also propagates into the next frames even if their data are received correctly. One promising solution to mitigate this error propagation is intra coding. There are three approaches for intra coding: intra coding of a number of blocks selected randomly or regularly, intra coding of some specific blocks selected by an appropriate cost function, or intra coding of a whole frame. But Intra coding reduces the compression ratio; therefore, there exists a trade-off between bitrate and error resiliency achieved by intra coding. In this paper, we study and show the best strategy for getting the best rate-distortion performance. Considering the error propagation, an objective function is formulated, and with some approximations, this objective function is simplified and solved. The solution demonstrates that periodical I-frame coding is preferred over coding only a number of blocks as intra mode in P-frames. Through examination of various test sequences, it is shown that the best intra frame period depends on the coding bitrate as well as the packet loss rate. We then propose a scheme to estimate this period from curve fitting of the experimental results, and show that our proposed scheme outperforms other methods of intra coding especially for higher loss rates and coding bitrates

University of Essex Research Repository

Crossref