Search CORE

161 research outputs found

Multi-View Video Packet Scheduling

Author: Frossard Pascal
Maugey Thomas
Toni Laura
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2012
Field of study

In multiview applications, multiple cameras acquire the same scene from different viewpoints and generally produce correlated video streams. This results in large amounts of highly redundant data. In order to save resources, it is critical to handle properly this correlation during encoding and transmission of the multiview data. In this work, we propose a correlation-aware packet scheduling algorithm for multi-camera networks, where information from all cameras are transmitted over a bottleneck channel to clients that reconstruct the multiview images. The scheduling algorithm relies on a new rate-distortion model that captures the importance of each view in the scene reconstruction. We propose a problem formulation for the optimization of the packet scheduling policies, which adapt to variations in the scene content. Then, we design a low complexity scheduling algorithm based on a trellis search that selects the subset of candidate packets to be transmitted towards effective multiview reconstruction at clients. Extensive simulation results confirm the gain of our scheduling algorithm when inter-source correlation information is used in the scheduler, compared to scheduling policies with no information about the correlation or non-adaptive scheduling policies. We finally show that increasing the optimization horizon in the packet scheduling algorithm improves the transmission performance, especially in scenarios where the level of correlation rapidly varies with time

arXiv.org e-Print Archive

Infoscience - École polytechnique fédérale de Lausanne

UCL Discovery

Optimized Data Representation for Interactive Multiview Navigation

Author: Frossard Pascal
Ma Rui
Maugey Thomas
Publication venue
Publication date: 21/09/2017
Field of study

In contrary to traditional media streaming services where a unique media content is delivered to different users, interactive multiview navigation applications enable users to choose their own viewpoints and freely navigate in a 3-D scene. The interactivity brings new challenges in addition to the classical rate-distortion trade-off, which considers only the compression performance and viewing quality. On the one hand, interactivity necessitates sufficient viewpoints for richer navigation; on the other hand, it requires to provide low bandwidth and delay costs for smooth navigation during view transitions. In this paper, we formally describe the novel trade-offs posed by the navigation interactivity and classical rate-distortion criterion. Based on an original formulation, we look for the optimal design of the data representation by introducing novel rate and distortion models and practical solving algorithms. Experiments show that the proposed data representation method outperforms the baseline solution by providing lower resource consumptions and higher visual quality in all navigation configurations, which certainly confirms the potential of the proposed data representation in practical interactive navigation systems

arXiv.org e-Print Archive

Crossref

INRIA a CCSD electronic archive server

HAL-Rennes 1

Rate-Distortion Analysis of Multiview Coding in a DIBR Framework

Author: Frossard Pascal
Maugey Thomas
Pourreza Hamid-Reza
Rajaei Boshra
Publication venue
Publication date: 17/10/2012
Field of study

Depth image based rendering techniques for multiview applications have been recently introduced for efficient view generation at arbitrary camera positions. Encoding rate control has thus to consider both texture and depth data. Due to different structures of depth and texture images and their different roles on the rendered views, distributing the available bit budget between them however requires a careful analysis. Information loss due to texture coding affects the value of pixels in synthesized views while errors in depth information lead to shift in objects or unexpected patterns at their boundaries. In this paper, we address the problem of efficient bit allocation between textures and depth data of multiview video sequences. We adopt a rate-distortion framework based on a simplified model of depth and texture images. Our model preserves the main features of depth and texture images. Unlike most recent solutions, our method permits to avoid rendering at encoding time for distortion estimation so that the encoding complexity is not augmented. In addition to this, our model is independent of the underlying inpainting method that is used at decoder. Experiments confirm our theoretical results and the efficiency of our rate allocation strategy

arXiv.org e-Print Archive

Infoscience - École polytechnique fédérale de Lausanne

Navigation domain representation for interactive multiview imaging

Author: Gene Cheung
Ismael Daribo
Pascal Frossard
Senior Member
Senior Member
Thomas Maugey
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 17/06/2013
Field of study

Enabling users to interactively navigate through different viewpoints of a static scene is a new interesting functionality in 3D streaming systems. While it opens exciting perspectives towards rich multimedia applications, it requires the design of novel representations and coding techniques in order to solve the new challenges imposed by interactive navigation. Interactivity clearly brings new design constraints: the encoder is unaware of the exact decoding process, while the decoder has to reconstruct information from incomplete subsets of data since the server can generally not transmit images for all possible viewpoints due to resource constrains. In this paper, we propose a novel multiview data representation that permits to satisfy bandwidth and storage constraints in an interactive multiview streaming system. In particular, we partition the multiview navigation domain into segments, each of which is described by a reference image and some auxiliary information. The auxiliary information enables the client to recreate any viewpoint in the navigation segment via view synthesis. The decoder is then able to navigate freely in the segment without further data request to the server; it requests additional data only when it moves to a different segment. We discuss the benefits of this novel representation in interactive navigation systems and further propose a method to optimize the partitioning of the navigation domain into independent segments, under bandwidth and storage constraints. Experimental results confirm the potential of the proposed representation; namely, our system leads to similar compression performance as classical inter-view coding, while it provides the high level of flexibility that is required for interactive streaming. Hence, our new framework represents a promising solution for 3D data representation in novel interactive multimedia services

arXiv.org e-Print Archive

CiteSeerX

Optimized Packet Scheduling in Multiview Video Navigation Systems

Author: Frossard Pascal
Maugey Thomas
Toni Laura
Publication venue
Publication date: 02/12/2014
Field of study

In multiview video systems, multiple cameras generally acquire the same scene from different perspectives, such that users have the possibility to select their preferred viewpoint. This results in large amounts of highly redundant data, which needs to be properly handled during encoding and transmission over resource-constrained channels. In this work, we study coding and transmission strategies in multicamera systems, where correlated sources send data through a bottleneck channel to a central server, which eventually transmits views to different interactive users. We propose a dynamic correlation-aware packet scheduling optimization under delay, bandwidth, and interactivity constraints. The optimization relies both on a novel rate-distortion model, which captures the importance of each view in the 3D scene reconstruction, and on an objective function that optimizes resources based on a client navigation model. The latter takes into account the distortion experienced by interactive clients as well as the distortion variations that might be observed by clients during multiview navigation. We solve the scheduling problem with a novel trellis-based solution, which permits to formally decompose the multivariate optimization problem thereby significantly reducing the computation complexity. Simulation results show the gain of the proposed algorithm compared to baseline scheduling policies. More in details, we show the gain offered by our dynamic scheduling policy compared to static camera allocation strategies and to schemes with constant coding strategies. Finally, we show that the best scheduling policy consistently adapts to the most likely user navigation path and that it minimizes distortion variations that can be very disturbing for users in traditional navigation systems

arXiv.org e-Print Archive

Infoscience - École polytechnique fédérale de Lausanne

HAL-CentraleSupelec

Crossref

INRIA a CCSD electronic archive server

UCL Discovery

HAL-Rennes 1

Visual data compression: beyond conventional approaches

Author: Maugey Thomas
Publication venue: HAL CCSD
Publication date: 27/06/2022
Field of study

INRIA a CCSD electronic archive server

Dense Disparity Estimation in a Multi-view Distributed Video Coding System

Author: Maugey Thomas
Miled Wided
Publication venue
Publication date: 27/12/2010
Field of study

Distributed video coding (DVC) is a recent paradigm which aims at transferring part of the coding complexity from the encoder to the decoder. The performance of such a coding scheme strongly depends on the capacity to estimate correlation at the decoder and, consequently, on the side information quality. In this paper we consider a multi-view DVC framework and propose a very efficient dense dis- parity estimation technique for side information construction, based on a variational formulation. The simulation results show that our approach clearly outperforms the existing methods for inter-view side-information generation

Infoscience - École polytechnique fédérale de Lausanne

Crossref

Codage vidéo multi-vue pour une vision interactive au récepteur

Author: Frossard Pascal
Maugey Thomas
Publication venue
Publication date: 27/01/2012
Field of study

Dans cet article, nous proposons un nouveau système de codage de vidéo multi-vues, dont la spécificité est de proposer à l’utilisateur une interactivité lui permettant de changer de vue en temps réel. Contrairement aux schémas de codages existant dans la littérature, notre approche tire son originalité du fait qu’elle permet un décodage à faible charge de calcul, au prix d’un ajout en débit qui s’avère être raisonnable dans les expériences présentées. Afin de réduire encore plus ce coût induit par la possibilité d’interactivité, nous proposons également d’optimiser les performances de transmission en se fondant sur une modélisation du comportement de l’utilisateur

Infoscience - École polytechnique fédérale de Lausanne

Interactive Multiview Video System With Low Complexity 2D Look Around at Decoder

Author: Frossard Pascal
Maugey Thomas
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 11/01/2012
Field of study

Multiview video with interactive and smooth view switching at the receiver is a challenging application with several issues in terms of effective use of storage and bandwidth resources, reactivity of the system, quality of the viewing experience and system complexity. The classical decoding system for generating virtual views first projects a reference or encoded frame to a given viewpoint and then fills in the holes due to potential occlusions. This last step still constitutes a complex operation with specific software or hardware at the receiver and requires a certain quantity of information from the neighboring frames for insuring consistency between the virtual images. In this work we propose a new approach that shifts most of the burden due to interactivity from the decoder to the encoder, by anticipating the navigation of the decoder and sending auxiliary information that guarantees temporal and interview consistency. This leads to an additional cost in terms of transmission rate and storage, which we minimize by using optimization techniques based on the user behavior modeling. We show by experiments that the proposed system represents a valid solution for interactive multiview systems with classical decoders

Infoscience - École polytechnique fédérale de Lausanne