6 research outputs found
In-Network View Synthesis for Interactive Multiview Video Systems
To enable Interactive multiview video systems with a minimum view-switching
delay, multiple camera views are sent to the users, which are used as reference
images to synthesize additional virtual views via depth-image-based rendering.
In practice, bandwidth constraints may however restrict the number of reference
views sent to clients per time unit, which may in turn limit the quality of the
synthesized viewpoints. We argue that the reference view selection should
ideally be performed close to the users, and we study the problem of in-network
reference view synthesis such that the navigation quality is maximized at the
clients. We consider a distributed cloud network architecture where data stored
in a main cloud is delivered to end users with the help of cloudlets, i.e.,
resource-rich proxies close to the users. In order to satisfy last-hop
bandwidth constraints from the cloudlet to the users, a cloudlet re-samples
viewpoints of the 3D scene into a discrete set of views (combination of
received camera views and virtual views synthesized) to be used as reference
for the synthesis of additional virtual views at the client. This in-network
synthesis leads to better viewpoint sampling given a bandwidth constraint
compared to simple selection of camera views, but it may however carry a
distortion penalty in the cloudlet-synthesized reference views. We therefore
cast a new reference view selection problem where the best subset of views is
defined as the one minimizing the distortion over a view navigation window
defined by the user under some transmission bandwidth constraints. We show that
the view selection problem is NP-hard, and propose an effective polynomial time
algorithm using dynamic programming to solve the optimization problem.
Simulation results finally confirm the performance gain offered by virtual view
synthesis in the network
Graph-based representation for multiview image geometry
In this paper, we propose a new representation for multiview image sets. Our approach relies on graphs to describe geometry information in a compact and controllable way. The links of the graph connect pixels in different images and describe the proximity between pixels in the 3D space. These connections are dependent on the geometry of the scene and provide the right amount of information that is necessary for coding and reconstructing multiple views. This multiview image representation is very compact and adapts the transmitted geometry information as a function of the complexity of the prediction performed at the decoder side. To achieve this, our GBR adapts the accuracy of the geometry representation, in contrast with depth coding, which directly compresses with losses the original geometry signal. We present the principles of this graph-based representation (GBR) and we build a complete prototype coding scheme for multiview images. Experimental results demonstrate the potential of this new representation as compared to a depth-based approach. GBR can achieve a gain of 2 dB in reconstructed quality over depth-based schemes operating at similar rates
Multi-view video representation based on fast Monte Carlo surface reconstruction
This paper provides an alternative solution to the
costly representation of multi-view video data, which can be
used for both rendering and scene analyses. Initially, a new
efficient Monte Carlo discrete surface reconstruction method for
foreground objects with static background is presented, which
outperforms volumetric techniques and is suitable for GPU
environments. Some extensions are also presented, which allow a
speeding up of the reconstruction by exploiting multi-resolution
and temporal correlations. Then, a fast meshing algorithm is
applied, which allows interpolating a continuous surface from
the discrete reconstructed points. As shown by the experimental
results, the original video frames can be approximated with high
accuracy by projecting the reconstructed foreground objects onto
the original viewpoints. Furthermore, the reconstructed scene
can be easily projected onto any desired virtual viewpoint, thus
simplifying the design of free-viewpoint video applications. In our
experimental results, we show that our techniques for reconstruc-
tion and meshing compare favorably with the state-of-the-art, and
we also introduce a rule-of-thumb for effective application of the
method with a good quality versus representation cost trade-offPeer ReviewedPostprint (published version
Multi-view video representation based on fast Monte Carlo surface reconstruction
This paper provides an alternative solution to the
costly representation of multi-view video data, which can be
used for both rendering and scene analyses. Initially, a new
efficient Monte Carlo discrete surface reconstruction method for
foreground objects with static background is presented, which
outperforms volumetric techniques and is suitable for GPU
environments. Some extensions are also presented, which allow a
speeding up of the reconstruction by exploiting multi-resolution
and temporal correlations. Then, a fast meshing algorithm is
applied, which allows interpolating a continuous surface from
the discrete reconstructed points. As shown by the experimental
results, the original video frames can be approximated with high
accuracy by projecting the reconstructed foreground objects onto
the original viewpoints. Furthermore, the reconstructed scene
can be easily projected onto any desired virtual viewpoint, thus
simplifying the design of free-viewpoint video applications. In our
experimental results, we show that our techniques for reconstruc-
tion and meshing compare favorably with the state-of-the-art, and
we also introduce a rule-of-thumb for effective application of the
method with a good quality versus representation cost trade-offPeer Reviewe
IEEE Transactions On Image Processing : Vol. 22, No. 9 - 10, September - October 2013
1. Automatic View Synthesis by Image-Domain-Warping
2. Multi-View Video Representation Based on Fast Monte Carlo Surface Reconstruction
3. Stereo Matching and View Interpolation Based on Image Domain Triangulation
4. 3D High-Efficiency Video Coding for Multi-View Video and Depth Data
5. No-Reference Quality Assessment of Natural Stereopairs
Etc