Search CORE

151 research outputs found

Neural View-Interpolation for Sparse Light Field Video

Author: Bemana M.
Myszkowski K.
Ritschel T.
Seidel H.
Publication venue
Publication date: 01/01/2019
Field of study

We suggest representing light field (LF) videos as "one-off" neural networks (NN), i.e., a learned mapping from view-plus-time coordinates to high-resolution color values, trained on sparse views. Initially, this sounds like a bad idea for three main reasons: First, a NN LF will likely have less quality than a same-sized pixel basis representation. Second, only few training data, e.g., 9 exemplars per frame are available for sparse LF videos. Third, there is no generalization across LFs, but across view and time instead. Consequently, a network needs to be trained for each LF video. Surprisingly, these problems can turn into substantial advantages: Other than the linear pixel basis, a NN has to come up with a compact, non-linear i.e., more intelligent, explanation of color, conditioned on the sparse view and time coordinates. As observed for many NN however, this representation now is interpolatable: if the image output for sparse view coordinates is plausible, it is for all intermediate, continuous coordinates as well. Our specific network architecture involves a differentiable occlusion-aware warping step, which leads to a compact set of trainable parameters and consequently fast learning and fast execution

MPG.PuRe

Large-Scale Light Field Capture and Reconstruction

Author: Gao Yuan
Publication venue: Universitatsbibliothek Kiel
Publication date: 01/01/2020
Field of study

This thesis discusses approaches and techniques to convert Sparsely-Sampled Light Fields (SSLFs) into Densely-Sampled Light Fields (DSLFs), which can be used for visualization on 3DTV and Virtual Reality (VR) devices. Exemplarily, a movable 1D large-scale light field acquisition system for capturing SSLFs in real-world environments is evaluated. This system consists of 24 sparsely placed RGB cameras and two Kinect V2 sensors. The real-world SSLF data captured with this setup can be leveraged to reconstruct real-world DSLFs. To this end, three challenging problems require to be solved for this system: (i) how to estimate the rigid transformation from the coordinate system of a Kinect V2 to the coordinate system of an RGB camera; (ii) how to register the two Kinect V2 sensors with a large displacement; (iii) how to reconstruct a DSLF from a SSLF with moderate and large disparity ranges. To overcome these three challenges, we propose: (i) a novel self-calibration method, which takes advantage of the geometric constraints from the scene and the cameras, for estimating the rigid transformations from the camera coordinate frame of one Kinect V2 to the camera coordinate frames of 12-nearest RGB cameras; (ii) a novel coarse-to-fine approach for recovering the rigid transformation from the coordinate system of one Kinect to the coordinate system of the other by means of local color and geometry information; (iii) several novel algorithms that can be categorized into two groups for reconstructing a DSLF from an input SSLF, including novel view synthesis methods, which are inspired by the state-of-the-art video frame interpolation algorithms, and Epipolar-Plane Image (EPI) inpainting methods, which are inspired by the Shearlet Transform (ST)-based DSLF reconstruction approaches

MACAU: Open Access Repository of Kiel University

Light field reconstruction from multi-view images

Author: Han Kang
Publication venue
Publication date: 01/01/2022
Field of study

Kang Han studied recovering the 3D world from multi-view images. He proposed several algorithms to deal with occlusions in depth estimation and effective representations in view rendering. the proposed algorithms can be used for many innovative applications based on machine intelligence, such as autonomous driving and Metaverse

ResearchOnline at James Cook University

A benchmark of light field view interpolation methods

Author: Batz M
Keinert J
Khan Gul MS
Mantiuk R
Yue D
Publication venue: 2020 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2020
Publication date: 01/07/2020
Field of study

Light field view interpolation provides a solution that reduces the prohibitive size of a dense light field. This paper examines state-ofthe-art light field view interpolation methods with a comprehensive benchmark on challenging scenarios specific for interpolation tasks. Each method is analyzed in terms of their strengths and weaknesses in handling different challenges. We find that large disparities in a scene are the main source of challenge for the light field view interpolation methods. We also find that a basic backward warping based on the depth estimation from optical flow provides comparable performance against usually complex learning-based methods

Crossref

Apollo (Cambridge)

Learning-based Spatial and Angular Information Separation for Light Field Compression

Author: Guillemot Christine
Shi Jinglei
Xu Yihong
Publication venue
Publication date: 13/04/2023
Field of study

Light fields are a type of image data that capture both spatial and angular scene information by recording light rays emitted by a scene from different orientations. In this context, spatial information is defined as features that remain static regardless of perspectives, while angular information refers to features that vary between viewpoints. We propose a novel neural network that, by design, can separate angular and spatial information of a light field. The network represents spatial information using spatial kernels shared among all Sub-Aperture Images (SAIs), and angular information using sets of angular kernels for each SAI. To further improve the representation capability of the network without increasing parameter number, we also introduce angular kernel allocation and kernel tensor decomposition mechanisms. Extensive experiments demonstrate the benefits of information separation: when applied to the compression task, our network outperforms other state-of-the-art methods by a large margin. And angular information can be easily transferred to other scenes for rendering dense views, showing the successful separation and the potential use case for the view synthesis task. We plan to release the code upon acceptance of the paper to encourage further research on this topic

arXiv.org e-Print Archive

EffLiFe: Efficient Light Field Generation via Hierarchical Sparse Gradient Descent

Author: Deng Yijie
Fang Lu
Han Lei
Li Lin
Lin Tianpeng
Zhang Jinzhi
Publication venue
Publication date: 06/07/2023
Field of study

With the rise of Extended Reality (XR) technology, there is a growing need for real-time light field generation from sparse view inputs. Existing methods can be classified into offline techniques, which can generate high-quality novel views but at the cost of long inference/training time, and online methods, which either lack generalizability or produce unsatisfactory results. However, we have observed that the intrinsic sparse manifold of Multi-plane Images (MPI) enables a significant acceleration of light field generation while maintaining rendering quality. Based on this insight, we introduce EffLiFe, a novel light field optimization method, which leverages the proposed Hierarchical Sparse Gradient Descent (HSGD) to produce high-quality light fields from sparse view images in real time. Technically, the coarse MPI of a scene is first generated using a 3D CNN, and it is further sparsely optimized by focusing only on important MPI gradients in a few iterations. Nevertheless, relying solely on optimization can lead to artifacts at occlusion boundaries. Therefore, we propose an occlusion-aware iterative refinement module that removes visual artifacts in occluded regions by iteratively filtering the input. Extensive experiments demonstrate that our method achieves comparable visual quality while being 100x faster on average than state-of-the-art offline methods and delivering better performance (about 2 dB higher in PSNR) compared to other online approaches.Comment: Submitted to IEEE TPAM

arXiv.org e-Print Archive

Capture, Reconstruction, and Representation of the Visual Real World for Virtual Reality

Author: A Collet
A Dai
A Davis
A Dosovitskiy
A Meka
A Parra Pozo
A Serrano
B Luo
B Mildenhall
C Keysers
C Kim
C Lipski
C Schroers
C Weissig
D Lanman
E Penner
F Perazzi
F Prada
G Chaurasia
G Chaurasia
G Nam
G Wetzstein
G Wetzstein
GA Koulieris
H Hukkelås
H Ishiguro
H Kim
H Rhodin
HY Shum
J Lee
J Thies
J Ventura
J Zaragoza
JL Schönberger
K Yücer
M Mori
M Nießner
M Tatarchenko
M Zollhöfer
N Snavely
NK Kalantari
P Hedman
P Hedman
P Hedman
P Hedman
P Moulon
R Anderson
R Konrad
R Martin-Brualla
R Szeliski
R Szeliski
RS Overbeck
S Lombardi
S Niklaus
S Peleg
S Tulsiani
SE Wei
SMA Eslami
T Bertel
T Whelan
T Zhou
T Zhou
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 31/03/2020
Field of study

We provide an overview of the concerns, current practice, and limitations for capturing, reconstructing, and representing the real world visually within virtual reality. Given that our goals are to capture, transmit, and depict complex real-world phenomena to humans, these challenges cover the opto-electro-mechanical, computational, informational, and perceptual fields. Practically producing a system for real-world VR capture requires navigating a complex design space and pushing the state of the art in each of these areas. As such, we outline several promising directions for future work to improve the quality and flexibility of real-world VR capture systems

OPUS

Crossref