Search CORE

9,870 research outputs found

Learning to Predict Image-based Rendering Artifacts with Respect to a Hidden Reference Image

Author: Bemana Mojtaba
Bätz Michel
Keinert Joachim
Myszkowski Karol
Ritschel Tobias
Seidel Hans-Peter
Ziegler Matthias
Publication venue
Publication date: 01/01/2019
Field of study

Image metrics predict the perceived per-pixel difference between a reference image and its degraded (e. g., re-rendered) version. In several important applications, the reference image is not available and image metrics cannot be applied. We devise a neural network architecture and training procedure that allows predicting the MSE, SSIM or VGG16 image difference from the distorted image alone while the reference is not observed. This is enabled by two insights: The first is to inject sufficiently many un-distorted natural image patches, which can be found in arbitrary amounts and are known to have no perceivable difference to themselves. This avoids false positives. The second is to balance the learning, where it is carefully made sure that all image errors are equally likely, avoiding false negatives. Surprisingly, we observe, that the resulting no-reference metric, subjectively, can even perform better than the reference-based one, as it had to become robust against mis-alignments. We evaluate the effectiveness of our approach in an image-based rendering context, both quantitatively and qualitatively. Finally, we demonstrate two applications which reduce light field capture time and provide guidance for interactive depth adjustment.Comment: 13 pages, 11 figure

arXiv.org e-Print Archive

MPG.PuRe

Context-aware Synthesis for Video Frame Interpolation

Author: Liu Feng
Niklaus Simon
Publication venue
Publication date: 29/03/2018
Field of study

Video frame interpolation algorithms typically estimate optical flow or its variations and then use it to guide the synthesis of an intermediate frame between two consecutive original frames. To handle challenges like occlusion, bidirectional flow between the two input frames is often estimated and used to warp and blend the input frames. However, how to effectively blend the two warped frames still remains a challenging problem. This paper presents a context-aware synthesis approach that warps not only the input frames but also their pixel-wise contextual information and uses them to interpolate a high-quality intermediate frame. Specifically, we first use a pre-trained neural network to extract per-pixel contextual information for input frames. We then employ a state-of-the-art optical flow algorithm to estimate bidirectional flow between them and pre-warp both input frames and their context maps. Finally, unlike common approaches that blend the pre-warped frames, our method feeds them and their context maps to a video frame synthesis neural network to produce the interpolated frame in a context-aware fashion. Our neural network is fully convolutional and is trained end to end. Our experiments show that our method can handle challenging scenarios such as occlusion and large motion and outperforms representative state-of-the-art approaches.Comment: CVPR 2018, http://graphics.cs.pdx.edu/project/ctxsy

arXiv.org e-Print Archive

Crossref

PDXScholar (Portland State University)

MusA: Using Indoor Positioning and Navigation to Enhance Cultural Experiences in a museum

Author: Alciatore
Andrea Bottino
Andrea Martina
Baharuddin
Bellotti
Bihler
Bitgood
Bitgood
Bruno
Chen
Chen
Csikszentmihalyi
Dean
Douglas
Emmanouilidis
Falk
Faugeras
Fischler
Ghiani
Ghiani
Giovanni Malnati
Guillemaut
Hausmann
Hausmann
Hsu
Huang
Irene Rubino
Iurgel
Jetmir Xhembulla
Kang
Kenteris
Maybank
Mulloni
Packer
Proctor
Rounds
Russo
Ruíz
Schweighofer
Serrell
Stock
Traum
Tsai
Veron
Wang
Yanying
Zhang
Zhang
Publication venue: MDPI
Publication date: 01/01/2013
Field of study

In recent years there has been a growing interest into the use of multimedia mobile guides in museum environments. Mobile devices have the capabilities to detect the user context and to provide pieces of information suitable to help visitors discovering and following the logical and emotional connections that develop during the visit. In this scenario, location based services (LBS) currently represent an asset, and the choice of the technology to determine users' position, combined with the definition of methods that can effectively convey information, become key issues in the design process. In this work, we present MusA (Museum Assistant), a general framework for the development of multimedia interactive guides for mobile devices. Its main feature is a vision-based indoor positioning system that allows the provision of several LBS, from way-finding to the contextualized communication of cultural contents, aimed at providing a meaningful exploration of exhibits according to visitors' personal interest and curiosity. Starting from the thorough description of the system architecture, the article presents the implementation of two mobile guides, developed to respectively address adults and children, and discusses the evaluation of the user experience and the visitors' appreciation of these application

Multidisciplinary Digital Publishing Institute

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

PORTO Publications Open Repository TOrino