95 research outputs found
Novel Methods and Algorithms for Presenting 3D Scenes
In recent years, improvements in the acquisition and creation of 3D models gave rise to
an increasing availability of 3D content and to a widening of the audience such content
is created for, which brought into focus the need for effective ways to visualize and
interact with it.
Until recently, the task of virtual inspection of a 3D object or navigation inside a 3D
scene was carried out by using human machine interaction (HMI) metaphors controlled
through mouse and keyboard events.
However, this interaction approach may be cumbersome for the general audience.
Furthermore, the inception and spread of touch-based mobile devices, such as smartphones
and tablets, redefined the interaction problem entirely, since neither mouse nor
keyboards are available anymore. The problem is made even worse by the fact that these
devices are typically lower power if compared to desktop machines, while high-quality
rendering is a computationally intensive task.
In this thesis, we present a series of novel methods for the easy presentation of 3D
content both when it is already available in a digitized form and when it must be acquired
from the real world by image-based techniques. In the first case, we propose
a method which takes as input the 3D scene of interest and an example video, and it
automatically produces a video of the input scene that resembles the given video example.
In other words, our algorithm allows the user to replicate an existing video, for
example, a video created by a professional animator, on a different 3D scene.
In the context of image-based techniques, exploiting the inherent spatial organization
of photographs taken for the 3D reconstruction of a scene, we propose an intuitive
interface for the smooth stereoscopic navigation of the acquired scene providing an immersive
experience without the need of a complete 3D reconstruction.
Finally, we propose an interactive framework for improving low-quality 3D reconstructions
obtained through image-based reconstruction algorithms. Using few strokes on
the input images, the user can specify high-level geometric hints to improve incomplete
or noisy reconstructions which are caused by various quite common conditions
often arising for objects such as buildings, streets and numerous other human-made
functional elements
ieee access special section editorial multimedia analysis for internet of things
Big data processing includes both data management and data analytics. The data management step requires efficient cleaning, knowledge extraction, and integration and aggregation methods, whereas Internet-of-Multimedia-Things (IoMT) analysis is based on knowledge modeling and interpretation, which is more often performed by exploiting deep learning architectures. In the past couple of years, merging conventional and deep learning methodologies has exhibited great promise in ingesting multimedia big data, exploring the paradigm of transfer learning, association rule mining, and predictive analytics etc
Fusing Multimedia Data Into Dynamic Virtual Environments
In spite of the dramatic growth of virtual and augmented reality (VR and AR) technology, content creation for immersive and dynamic virtual environments remains a significant challenge. In this dissertation, we present our research in fusing multimedia data, including text, photos, panoramas, and multi-view videos, to create rich and compelling virtual environments.
First, we present Social Street View, which renders geo-tagged social media in its natural geo-spatial context provided by 360° panoramas. Our system takes into account visual saliency and uses maximal Poisson-disc placement with spatiotemporal filters to render social multimedia in an immersive setting. We also present a novel GPU-driven pipeline for saliency computation in 360° panoramas using spherical harmonics (SH). Our spherical residual model can be applied to virtual cinematography in 360° videos. We further present Geollery, a mixed-reality platform to render an interactive mirrored world in real time with three-dimensional (3D) buildings, user-generated content, and geo-tagged social media. Our user study has identified several use cases for these systems, including immersive social storytelling, experiencing the culture, and crowd-sourced tourism.
We next present Video Fields, a web-based interactive system to create, calibrate, and render dynamic videos overlaid on 3D scenes. Our system renders dynamic entities from multiple videos, using early and deferred texture sampling. Video Fields can be used for immersive surveillance in virtual environments. Furthermore, we present VRSurus and ARCrypt projects to explore the applications of gestures recognition, haptic feedback, and visual cryptography for virtual and augmented reality.
Finally, we present our work on Montage4D, a real-time system for seamlessly fusing multi-view video textures with dynamic meshes. We use geodesics on meshes with view-dependent rendering to mitigate spatial occlusion seams while maintaining temporal consistency. Our experiments show significant enhancement in rendering quality, especially for salient regions such as faces. We believe that Social Street View, Geollery, Video Fields, and Montage4D will greatly facilitate several applications such as virtual tourism, immersive telepresence, and remote education
Advances in Image Processing, Analysis and Recognition Technology
For many decades, researchers have been trying to make computers’ analysis of images as effective as the system of human vision is. For this purpose, many algorithms and systems have previously been created. The whole process covers various stages, including image processing, representation and recognition. The results of this work can be applied to many computer-assisted areas of everyday life. They improve particular activities and provide handy tools, which are sometimes only for entertainment, but quite often, they significantly increase our safety. In fact, the practical implementation of image processing algorithms is particularly wide. Moreover, the rapid growth of computational complexity and computer efficiency has allowed for the development of more sophisticated and effective algorithms and tools. Although significant progress has been made so far, many issues still remain, resulting in the need for the development of novel approaches
- …