13 research outputs found

    Doctor of Philosophy

    Interactive editing and manipulation of digital media is a fundamental component of digital content creation. One medium in particular, digital imagery, has recently seen a surge in the popularity of large, even massive, image formats. Unfortunately, current systems and techniques are rarely concerned with scalability or usability for these large images. Moreover, processing massive (or even large) imagery is assumed to be an off-line, automatic process, although many problems associated with these datasets require human intervention for high-quality results. This dissertation details how to design interactive image techniques that scale. In particular, massive imagery is typically constructed as a seamless mosaic of many smaller images. The focus of this work is the creation of new technologies to enable user interaction in the formation of these large mosaics. While an interactive system for all stages of the mosaic creation pipeline is a long-term research goal, this dissertation concentrates on the last phase of the pipeline: the composition of registered images into a seamless composite. The work detailed in this dissertation provides the technologies to fully realize interactive editing in mosaic composition on image collections ranging from the very small to the massive in scale.
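    The composition step described above can be illustrated in miniature by the simplest non-interactive strategy: feathered blending across an overlap between two registered images. This is a hedged sketch for intuition, not the dissertation's interactive or scalable technique; the name `feather_blend` and the linear alpha ramp are illustrative assumptions.

```python
import numpy as np

def feather_blend(left, right, overlap):
    """Composite two horizontally adjacent image strips whose last/first
    `overlap` columns cover the same scene region."""
    # Linear alpha ramp: weight 1 -> 0 across the overlap for the left image.
    alpha = np.linspace(1.0, 0.0, overlap)[None, :]
    blended = alpha * left[:, -overlap:] + (1.0 - alpha) * right[:, :overlap]
    return np.hstack([left[:, :-overlap], blended, right[:, overlap:]])

# Two constant strips with a brightness mismatch: the composite
# transitions smoothly instead of showing a hard seam.
a = np.full((4, 10), 0.2)
b = np.full((4, 10), 0.8)
mosaic = feather_blend(a, b, overlap=6)
```

In a real mosaic this simple ramp is usually replaced by seam selection or gradient-domain blending, but it shows the basic composition operation the pipeline's last phase performs.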

    Real-Time Computational Gigapixel Multi-Camera Systems

    Standard cameras are designed to faithfully mimic the human eye and visual system. In recent years, commercially available cameras have become more complex and offer higher image resolutions than ever before. However, the quality of conventional imaging methods is limited by several parameters, such as the pixel size, the lens system, and the diffraction limit. Rapid technological advancements, the increase in available computing power, and the introduction of Graphics Processing Units (GPUs) and Field-Programmable Gate Arrays (FPGAs) open new possibilities in the computer vision and computer graphics communities. Researchers are now focusing on utilizing the immense computational power offered by modern processing platforms to create imaging systems with novel or significantly enhanced capabilities compared to standard ones. One popular type of computational imaging system offering new possibilities is the multi-camera system. This thesis focuses on FPGA-based multi-camera systems that operate in real time. The aim of the multi-camera systems presented in this thesis is to offer wide field-of-view (FOV) video coverage at high frame rates. The wide FOV is achieved by constructing a panoramic image from the images acquired by the multi-camera system. Two new real-time computational imaging systems that provide new functionalities and better performance than conventional cameras are presented. Each camera system's design and implementation are analyzed in detail, built, and tested in real-time conditions. Panoptic is a miniaturized low-cost multi-camera system that reconstructs a 360-degree view in real time. Since it is an easily portable system, it provides a means to capture the complete surrounding light field in dynamic environments, such as when mounted on a vehicle or a flying drone.
    The second presented system, GigaEye II, is a modular high-resolution imaging system that introduces the concept of distributed image processing to real-time camera systems. This thesis explains in detail how such a concept can be used efficiently in real-time computational imaging systems. The purpose of computational imaging systems in the form of multi-camera systems does not end with real-time panoramas. The application scope of these cameras is vast: they can be used in 3D cinematography, for broadcasting live events, or for immersive telepresence. The final chapter of this thesis presents three potential applications of these systems: object detection and tracking, high dynamic range (HDR) imaging, and observation of multiple regions of interest. Object detection and tracking, and observation of multiple regions of interest, are extremely useful capabilities for surveillance systems in the security and defense industry and in the fast-growing autonomous vehicle industry. High dynamic range imaging, on the other hand, is becoming a common option in consumer cameras, and the presented method allows instantaneous capture of HDR video. Finally, this thesis concludes with a discussion of real-time multi-camera systems, their advantages, their limitations, and future predictions.
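    As one concrete illustration of the HDR application mentioned above, a minimal exposure-stack merge can be sketched as follows. This is a generic weighted merge assuming a linear sensor response with pixel values in [0, 1]; `merge_hdr` and the hat-shaped confidence weight are illustrative choices, not the thesis's FPGA implementation.

```python
import numpy as np

def merge_hdr(images, exposure_times):
    """Merge a stack of differently exposed frames into a radiance
    estimate (per unit exposure time)."""
    num = np.zeros_like(images[0], dtype=np.float64)
    den = np.zeros_like(images[0], dtype=np.float64)
    for img, t in zip(images, exposure_times):
        # Hat weight: trust mid-range pixels, discount near-clipped ones.
        w = 1.0 - np.abs(2.0 * img - 1.0)
        num += w * img / t
        den += w
    return num / np.maximum(den, 1e-8)

# A constant-radiance scene captured at two exposure times: both frames
# should merge back to the same radiance estimate.
stack = [np.full((2, 2), 0.3), np.full((2, 2), 0.6)]
radiance = merge_hdr(stack, [1.0, 2.0])
```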

    High-quality Panorama Stitching based on Asymmetric Bidirectional Optical Flow

    In this paper, we propose a panorama stitching algorithm based on asymmetric bidirectional optical flow. This algorithm expects multiple photos captured by fisheye-lens cameras as input; through the proposed algorithm, these photos can be merged into a high-quality 360-degree spherical panoramic image. For photos taken from a distant perspective, the parallax among them is relatively small, and the obtained panoramic image can be nearly seamless and undistorted. For photos taken from a close perspective or with relatively large parallax, a seamless though partially distorted panoramic image can also be obtained. Besides, with the help of a Graphics Processing Unit (GPU), this algorithm can complete the whole stitching process at very high speed: typically, it takes less than 30 s to obtain a panoramic image of 9000-by-4000 pixels, which means our panorama stitching algorithm is of high value in many real-time applications. Our code is available at https://github.com/MungoMeng/Panorama-OpticalFlow. Published at the 5th International Conference on Computational Intelligence and Applications (ICCIA 2020).
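    The bidirectional idea can be illustrated with a toy 1-D sketch: each side of the overlap is warped partway along the flow, and the two warped signals are mixed. This is a simplification for intuition only, assuming a known 1-D displacement `flow` from signal `a` to signal `b`; the paper's method operates on 2-D fisheye imagery and estimates the flow itself.

```python
import numpy as np

def asymmetric_blend(a, b, flow, alpha):
    """Blend two 1-D overlap signals given a displacement `flow`
    (a's content at x corresponds to b's content at x + flow).
    `alpha` in [0, 1] shifts the warp asymmetrically: a is pushed
    alpha of the way toward b, b is pulled the remaining fraction."""
    x = np.arange(len(a), dtype=np.float64)
    a_w = np.interp(x - alpha * flow, x, a)          # a, partially warped
    b_w = np.interp(x + (1.0 - alpha) * flow, x, b)  # b, partially warped
    return (1.0 - alpha) * a_w + alpha * b_w
```

With `alpha = 0.5` both sides meet halfway; choosing `alpha` per pixel position across the overlap gives the asymmetric transition.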

    Imaging methods for understanding and improving visual training in the geosciences

    Experience in the field is a critical educational component for every student studying geology. However, it is typically difficult to ensure that every student gets the necessary experience because of monetary and scheduling limitations. Thus, we proposed to create a virtual field trip based on an existing 10-day field trip to California taken as part of an undergraduate geology course at the University of Rochester. To assess the effectiveness of this approach, we also proposed to analyze the learning and observation processes of both students and experts during the real and virtual field trips. At sites intended for inclusion in the virtual field trip, we captured gigapixel-resolution panoramas by taking hundreds of images using custom-built robotic imaging systems. We gathered data to analyze the learning process by fitting each geology student and expert with a portable eye-tracking system that records a video of their eye movements and a video of the scene they are observing. An important component of analyzing the eye-tracking data is mapping the gaze of each observer into a common reference frame. We have made progress towards developing a software tool that helps automate this procedure by using image feature tracking and registration methods to map the scene video frames from each eye-tracker onto a reference panorama for each site. For the purpose of creating a virtual field trip, we have a large-scale semi-immersive display system that consists of four tiled projectors, which have been colorimetrically and photometrically calibrated, and a curved widescreen display surface. We use this system to present the previously captured panoramas, which simulates the experience of visiting the sites in person. In terms of broader geology education and outreach, we have created an interactive website that uses Google Earth as the interface for visually exploring the panoramas captured for each site.
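    The gaze-mapping step can be sketched as follows, assuming the feature tracking and registration mentioned above has already produced a per-frame 3x3 homography from the scene video into the reference panorama; `map_gaze` is an illustrative name, not the project's actual tool.

```python
import numpy as np

def map_gaze(H, gaze_xy):
    """Project a gaze point (pixels in a scene-video frame) into the
    reference panorama via a 3x3 homography H (frame -> panorama)."""
    p = H @ np.array([gaze_xy[0], gaze_xy[1], 1.0])
    return p[:2] / p[2]  # perspective divide back to 2-D pixels
```

Applying the same mapping to every observer's gaze stream places all fixations in one panorama-based frame, where they can be compared across students and experts.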

    Interactive Content-Aware Zooming

    We propose a novel, interactive content-aware zooming operator that allows effective and efficient visualization of high-resolution images on small screens, which may have different aspect ratios than the input images. Our approach applies an image retargeting method in order to fit an entire image into the limited screen space. This provides global, but approximate, views at lower zoom levels. However, as we zoom more closely into the image, we continuously unroll the distortion to provide local, but more detailed and accurate, views at higher zoom levels. In addition, we propose an adaptive view-dependent mesh to achieve high retargeting quality while maintaining interactive performance. We demonstrate the effectiveness of the proposed operator by comparing it against the traditional zooming approach and against a method stemming from a direct combination of existing works.
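    The "unrolling" of distortion with zoom level can be sketched as a simple interpolation of mesh-vertex positions between the retargeted (distorted, screen-fitting) layout and the original (accurate) layout. This is a minimal illustration assuming a linear schedule between hypothetical zoom bounds `z_min` and `z_max`; the paper's operator uses an adaptive view-dependent mesh rather than this fixed blend.

```python
import numpy as np

def unroll_positions(retargeted, original, zoom, z_min=1.0, z_max=4.0):
    """Interpolate mesh-vertex positions: fully retargeted at z_min
    (global approximate view), fully original at z_max (accurate
    detailed view), linear in between."""
    t = np.clip((zoom - z_min) / (z_max - z_min), 0.0, 1.0)
    return (1.0 - t) * retargeted + t * original
```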

    Error-Concealing Image-Based Rendering Methods (Fehlerkaschierte Bildbasierte Darstellungsverfahren)

    Creating photo-realistic images has been one of the major goals of computer graphics since its early days. Instead of modeling the complexity of nature with standard modeling tools, image-based approaches aim at exploiting real-world footage directly, as it is photo-realistic by definition. A drawback of these approaches has always been that the composition or combination of different sources is a non-trivial task, often resulting in annoying visible artifacts. In this thesis we focus on different techniques to diminish visible artifacts when combining multiple images in a common image domain. The results are either novel images, when dealing with the composition of multiple images, or novel video sequences rendered in real time, when dealing with video footage from multiple cameras.

    Large databases of real and synthetic images for feature evaluation and prediction

    Thesis (Ph.D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2012. Includes bibliographical references (p. 157-167).
    Image features are widely used in computer vision applications, from stereo matching to panorama stitching to object and scene recognition. They exploit image regularities to capture structure in images both locally, using a patch around an interest point, and globally, over the entire image. Image features need to be distinctive and robust to variations in scene content, camera viewpoint, and illumination conditions. Common tasks are matching local features across images and finding semantically meaningful matches amongst a large set of images. If there is enough structure or regularity in the images, we should be able not only to find good matches but also to predict parts of the objects or the scene that were not directly captured by the camera. One of the difficulties in evaluating the performance of image features in both the prediction and matching tasks is the availability of ground-truth data. In this dissertation, we take two different approaches. First, we propose using a photorealistic virtual world for evaluating local feature descriptors and learning new feature detectors. Acquiring ground-truth data, in particular pixel-to-pixel correspondences between images, in complex 3D scenes under different viewpoint and illumination conditions in a controlled way is nearly impossible in a real-world setting. Instead, we use a high-resolution 3D model of a city to gain complete and repeatable control of the environment. We calibrate our virtual-world evaluations by comparing against feature rankings made from photographic data of the same subject matter (the Statue of Liberty). We then use our virtual world to study the effects of controlled changes in viewpoint and illumination on descriptor performance.
    We further employ machine learning techniques to train a model that recognizes visually rich interest points and optimizes the performance of a given descriptor. In the latter part of the thesis, we take advantage of the large amounts of image data available on the Internet to explore the regularities in outdoor scenes and, more specifically, the matching and prediction tasks in street-level images. Generally, people are very adept at predicting what they might encounter as they navigate through the world. They use all of their prior experience to make such predictions even when placed in an unfamiliar environment. We propose a system that can predict what lies just beyond the boundaries of the image using a large photo collection of images of the same class, but not from the same location in the real world. We evaluate the performance of the system using different global or quantized densely extracted local features. We demonstrate how to build seamless transitions between the query and prediction images, thus creating a photorealistic virtual space from real-world images. By Biliana K. Kaneva, Ph.D.
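    The local-feature matching task discussed above is conventionally done with nearest-neighbour descriptor search plus Lowe's ratio test, which can be sketched as follows; `ratio_match` and the 0.8 threshold are illustrative conventions, not specific to this thesis.

```python
import numpy as np

def ratio_match(desc_a, desc_b, ratio=0.8):
    """Nearest-neighbour descriptor matching with the ratio test:
    keep a match only when the best distance is clearly smaller
    than the second best, rejecting ambiguous correspondences."""
    matches = []
    for i, d in enumerate(desc_a):
        dists = np.linalg.norm(desc_b - d, axis=1)
        j, k = np.argsort(dists)[:2]  # best and second-best neighbours
        if dists[j] < ratio * dists[k]:
            matches.append((i, int(j)))
    return matches
```

Ground-truth pixel-to-pixel correspondences, such as those the virtual world provides, make it possible to score exactly which of these matches are correct.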

    Digital Stack Photography and Its Applications

    This work centers on digital stack photography and its applications. A stack of images refers, in a broad sense, to an ensemble of associated images taken with variation in one or more parameters of the system configuration or setting. An image stack captures and contains potentially more information than any of the constituent images. Digital stack photography (DST) techniques explore this rich information to render a synthesized image that oversteps the limitations of a digital camera's capabilities. This work considers in particular two basic DST problems, both previously challenging, and their applications. One is high-dynamic-range (HDR) imaging of non-stationary dynamic scenes, in which the stacked images vary in exposure conditions. The other is large-scale panorama composition from multiple images. In this case, the image components are related to each other by the spatial relation among the subdomains of the same scene that they jointly cover and capture. We consider the non-conventional, practical, and challenging situations where the spatial overlap among the sub-images is sparse (S), irregular in geometry and imprecise with respect to the designed geometry (I), and the captured data over the overlap zones are noisy (N) or lacking in features. We refer to these conditions simply as the S.I.N. conditions.
    There are common challenging issues in both problems. For example, both face the dominant problem of image alignment for seamless and artifact-free image composition. Our solutions to the common problems are manifested differently in each particular problem, as a result of adaptation to the specific properties of each type of image ensemble. For the exposure stack, existing alignment approaches struggled to overcome three main challenges: inconsistency in brightness, large displacement in dynamic scenes, and pixel saturation. We exploit solutions in the following three aspects. First, we introduce a model that addresses and admits changes in both geometric configurations and optical conditions, while following the traditional optical flow description. Previous models treated these two types of changes mutually exclusively, one or the other. Next, we extend the pixel-based optical flow model to a patch-based model. The advantages are two-fold: a patch has texture and local content that individual pixels fail to present, and it also offers opportunities for faster processing, such as via two-scale or multi-scale processing. The extended model is then solved efficiently with an EM-like algorithm, which is reliable in the presence of large displacement. Third, we present a generative model for reducing or eliminating the typical artifacts that arise as a side effect of inadequate alignment of clipped pixels. A patch-based texture synthesis is combined with the patch-based alignment to achieve an artifact-free result.
    For large-scale panorama composition under the S.I.N. conditions, we have developed an effective solution scheme that significantly reduces both processing time and artifacts. Previously existing approaches can be roughly categorized as either geometry-based or feature-based composition. In the former, one relies on precise knowledge of the system geometry, by design and/or calibration. It works well with a far-away scene, in which case there is only limited variation in projective geometry among the sub-images. However, the system geometry is not invariant to physical conditions such as thermal variation, stress variation, etc. Composition with this approach is typically done in the spatial domain. The other approach is more robust to geometric and optical conditions. It works surprisingly well with feature-rich and stationary scenes, but not in the absence of recognizable features. Composition based on feature matching is typically done in the spatial-gradient domain. In short, both approaches are challenged by the S.I.N. conditions. With certain snapshot data sets obtained and contributed by Brady et al., these methods either fail in composition or render images with visually disturbing artifacts. To overcome the S.I.N. conditions, we have reconciled these two approaches, making successful and complementary use of both a priori, approximate information about the geometric system configuration and the feature information from the image data. We also designed and developed a software architecture with careful extraction of primitive function modules that can be efficiently implemented and executed in parallel. In addition to much faster processing, the resulting images are clearer and sharper at the overlap zones, without typical ghosting artifacts.
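    The reconciliation of geometric priors with image evidence can be sketched in miniature: start from the approximate offset given by system geometry and refine it only where the data support a match. This toy translation-only version is an assumption-laden sketch (the names, the NCC scoring, and the requirement that `margin` exceed the search range are all illustrative), far simpler than the parallel architecture described above.

```python
import numpy as np

def refine_offset(ref, mov, prior, radius=2, margin=6):
    """Refine an approximate (dy, dx) translation, e.g. from system
    geometry, by a small exhaustive search maximising normalised
    cross-correlation; keeps the prior when the overlap is featureless."""
    h, w = ref.shape
    win = ref[margin:h - margin, margin:w - margin]
    a = win - win.mean()
    best, best_score = prior, -np.inf
    for dy in range(prior[0] - radius, prior[0] + radius + 1):
        for dx in range(prior[1] - radius, prior[1] + radius + 1):
            cand = mov[margin + dy:h - margin + dy, margin + dx:w - margin + dx]
            b = cand - cand.mean()
            denom = np.sqrt((a * a).sum() * (b * b).sum())
            if denom < 1e-12:
                continue  # no texture to match: fall back to the prior
            score = (a * b).sum() / denom
            if score > best_score:
                best_score, best = score, (dy, dx)
    return best
```

The geometric prior bounds the search (keeping it cheap and robust to sparse, feature-poor overlaps), while the correlation term supplies the data-driven correction when features do exist.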

    Modeling and Simulation in Engineering

    This book provides an open platform to establish and share knowledge developed by scholars, scientists, and engineers from all over the world about various applications of modeling and simulation in the product design process, across various engineering fields. The book consists of 12 chapters arranged in two sections (3D Modeling and Virtual Prototyping), reflecting the multidimensionality of applications related to modeling and simulation. Some of the most recent modeling and simulation techniques, as well as some of the most accurate and sophisticated software for treating complex systems, are applied. All the original contributions in this book are joined by the basic principle of a successful modeling and simulation process: as complex as necessary, and as simple as possible. The idea is to manipulate the simplifying assumptions in a way that reduces the complexity of the model (in order to allow real-time simulation), but without altering the precision of the results.

    Electronic Imaging & the Visual Arts. EVA 2012 Florence

    The key aim of this event is to provide a forum for the user, supplier, and scientific research communities to meet and exchange experiences, ideas, and plans in the wide area of Culture & Technology. Participants receive up-to-date news on new EC and international arts computing and telecommunications initiatives, as well as on projects in the visual arts field, in archaeology, and in history. Working groups and new projects are promoted. Scientific and technical demonstrations are presented.