Search CORE

11,860 research outputs found

Contour Generator Points for Threshold Selection and a Novel Photo-Consistency Measure for Space Carving

Author: Isidoro John
Sclaroff Stan
Publication venue: Boston University Computer Science Department
Publication date: 02/12/2003
Field of study

Space carving has emerged as a powerful method for multiview scene reconstruction. Although a wide variety of methods have been proposed, the quality of the reconstruction remains highly-dependent on the photometric consistency measure, and the threshold used to carve away voxels. In this paper, we present a novel photo-consistency measure that is motivated by a multiset variant of the chamfer distance. The new measure is robust to high amounts of within-view color variance and also takes into account the projection angles of back-projected pixels. Another critical issue in space carving is the selection of the photo-consistency threshold used to determine what surface voxels are kept or carved away. In this paper, a reliable threshold selection technique is proposed that examines the photo-consistency values at contour generator points. Contour generators are points that lie on both the surface of the object and the visual hull. To determine the threshold, a percentile ranking of the photo-consistency values of these generator points is used. This improved technique is applicable to a wide variety of photo-consistency measures, including the new measure presented in this paper. Also presented in this paper is a method to choose between photo-consistency measures, and voxel array resolutions prior to carving using receiver operating characteristic (ROC) curves

Boston University Institutional Repository (OpenBU)

STEFANN: Scene Text Editor using Font Adaptive Neural Network

Author: Bhattacharya Saumik
Ghosh Subhankar
Pal Umapada
Roy Prasun
Publication venue
Publication date: 25/04/2020
Field of study

Textual information in a captured scene plays an important role in scene interpretation and decision making. Though there exist methods that can successfully detect and interpret complex text regions present in a scene, to the best of our knowledge, there is no significant prior work that aims to modify the textual information in an image. The ability to edit text directly on images has several advantages including error correction, text restoration and image reusability. In this paper, we propose a method to modify text in an image at character-level. We approach the problem in two stages. At first, the unobserved character (target) is generated from an observed character (source) being modified. We propose two different neural network architectures - (a) FANnet to achieve structural consistency with source font and (b) Colornet to preserve source color. Next, we replace the source character with the generated character maintaining both geometric and visual consistency with neighboring characters. Our method works as a unified platform for modifying text in images. We present the effectiveness of our method on COCO-Text and ICDAR datasets both qualitatively and quantitatively.Comment: Accepted in The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 202

arXiv.org e-Print Archive

Crossref

Efficient moving point handling for incremental 3D manifold reconstruction

Author: Matteucci Matteo
Romanoni Andrea
Publication venue
Publication date: 01/01/2015
Field of study

As incremental Structure from Motion algorithms become effective, a good sparse point cloud representing the map of the scene becomes available frame-by-frame. From the 3D Delaunay triangulation of these points, state-of-the-art algorithms build a manifold rough model of the scene. These algorithms integrate incrementally new points to the 3D reconstruction only if their position estimate does not change. Indeed, whenever a point moves in a 3D Delaunay triangulation, for instance because its estimation gets refined, a set of tetrahedra have to be removed and replaced with new ones to maintain the Delaunay property; the management of the manifold reconstruction becomes thus complex and it entails a potentially big overhead. In this paper we investigate different approaches and we propose an efficient policy to deal with moving points in the manifold estimation process. We tested our approach with four sequences of the KITTI dataset and we show the effectiveness of our proposal in comparison with state-of-the-art approaches.Comment: Accepted in International Conference on Image Analysis and Processing (ICIAP 2015

arXiv.org e-Print Archive

Archivio istituzionale della ricerca - Politecnico di Milano

Crossref

A framework for digital sunken relief generation based on 3D geometric models

Author: A. Agrawal
A. Pasko
A. Sourin
D. Decarlo
D. Decarlo
F. Cole
F. Cole
G. Zeng
J. Flaxman
J. Kerber
J. Kerber
J. Kerber
J.J. Koenderink
Jens Kerber
Jian Chang
Jian J. Zhang
L. Markosian
M. Alexa
M. Wang
M. Wang
M.C. Sousa
M.M. Oliveira
Meili Wang
P. Cignoni
P.N. Belhumeur
R. Fattal
R. Raskar
R. Rogers
R. Zhang
R.D. Kalnins
R.N. Perry
T. Weyrich
T.P. Wu
W. Song
X. Sun
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

Sunken relief is a special art form of sculpture whereby the depicted shapes are sunk into a given surface. This is traditionally created by laboriously carving materials such as stone. Sunken reliefs often utilize the engraved lines or strokes to strengthen the impressions of a 3D presence and to highlight the features which otherwise are unrevealed. In other types of reliefs, smooth surfaces and their shadows convey such information in a coherent manner. Existing methods for relief generation are focused on forming a smooth surface with a shallow depth which provides the presence of 3D figures. Such methods unfortunately do not help the art form of sunken reliefs as they omit the presence of feature lines. We propose a framework to produce sunken reliefs from a known 3D geometry, which transforms the 3D objects into three layers of input to incorporate the contour lines seamlessly with the smooth surfaces. The three input layers take the advantages of the geometric information and the visual cues to assist the relief generation. This framework alters existing techniques in line drawings and relief generation, and then combines them organically for this particular purpose

Crossref

Bournemouth University Research Online

MPG.PuRe