28 research outputs found

    Rectification Based Single-Shot Structured Light for Accurate and Dense 3D Reconstruction

    Get PDF
    Structured Light (SL) patterns generated based on pseudo-random arrays are widely used for single-shot 3D reconstruction using projector-camera systems. These SL images consist of a set of tags with different appearances, where these patterns will be projected on a target surface, then captured by a camera and decoded. The precision of localizing these tags from captured camera images affects the quality of the pixel-correspondences between the projector and the camera, and consequently that of the derived 3D shape. In this paper, we incorporate a quadrilateral representation for the detected SL tags that allows the construction of robust and accurate pixel-correspondences and the application of a spatial rectification module that leads to high tag classification accuracy. When applying the proposed method to single-shot 3D reconstruction, we show the effectiveness of this method over a baseline in estimating denser and more accurate 3D point-clouds

    Locally Adaptive Thresholding for Single-Shot Structured Light Patterns

    Get PDF
    Image thresholding is a challenging task due to its sensitivity to environmental variations and degradation in the quality of the captured image. Although many image thresholding methods have been introduced, most of them require the fine tuning of a thresholding parameter that is not suitable for single-shot structured light (SSSL) based projector-camera applications. In this paper, we introduce a locally adaptive thresholding method with automatic parameter selection based on the local statistics of the distinct image partitions. For assessing the proposed scheme, we introduce an evaluation that relies on mapping SSSL patterns between the camera and projector spaces. Experimental results demonstrate the effectiveness of the proposed technique by maintaining the thresholding accuracy of the baseline method, without the need to fine tune the obtained thresholding parameter or any noticeable change in the qualitative results

    Sensitivity Assessment for Projector Camera Geometry Reconstruction Systems

    Get PDF
    The principal point is an important parameter in the characterisa-tion of optical systems. We wish to better understand the opticalsystem parameters and their sensitivity to a good or poor estima-tion of principal point, to which the focal length, in particular, can behighly sensitive, which this work seeks to understand

    Challenges of Deep Learning-based Text Detection in the Wild

    Get PDF
    The reported accuracy of recent state-of-the-art text detection methods, mostly deep learning approaches, is in the order of 80% to 90% on standard benchmark datasets. These methods have relaxed some of the restrictions of structured text and environment (i.e., "in the wild") which are usually required for classical OCR to properly function. Even with this relaxation, there are still circumstances where these state-of-the-art methods fail.  Several remaining challenges in wild images, like in-plane-rotation, illumination reflection, partial occlusion, complex font styles, and perspective distortion, cause exciting methods to perform poorly. In order to evaluate current approaches in a formal way, we standardize the datasets and metrics for comparison which had made comparison between these methods difficult in the past. We use three benchmark datasets for our evaluations: ICDAR13, ICDAR15, and COCO-Text V2.0. The objective of the paper is to quantify the current shortcomings and to identify the challenges for future text detection research

    Fast Radiometric Compensation for Nonlinear Projectors

    Get PDF
    Radiometric compensation can be accomplished on nonlinearprojector-camera systems through the use of pixelwise lookup ta-bles. Existing methods are both computationally and memory inten-sive. Such methods are impractical to be implemented for currenthigh-end projector technology. In this paper, a novel computation-ally efficient method for nonlinear radiometric compensation of pro-jectors is proposed. The compensation accuracy of the proposedmethod is assessed with the use of a spectroradiometer. Experi-mental results show both the effectiveness of the method and thereduction in compensation time compared to a recent state-of-the-art method

    2D Positional Embedding-based Transformer for Scene Text Recognition

    Get PDF
    Recent state-of-the-art scene text recognition methods are primarily based on Recurrent Neural Networks (RNNs), however, these methods require one-dimensional (1D) features and are not designed for recognizing irregular-text instances due to the loss of spatial information present in the original two-dimensional (2D) images.  In this paper, we leverage a Transformer-based architecture for recognizing both regular and irregular text-in-the-wild images. The proposed method takes advantage of using a 2D positional encoder with the Transformer architecture to better preserve the spatial information of 2D image features than previous methods. The experiments on popular benchmarks, including the challenging COCO-Text dataset, demonstrate that the proposed scene text recognition method outperformed the state-of-the-art in most cases, especially on irregular-text recognition

    Constraints for Time-Multiplexed Structured Light with a Hand-held Camera

    Get PDF
    Multi-frame structured light in projector-camera systems affords high-density and non-contact methods of 3D surface reconstruction. However, they have strict setup constraints which can become expensive and time-consuming. Here, we investigate the conditions under which a projective homography can be used to compensate for small perturbations in pose caused by a hand-held camera. We synthesize data using a pinhole camera model and use it to determine the average 2D reprojection error per point correspondence. This error map is grouped into regions with specified upper-bounds to classify which regions produce sufficiently minimal error to be considered feasible for a structured-light projector-camera system with a hand-held camera. Empirical results demonstrate that a sub-pixel reprojection accuracy is achievable with a feasible geometric constraint

    Time-Series Causality with Missing Data

    Get PDF
    Over the past years, researchers have proposed various methods to discover causal relationships among time-series data as well as algorithms to fill in missing entries in time-series data. Little to no work has been done in combining the two strategies for the purpose of learning causal relationships using unevenly sampled multivariate time-series data. In this paper, we examine how the causal parameters learnt from unevenly sampled data (with missing entries) deviates from the parameters learnt using the evenly sampled data (without missing entries). However, to obtain the causal relationship from a given time-series requires evenly sampled data, which suggests filling the missing data values before obtaining the causal parameters. Therefore, the proposed method is based on applying a Gaussian Process Regression (GPR) model for missing data recovery, followed by several pairwise Granger causality equations in Vector Autoregssive form to fit the recovered data and obtain the causal parameters. Experimental results show that the causal parameters generated by using GPR data filling offers much lower RMSE than the dummy model (fill with last seen entry) under all missing values percentage, suggesting that GPR data filling can better preserve the causal relationships when compared with dummy data filling, thus should be considered when dealing with unevenly sampled time-series causality learning

    Text Enhancement in Projected Imagery

    Get PDF
    There is great interest in improving the visual quality of projectedimagery. In particular, for image enhancement, we would assertthat text and non-text regions should be enhanced differently inseeking to maximize perceived quality, since the spatial and statis-tical characteristics of text and non-text images are quite distinct.In this paper, we present a text enhancement scheme based on anovel local dynamic range statistical thresholding. Given an inputimage, text-like regions are obtained on the basis of computing thelocal statistics of regions having a high dynamic range, allowing apixel-wise classification into text-like or background classes. Theactual enhancement is obtained via class-dependent Wiener filter-ing, with text-like regions sharpened more than the background.Experimental results on four challenging images show that the pro- posed scheme offers a better visual quality than projection with- out enhancement as well as a recent state-of-the-art enhancementmethod

    untitled

    No full text
    Abstrac
    corecore