1,313 research outputs found

    Multi-Reference Frame Image Registration for Rotation, Translation, and Scale

    Get PDF
    This thesis investigates applications of multi-reference frame image registration for image sets with various translation, rotation, and scale combinations. It focuses on registration accuracy improvement over traditional pairwise registration, and also compares the quality of scene estimation from frame averaging. Three experiments are developed which use cross-correlation to estimate translation, the Radon transform to estimate translation and rotation, and the Fourier-Mellin transform to estimate translation, rotation, and scale. Results from applying multi-reference frame registration in these experiments show distinct improvements in both registration accuracy and quality of frame averaging compared to single-reference frame registration. Furthermore, it is shown that the new registration technique is equivalent to the optimal Gauss-Markov estimator of the relative shifts given all pairwise shifts

    Off-line Arabic Handwriting Recognition System Using Fast Wavelet Transform

    Get PDF
    In this research, off-line handwriting recognition system for Arabic alphabet is introduced. The system contains three main stages: preprocessing, segmentation and recognition stage. In the preprocessing stage, Radon transform was used in the design of algorithms for page, line and word skew correction as well as for word slant correction. In the segmentation stage, Hough transform approach was used for line extraction. For line to words and word to characters segmentation, a statistical method using mathematic representation of the lines and words binary image was used. Unlike most of current handwriting recognition system, our system simulates the human mechanism for image recognition, where images are encoded and saved in memory as groups according to their similarity to each other. Characters are decomposed into a coefficient vectors, using fast wavelet transform, then, vectors, that represent a character in different possible shapes, are saved as groups with one representative for each group. The recognition is achieved by comparing a vector of the character to be recognized with group representatives. Experiments showed that the proposed system is able to achieve the recognition task with 90.26% of accuracy. The system needs only 3.41 seconds a most to recognize a single character in a text of 15 lines where each line has 10 words on average

    One RING to Rule Them All: Radon Sinogram for Place Recognition, Orientation and Translation Estimation

    Full text link
    LiDAR-based global localization is a fundamental problem for mobile robots. It consists of two stages, place recognition and pose estimation, and yields the current orientation and translation, using only the current scan as query and a database of map scans. Inspired by the definition of a recognized place, we consider that a good global localization solution should keep the pose estimation accuracy with a lower place density. Following this idea, we propose a novel framework towards sparse place-based global localization, which utilizes a unified and learning-free representation, Radon sinogram (RING), for all sub-tasks. Based on the theoretical derivation, a translation invariant descriptor and an orientation invariant metric are proposed for place recognition, achieving certifiable robustness against arbitrary orientation and large translation between query and map scan. In addition, we also utilize the property of RING to propose a global convergent solver for both orientation and translation estimation, arriving at global localization. Evaluation of the proposed RING based framework validates the feasibility and demonstrates a superior performance even under a lower place density

    Flaw reconstruction in NDE using a limited number of x-ray radiographic projections

    Get PDF
    One of the major problems in nondestructive evaluation (NDE) is the evaluation of flaw sizes and locations in a limited inspectability environment. In NDE x-ray radiography, this frequently occurs when the geometry of the part under test does not allow x-ray penetration in certain directions. Other times, the inspection setup in the field does not allow for inspection at all angles around the object. This dissertation presents a model based reconstruction technique which requires a small number of x-ray projections from one side of the object under test. The estimation and reconstruction of model parameters rather than the flaw distribution itself requires much less information, thereby reducing the number of required projections. Crack-like flaws are modeled as piecewise linear curves (connected points) and are reconstructed stereographically from at least two projections by matching corresponding endpoints of the linear segments. Volumetric flaws are modeled as ellipsoids and elliptical slices through ellipsoids. The elliptical principal axes lengths, orientation angles and locations are estimated by fitting a forward model to the projection data. The fitting procedure is highly nonlinear and requires stereographic projections to obtain initial estimates of the model parameters. The methods are tested both on simulated and experimental data. Comparisons are made with models from the field of stereology. Finally, analysis of reconstruction errors is presented for both models

    MULTIRIDGELETS FOR TEXTURE ANALYSIS

    Get PDF
    Directional wavelets have orientation selectivity and thus are able to efficiently represent highly anisotropic elements such as line segments and edges. Ridgelet transform is a kind of directional multi-resolution transform and has been successful in many image processing and texture analysis applications. The objective of this research is to develop multi-ridgelet transform by applying multiwavelet transform to the Radon transform so as to attain attractive improvements. By adapting the cardinal orthogonal multiwavelets to the ridgelet transform, it is shown that the proposed cardinal multiridgelet transform (CMRT) possesses cardinality, approximate translation invariance, and approximate rotation invariance simultaneously, whereas no single ridgelet transform can hold all these properties at the same time. These properties are beneficial to image texture analysis. This is demonstrated in three studies of texture analysis applications. Firstly a texture database retrieval study taking a portion of the Brodatz texture album as an example has demonstrated that the CMRT-based texture representation for database retrieval performed better than other directional wavelet methods. Secondly the study of the LCD mura defect detection was based upon the classification of simulated abnormalities with a linear support vector machine classifier, the CMRT-based analysis of defects were shown to provide efficient features for superior detection performance than other competitive methods. Lastly and the most importantly, a study on the prostate cancer tissue image classification was conducted. With the CMRT-based texture extraction, Gaussian kernel support vector machines have been developed to discriminate prostate cancer Gleason grade 3 versus grade 4. Based on a limited database of prostate specimens, one classifier was trained to have remarkable test performance. This approach is unquestionably promising and is worthy to be fully developed

    Recent Progress in Image Deblurring

    Full text link
    This paper comprehensively reviews the recent development of image deblurring, including non-blind/blind, spatially invariant/variant deblurring techniques. Indeed, these techniques share the same objective of inferring a latent sharp image from one or several corresponding blurry images, while the blind deblurring techniques are also required to derive an accurate blur kernel. Considering the critical role of image restoration in modern imaging systems to provide high-quality images under complex environments such as motion, undesirable lighting conditions, and imperfect system components, image deblurring has attracted growing attention in recent years. From the viewpoint of how to handle the ill-posedness which is a crucial issue in deblurring tasks, existing methods can be grouped into five categories: Bayesian inference framework, variational methods, sparse representation-based methods, homography-based modeling, and region-based methods. In spite of achieving a certain level of development, image deblurring, especially the blind case, is limited in its success by complex application conditions which make the blur kernel hard to obtain and be spatially variant. We provide a holistic understanding and deep insight into image deblurring in this review. An analysis of the empirical evidence for representative methods, practical issues, as well as a discussion of promising future directions are also presented.Comment: 53 pages, 17 figure
    • …
    corecore