1,313 research outputs found
Multi-Reference Frame Image Registration for Rotation, Translation, and Scale
This thesis investigates applications of multi-reference frame image registration for image sets with various translation, rotation, and scale combinations. It focuses on registration accuracy improvement over traditional pairwise registration, and also compares the quality of scene estimation from frame averaging. Three experiments are developed which use cross-correlation to estimate translation, the Radon transform to estimate translation and rotation, and the Fourier-Mellin transform to estimate translation, rotation, and scale. Results from applying multi-reference frame registration in these experiments show distinct improvements in both registration accuracy and quality of frame averaging compared to single-reference frame registration. Furthermore, it is shown that the new registration technique is equivalent to the optimal Gauss-Markov estimator of the relative shifts given all pairwise shifts
Off-line Arabic Handwriting Recognition System Using Fast Wavelet Transform
In this research, off-line handwriting recognition system for Arabic alphabet is
introduced. The system contains three main stages: preprocessing, segmentation and
recognition stage. In the preprocessing stage, Radon transform was used in the design
of algorithms for page, line and word skew correction as well as for word slant
correction. In the segmentation stage, Hough transform approach was used for line
extraction. For line to words and word to characters segmentation, a statistical method
using mathematic representation of the lines and words binary image was used.
Unlike most of current handwriting recognition system, our system simulates the
human mechanism for image recognition, where images are encoded and saved in
memory as groups according to their similarity to each other. Characters are
decomposed into a coefficient vectors, using fast wavelet transform, then, vectors,
that represent a character in different possible shapes, are saved as groups with one
representative for each group. The recognition is achieved by comparing a vector of
the character to be recognized with group representatives.
Experiments showed that the proposed system is able to achieve the recognition task
with 90.26% of accuracy. The system needs only 3.41 seconds a most to recognize a
single character in a text of 15 lines where each line has 10 words on average
One RING to Rule Them All: Radon Sinogram for Place Recognition, Orientation and Translation Estimation
LiDAR-based global localization is a fundamental problem for mobile robots.
It consists of two stages, place recognition and pose estimation, and yields
the current orientation and translation, using only the current scan as query
and a database of map scans. Inspired by the definition of a recognized place,
we consider that a good global localization solution should keep the pose
estimation accuracy with a lower place density. Following this idea, we propose
a novel framework towards sparse place-based global localization, which
utilizes a unified and learning-free representation, Radon sinogram (RING), for
all sub-tasks. Based on the theoretical derivation, a translation invariant
descriptor and an orientation invariant metric are proposed for place
recognition, achieving certifiable robustness against arbitrary orientation and
large translation between query and map scan. In addition, we also utilize the
property of RING to propose a global convergent solver for both orientation and
translation estimation, arriving at global localization. Evaluation of the
proposed RING based framework validates the feasibility and demonstrates a
superior performance even under a lower place density
Flaw reconstruction in NDE using a limited number of x-ray radiographic projections
One of the major problems in nondestructive evaluation (NDE) is the evaluation of flaw sizes and locations in a limited inspectability environment. In NDE x-ray radiography, this frequently occurs when the geometry of the part under test does not allow x-ray penetration in certain directions. Other times, the inspection setup in the field does not allow for inspection at all angles around the object. This dissertation presents a model based reconstruction technique which requires a small number of x-ray projections from one side of the object under test. The estimation and reconstruction of model parameters rather than the flaw distribution itself requires much less information, thereby reducing the number of required projections. Crack-like flaws are modeled as piecewise linear curves (connected points) and are reconstructed stereographically from at least two projections by matching corresponding endpoints of the linear segments. Volumetric flaws are modeled as ellipsoids and elliptical slices through ellipsoids. The elliptical principal axes lengths, orientation angles and locations are estimated by fitting a forward model to the projection data. The fitting procedure is highly nonlinear and requires stereographic projections to obtain initial estimates of the model parameters. The methods are tested both on simulated and experimental data. Comparisons are made with models from the field of stereology. Finally, analysis of reconstruction errors is presented for both models
MULTIRIDGELETS FOR TEXTURE ANALYSIS
Directional wavelets have orientation selectivity and thus are able to efficiently represent highly anisotropic elements such as line segments and edges. Ridgelet transform is a kind of directional multi-resolution transform and has been successful in many image processing and texture analysis applications. The objective of this research is to develop multi-ridgelet transform by applying multiwavelet transform to the Radon transform so as to attain attractive improvements. By adapting the cardinal orthogonal multiwavelets to the ridgelet transform, it is shown that the proposed cardinal multiridgelet transform (CMRT) possesses cardinality, approximate translation invariance, and approximate rotation invariance simultaneously, whereas no single ridgelet transform can hold all these properties at the same time. These properties are beneficial to image texture analysis. This is demonstrated in three studies of texture analysis applications. Firstly a texture database retrieval study taking a portion of the Brodatz texture album as an example has demonstrated that the CMRT-based texture representation for database retrieval performed better than other directional wavelet methods. Secondly the study of the LCD mura defect detection was based upon the classification of simulated abnormalities with a linear support vector machine classifier, the CMRT-based analysis of defects were shown to provide efficient features for superior detection performance than other competitive methods. Lastly and the most importantly, a study on the prostate cancer tissue image classification was conducted. With the CMRT-based texture extraction, Gaussian kernel support vector machines have been developed to discriminate prostate cancer Gleason grade 3 versus grade 4. Based on a limited database of prostate specimens, one classifier was trained to have remarkable test performance. This approach is unquestionably promising and is worthy to be fully developed
Recent Progress in Image Deblurring
This paper comprehensively reviews the recent development of image
deblurring, including non-blind/blind, spatially invariant/variant deblurring
techniques. Indeed, these techniques share the same objective of inferring a
latent sharp image from one or several corresponding blurry images, while the
blind deblurring techniques are also required to derive an accurate blur
kernel. Considering the critical role of image restoration in modern imaging
systems to provide high-quality images under complex environments such as
motion, undesirable lighting conditions, and imperfect system components, image
deblurring has attracted growing attention in recent years. From the viewpoint
of how to handle the ill-posedness which is a crucial issue in deblurring
tasks, existing methods can be grouped into five categories: Bayesian inference
framework, variational methods, sparse representation-based methods,
homography-based modeling, and region-based methods. In spite of achieving a
certain level of development, image deblurring, especially the blind case, is
limited in its success by complex application conditions which make the blur
kernel hard to obtain and be spatially variant. We provide a holistic
understanding and deep insight into image deblurring in this review. An
analysis of the empirical evidence for representative methods, practical
issues, as well as a discussion of promising future directions are also
presented.Comment: 53 pages, 17 figure
- …