164 research outputs found
Robust and efficient Fourier-Mellin transform approximations for invariant grey-level image description and reconstruction
International audienceThis paper addresses the gray-level image representation ability of the Fourier-Mellin Transform (FMT) for pattern recognition, reconstruction and image database retrieval. The main practical di±culty of the FMT lies in the accuracy and e±ciency of its numerical approximation and we propose three estimations of its analytical extension. Comparison of these approximations is performed from discrete and ¯nite-extent sets of Fourier- Mellin harmonics by means of experiments in: (i) image reconstruction via both visual inspection and the computation of a reconstruction error; and (ii) pattern recognition and discrimination by using a complete and convergent set of features invariant under planar similarities. Experimental results on real gray-level images show that it is possible to recover an image to within a speci¯ed degree of accuracy and to classify objects reliably even when a large set of descriptors is used. Finally, an example will be given, illustrating both theoretical and numerical results in the context of content-based image retrieval
Color Image Analysis by Quaternion-Type Moments
International audienceIn this paper, by using the quaternion algebra, the conventional complex-type moments (CTMs) for gray-scale images are generalized to color images as quaternion-type moments (QTMs) in a holistic manner. We first provide a general formula of QTMs from which we derive a set of quaternion-valued QTM invariants (QTMIs) to image rotation, scale and translation transformations by eliminating the influence of transformation parameters. An efficient computation algorithm is also proposed so as to reduce computational complexity. The performance of the proposed QTMs and QTMIs are evaluated considering several application frameworks ranging from color image reconstruction, face recognition to image registration. We show they achieve better performance than CTMs and CTM invariants (CTMIs). We also discuss the choice of the unit pure quaternion influence with the help of experiments. appears to be an optimal choice
Modular Adaptive System Based on a Multi-Stage Neural Structure for Recognition of 2D Objects of Discontinuous Production
This is a presentation of a new system for invariant recognition of 2D
objects with overlapping classes, that can not be effectively recognized with
the traditional methods. The translation, scale and partial rotation invariant
contour object description is transformed in a DCT spectrum space. The obtained
frequency spectrums are decomposed into frequency bands in order to feed
different BPG neural nets (NNs). The NNs are structured in three stages -
filtering and full rotation invariance; partial recognition; general
classification. The designed multi-stage BPG Neural Structure shows very good
accuracy and flexibility when tested with 2D objects used in the discontinuous
production. The reached speed and the opportunuty for an easy restructuring and
reprogramming of the system makes it suitable for application in different
applied systems for real time work.Comment: www.ars-journal.co
On The Potential of Image Moments for Medical Diagnosis
Medical imaging is widely used for diagnosis and postoperative or post-therapy monitoring. The ever-increasing number of images produced has encouraged the introduction of automated methods to assist doctors or pathologists. In recent years, especially after the advent of convolutional neural networks, many researchers have focused on this approach, considering it to be the only method for diagnosis since it can perform a direct classification of images. However, many diagnostic systems still rely on handcrafted features to improve interpretability and limit resource consumption. In this work, we focused our efforts on orthogonal moments, first by providing an overview and taxonomy of their macrocategories and then by analysing their classification performance on very different medical tasks represented by four public benchmark data sets. The results confirmed that convolutional neural networks achieved excellent performance on all tasks. Despite being composed of much fewer features than those extracted by the networks, orthogonal moments proved to be competitive with them, showing comparable and, in some cases, better performance. In addition, Cartesian and harmonic categories provided a very low standard deviation, proving their robustness in medical diagnostic tasks. We strongly believe that the integration of the studied orthogonal moments can lead to more robust and reliable diagnostic systems, considering the performance obtained and the low variation of the results. Finally, since they have been shown to be effective on both magnetic resonance and computed tomography images, they can be easily extended to other imaging techniques
Application of statistical learning theory to plankton image analysis
Submitted to the Joint Program in Applied Ocean Science and Engineering
in partial fulfillment of the requirements for the degree of Doctor of Philosophy
At the Massachusetts Institute of Technology
and the Woods Hole Oceanographic Institution
June 2006A fundamental problem in limnology and oceanography is the inability to quickly
identify and map distributions of plankton. This thesis addresses the problem by
applying statistical machine learning to video images collected by an optical sampler,
the Video Plankton Recorder (VPR). The research is focused on development
of a real-time automatic plankton recognition system to estimate plankton abundance.
The system includes four major components: pattern representation/feature
measurement, feature extraction/selection, classification, and abundance estimation.
After an extensive study on a traditional learning vector quantization (LVQ)
neural network (NN) classifier built on shape-based features and different pattern
representation methods, I developed a classification system combined multi-scale cooccurrence matrices feature with support vector machine classifier. This new method
outperforms the traditional shape-based-NN classifier method by 12% in classification
accuracy. Subsequent plankton abundance estimates are improved in the regions of
low relative abundance by more than 50%.
Both the NN and SVM classifiers have no rejection metrics. In this thesis, two
rejection metrics were developed. One was based on the Euclidean distance in the
feature space for NN classifier. The other used dual classifier (NN and SVM) voting as
output. Using the dual-classification method alone yields almost as good abundance
estimation as human labeling on a test-bed of real world data. However, the distance
rejection metric for NN classifier might be more useful when the training samples are
not “good” ie, representative of the field data.
In summary, this thesis advances the current state-of-the-art plankton recognition
system by demonstrating multi-scale texture-based features are more suitable
for classifying field-collected images. The system was verified on a very large realworld
dataset in systematic way for the first time. The accomplishments include developing a multi-scale occurrence matrices and support vector machine system, a dual-classification system, automatic correction in abundance estimation, and ability to get accurate abundance estimation from real-time automatic classification. The methods developed are generic and are likely to work on range of other image classification applications.This work was supported by National Science Foundation Grants OCE-9820099
and Woods Hole Oceanographic Institution academic program
Automated Target Acquisition, Recognition and Tracking (ATTRACT)
The primary objective of phase 1 of this research project is to conduct multidisciplinary research that will contribute to fundamental scientific knowledge in several of the USAF critical technology areas. Specifically, neural networks, signal processing techniques, and electro-optic capabilities are utilized to solve problems associated with automated target acquisition, recognition, and tracking. To accomplish the stated objective, several tasks have been identified and were executed
- …