17 research outputs found

    Modular Adaptive System Based on a Multi-Stage Neural Structure for Recognition of 2D Objects of Discontinuous Production

    Full text link
    This is a presentation of a new system for invariant recognition of 2D objects with overlapping classes, that can not be effectively recognized with the traditional methods. The translation, scale and partial rotation invariant contour object description is transformed in a DCT spectrum space. The obtained frequency spectrums are decomposed into frequency bands in order to feed different BPG neural nets (NNs). The NNs are structured in three stages - filtering and full rotation invariance; partial recognition; general classification. The designed multi-stage BPG Neural Structure shows very good accuracy and flexibility when tested with 2D objects used in the discontinuous production. The reached speed and the opportunuty for an easy restructuring and reprogramming of the system makes it suitable for application in different applied systems for real time work.Comment: www.ars-journal.co

    Computationally efficient wavelet affine invariant functions for 2D object recognition

    Get PDF
    In this paper, an affine invariant function is presented for object recognition from wavelet coefficients of the object boundary. In previous works, undecimated wavelet transform was used for affine invariant functions. In this paper, an algorithm based on decimated wavelet transform is developed to compute the affine invariant function. As a result, computational complexity is significantly reduced without decreasing recognition performance. Experimental results are presented

    A wavelet based method for affine invariant 2D object recognition

    Get PDF
    Recognizing objects that have undergone certain viewing transformations is an important problem in the field of computer vision. Most current research has focused almost exclusively on single aspects of the problem, concentrating on a few geometric transformations and distortions. Probably, the most important one is the affine transformation which may be considered as an approximation to perspective transformation. Many algorithms were developed for this purpose. Most popular ones are Fourier descriptors and moment based methods. Another powerful tool to recognize affine transformed objects, is the invariants of implicit polynomials. These three methods are usually called as traditional methods. Wavelet-based affine invariant functions are recent contributions to the solution of the problem. This method is better at recognition and more robust to noise compared to other methods. These functions mostly rely on the object contour and undecimated wavelet transform. In this thesis, a technique is developed to recognize objects undergoing a general affine transformation. Affine invariant functions are used, based on on image projections and high-pass filtered images of objects at projection angles . Decimated Wavelet Transform is used instead of undecimated Wavelet Transform. We compared our method with the an another wavelet based affine invariant function, Khalil-Bayoumi and also with traditional methods

    A Temporal Neural Trace of Wavelet Coefficients in Human Object Vision: An MEG Study

    Get PDF
    Wavelet transform has been widely used in image and signal processing applications such as denoising and compression. In this study, we explore the relation of the wavelet representation of stimuli with MEG signals acquired from a human object recognition experiment. To investigate the signature of wavelet descriptors in the visual system, we apply five levels of multi-resolution wavelet decomposition to the stimuli presented to participants during MEG recording and extract the approximation and detail sub-bands (horizontal, vertical, diagonal) coefficients in each level of decomposition. Apart from, employing multivariate pattern analysis (MVPA), a linear support vector classifier (SVM) is trained and tested over the time on MEG pattern vectors to decode neural information. Then, we calculate the representational dissimilarity matrix (RDM) on each time point of the MEG data and also on wavelet descriptors using classifier accuracy and one minus Pearson correlation coefficient, respectively. Given the time-courses calculated from performing the Pearson correlation between the wavelet descriptors RDMs and MEG decoding accuracy in each time point, our result shows that the peak latency of the wavelet approximation time courses occurs later for higher level coefficients. Furthermore, studying the neural trace of detail sub-bands indicates that the overall number of statistically significant time points for the horizontal and vertical detail coefficients is noticeably higher than diagonal detail coefficients, confirming the evidence of the oblique effect that the horizontal and vertical lines are more decodable in the human brain

    Shape-based Insect Classification: a Hybrid Region-based and Contour-based Approach

    Get PDF
    The American Burying Beetle (ABB) (Nicrophorus americanus) is a critically endangered insect whose distribution is limited to several states at the periphery of its historical range in the eastern and central United States. The objective of this study is to develop a digital image classification algorithm that will be used in an autonomous monitoring system to be attached to existing ABB traps that will detect, image, classify and report insects to species as they enter the trap. A training set of 92 individual specimens representing 11 insect species with shape similarity from the Oklahoma State University Entomology Museum was used in this study. Starting with a color digital image, an unsupervised preprocessing algorithm extracts each insect shape, converts it to a binary image, and then aligns it for classification using pattern recognition techniques. For region-based and contour-based shape representation methods, an area component and a Fourier descriptor methods are implemented for shape representation and classification. Analysis of initial classification results revealed that the pose variability of insect legs and antennae introduced excessive uncertainty in the feature space. To address this, a novel shape decomposition algorithm based on curvature theory is proposed to remove legs and antennae from the insect shape automatically prior to classification. This shape decomposition approach increased overall classification accuracy from 64% to 76% and 57% to 67% for area component and Fourier descriptor methods respectively. To further improve classification accuracy, a hybrid approach using a decision fusion technique has also been implemented after initial classification by each method. This resulted in 100% classification accuracy for ABB and 90% overall classification accuracy for the 11 species (total 92 images) investigated.Electrical Engineerin

    Maximum Energy Subsampling: A General Scheme For Multi-resolution Image Representation And Analysis

    Get PDF
    Image descriptors play an important role in image representation and analysis. Multi-resolution image descriptors can effectively characterize complex images and extract their hidden information. Wavelets descriptors have been widely used in multi-resolution image analysis. However, making the wavelets transform shift and rotation invariant produces redundancy and requires complex matching processes. As to other multi-resolution descriptors, they usually depend on other theories or information, such as filtering function, prior-domain knowledge, etc.; that not only increases the computation complexity, but also generates errors. We propose a novel multi-resolution scheme that is capable of transforming any kind of image descriptor into its multi-resolution structure with high computation accuracy and efficiency. Our multi-resolution scheme is based on sub-sampling an image into an odd-even image tree. Through applying image descriptors to the odd-even image tree, we get the relative multi-resolution image descriptors. Multi-resolution analysis is based on downsampling expansion with maximum energy extraction followed by upsampling reconstruction. Since the maximum energy usually retained in the lowest frequency coefficients; we do maximum energy extraction through keeping the lowest coefficients from each resolution level. Our multi-resolution scheme can analyze images recursively and effectively without introducing artifacts or changes to the original images, produce multi-resolution representations, obtain higher resolution images only using information from lower resolutions, compress data, filter noise, extract effective image features and be implemented in parallel processing

    Transform-based surface analysis and representation for CAD models

    Get PDF
    In most Computer-Aided Design (CAD) systems, the topological and geometrical information in a CAD model is usually represented by the edge-based data structure. With the emergence of concurrent engineering, such issues as product design, manufacturing, and process planning are considered simultaneously at the design stage. The need for the development of high-level models for completely documenting the geometry of a product and supporting manufacturing applications, such as automating the verification of a design for manufacturing (DIM) rules and generating process plans, becomes apparent;This dissertation has addressed the development of a generalized framework for high-level geometric representations of CAD models and form features to automate algorithmic search and retrieval of manufacturing information;A new wavelet-based ranking algorithm is developed to generate surface-based representations as input for the extraction of form features with non-planar surfaces in CAD models. The objective of using a wavelet-based shape analysis approach is to overcome the main limitation of the alternative feature extraction approaches, namely their restriction to planar surfaces or simple curved surfaces;A transform-invariant coding system for CAD models by multi-scale wavelet representations is also presented. The coding procedure is based on both the internal regions and external contours of topology entities---faces

    A biologically inspired optical flow system for motion detection and object identification

    Get PDF
    The entire dissertation/thesis text is included in the research.pdf file; the official abstract appears in the short.pdf file (which also appears in the research.pdf); a non-technical general description, or public abstract, appears in the public.pdf file.Title from title screen of research.pdf file (viewed on April 7, 2008)Includes bibliographical references.Thesis (M.S.) University of Missouri-Columbia 2007.Dissertations, Academic -- University of Missouri--Columbia -- Electrical and computer engineering.Optical flow is possibly the best known method for motion segmentation. However its application is restricted to offline processing as it requires extensive computational resources and time. This thesis explores an optical flow method derived from observation on vision system of diptereous insect. The proposed method , Biological Optical flow (BioOF) was implemented using series of first order filters, and, therefore is much faster than any existing machine coded optical flow algorithm beside being hardware implement able. Like other optical flow methods, the output of proposed BioOF has two components: horizontal optical flow and vertical optical flow; both of them can be combined in order to get a better final result in terms of motion segmentation. Unfortunately, this combined output of the BioOF can be heavily coupled with noise. So, in order to remove the noise, intensive image processing had to be performed. The result was an algorithm that can provide a good contour of the segmented object in an image. Finally the object contour is converted to a Fourier feature space leading to a representation that is rotational and translational invariant. Over this feature space various classification algorithms including SVM, feature subset forward selection, Scatter matrix, and a simple linear classifier using principal component analysis and Mahanabolis distance were investigated

    Study of object recognition and identification based on shape and texture analysis

    Get PDF
    The objective of object recognition is to enable computers to recognize image patterns without human intervention. According to its applications, it is mainly divided into two parts: recognition of object categories and detection/identification of objects. My thesis studied the techniques of object feature analysis and identification strategies, which solve the object recognition problem by employing effective and perceptually important object features. The shape information is of particular interest and a review of the shape representation and description is presented, as well as the latest research work on object recognition. In the second chapter of the thesis, a novel content-based approach is proposed for efficient shape classification and retrieval of 2D objects. Two object detection approaches, which are designed according to the characteristics of the shape context and SIFT descriptors, respectively, are analyzed and compared. It is found that the identification strategy constructed on a single type of object feature is only able to recognize the target object under specific conditions which the identifier is adapted to. These identifiers are usually designed to detect the target objects which are rich in the feature type captured by the identifier. In addition, this type of feature often distinguishes the target object from the complex scene. To overcome this constraint, a novel prototyped-based object identification method is presented to detect the target object in the complex scene by employing different types of descriptors to capture the heterogeneous features. All types of descriptors are modified to meet the requirement of the detection strategy’s framework. Thus this new method is able to describe and identify various kinds of objects whose dominant features are quite different. The identification system employs the cosine similarity to evaluate the resemblance between the prototype image and image windows on the complex scene. Then a ‘resemblance map’ is established with values on each patch representing the likelihood of the target object’s presence. The simulation approved that this novel object detection strategy is efficient, robust and of scale and rotation invariance
    corecore