Search CORE

159 research outputs found

DTW-Radon-based Shape Descriptor for Pattern Recognition

Author: K.C. Santosh
Lamiroy Bart
Wendling Laurent
Publication venue: 'World Scientific Pub Co Pte Lt'
Publication date: 27/03/2013
Field of study

International audienceIn this paper, we present a pattern recognition method that uses dynamic programming (DP) for the alignment of Radon features. The key characteristic of the method is to use dynamic time warping (DTW) to match corresponding pairs of the Radon features for all possible projections. Thanks to DTW, we avoid compressing the feature matrix into a single vector which would otherwise miss information. To reduce the possible number of matchings, we rely on a initial normalisation based on the pattern orientation. A comprehensive study is made using major state-of-the-art shape descriptors over several public datasets of shapes such as graphical symbols (both printed and hand-drawn), handwritten characters and footwear prints. In all tests, the method proves its generic behaviour by providing better recognition performance. Overall, we validate that our method is robust to deformed shape due to distortion, degradation and occlusion

INRIA a CCSD electronic archive server

HAL Descartes

Comparing and Evaluating HMM Ensemble Training Algorithms Using Train and Test and Condition Number Criteria

Author: Davis Richard I. A.
Lovell Brian C.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2004
Field of study

Hidden Markov Models have many applications in signal processing and pattern recognition, but their convergence-based training algorithms are known to suffer from over-sensitivity to the initial random model choice. This paper describes the boundary between regions in which ensemble learning is superior to Rabiner's multiplesequence Baum-Welch training method, and proposes techniques for determining the best method in any arbitrary situation. It also studies the suitability of the training methods using the condition number, a recently proposed diagnostic tool for testing the quality of the model. A new method for training Hidden Markov Models called the Viterbi Path counting algorithm is introduced and is found to produce significantly better performance than current methods in a range of trials

CiteSeerX

Crossref

University of Queensland eSpace

Texture descriptor combining fractal dimension and artificial crawlers

Author: Bruno Odemir Martinez
Gonçalves Wesley Nunes
Machado Bruno Brandoli
Publication venue: 'Elsevier BV'
Publication date: 20/11/2013
Field of study

Texture is an important visual attribute used to describe images. There are many methods available for texture analysis. However, they do not capture the details richness of the image surface. In this paper, we propose a new method to describe textures using the artificial crawler model. This model assumes that each agent can interact with the environment and each other. Since this swarm system alone does not achieve a good discrimination, we developed a new method to increase the discriminatory power of artificial crawlers, together with the fractal dimension theory. Here, we estimated the fractal dimension by the Bouligand-Minkowski method due to its precision in quantifying structural properties of images. We validate our method on two texture datasets and the experimental results reveal that our method leads to highly discriminative textural features. The results indicate that our method can be used in different texture applications.Comment: 12 pages 9 figures. Paper in press: Physica A: Statistical Mechanics and its Application

arXiv.org e-Print Archive

Universidade de São Paulo

Robust Face Recognition Providing the Identity and its Reliability Degree Combining Sparse Representation and Multiple Features

Author: G. Grossi
J. Lin
R. Lanzarotti
Publication venue: 'World Scientific Pub Co Pte Lt'
Publication date: 01/06/2016
Field of study

For decades, face recognition (FR) has attracted a lot of attention, and several systems have been successfully developed to solve this problem. However, the issue deserves further research effort so as to reduce the still existing gap between the computer and human ability in solving it. Among the others, one of the human skills concerns his ability in naturally conferring a \u201cdegree of reliability\u201d to the face identification he carried out. We believe that providing a FR system with this feature would be of great help in real application contexts, making more flexible and treatable the identification process. In this spirit, we propose a completely automatic FR system robust to possible adverse illuminations and facial expression variations that provides together with the identity the corresponding degree of reliability. The method promotes sparse coding of multi-feature representations with LDA projections for dimensionality reduction, and uses a multistage classifier. The method has been evaluated in the challenging condition of having few (3\u20135) images per subject in the gallery. Extended experiments on several challenging databases (frontal faces of Extended YaleB, BANCA, FRGC v2.0, and frontal faces of Multi-PIE) show that our method outperforms several state-of-the-art sparse coding FR systems, thus demonstrating its effectiveness and generalizability

AIR Universita degli studi di Milano

Multiscale Fractal Descriptors Applied to Nanoscale Images

Author: Bruno Odemir M.
Florindo João B.
Pereira Ernesto C.
Sikora Mariana S.
Publication venue
Publication date: 16/01/2012
Field of study

This work proposes the application of fractal descriptors to the analysis of nanoscale materials under different experimental conditions. We obtain descriptors for images from the sample applying a multiscale transform to the calculation of fractal dimension of a surface map of such image. Particularly, we have used the}Bouligand-Minkowski fractal dimension. We applied these descriptors to discriminate between two titanium oxide films prepared under different experimental conditions. Results demonstrate the discrimination power of proposed descriptors in such kind of application

arXiv.org e-Print Archive

RCAAP - Repositório Científico de Acesso Aberto de Portugal

Universidade de São Paulo

Hidden Markov Models for Spatio-Temporal Pattern Recognition and Image Segmentation

Author: Lovell Brian C.
Publication venue: Allied Publishers
Publication date: 01/01/2003
Field of study

Time and again hidden Markov models have been demonstrated to be highly effective in one-dimensional pattern recognition and classification problems such as speech recognition. A great deal of attention is now focussed on 2-D and possibly 3-D applications arising from problems encountered in computer vision in domains such as gesture, face, and handwriting recognition. Despite their widespread usage and numerous successful applications, there are few analytical results which can explain their remarkably good performance and guide researchers in selecting topologies and parameters to improve classification performance

University of Queensland eSpace

Geometry based Three-Dimensional Image Processing Method for Electronic Cluster Eye

Author: Jian T.
Neri Ferrante
Wu S.
Zhang G.
Zhu M.
Publication venue: 'IOS Press'
Publication date: 09/01/2018
Field of study

The file attached to this record is the author's final peer reviewed version. The Publisher's final version can be found by following the DOI linkIn recent years, much attention has been paid to the electronic cluster eye (eCley), a new type of artificial compound eyes, because of its small size, wide field of view (FOV) and sensitivity to motion objects. An eCley is composed of a certain number of optical channels organized as an array. Each optical channel spans a small and fixed field of view (FOV). To obtain a complete image with a full FOV, the images from all the optical channels are required to be fused together. The parallax from unparallel neighboring optical channels in eCley may lead to reconstructed image blurring and incorrectly estimated depth. To solve this problem, this paper proposes a geometry based three-dimensional image processing method (G3D) for eCley to obtain a complete focused image and dense depth map. In G3D, we derive the geometry relationship of optical channels in eCley to obtain the mathematical relation between the parallax and depth among unparallel neighboring optical channels. Based on the geometry relationship, all of the optical channels are used to estimate the depth map and reconstruct a focused image. Subsequently, by using an edge-aware interpolation method, we can further gain a sharply focused image and a depth map. The effectiveness of the proposed method is verified by the experimental results

De Montfort University Open Research Archive