279 research outputs found

    DTW-Radon-based Shape Descriptor for Pattern Recognition

    Get PDF
    International audienceIn this paper, we present a pattern recognition method that uses dynamic programming (DP) for the alignment of Radon features. The key characteristic of the method is to use dynamic time warping (DTW) to match corresponding pairs of the Radon features for all possible projections. Thanks to DTW, we avoid compressing the feature matrix into a single vector which would otherwise miss information. To reduce the possible number of matchings, we rely on a initial normalisation based on the pattern orientation. A comprehensive study is made using major state-of-the-art shape descriptors over several public datasets of shapes such as graphical symbols (both printed and hand-drawn), handwritten characters and footwear prints. In all tests, the method proves its generic behaviour by providing better recognition performance. Overall, we validate that our method is robust to deformed shape due to distortion, degradation and occlusion

    Partial shape matching using CCP map and weighted graph transformation matching

    Get PDF
    La dĂ©tection de la similaritĂ© ou de la diffĂ©rence entre les images et leur mise en correspondance sont des problĂšmes fondamentaux dans le traitement de l'image. Pour rĂ©soudre ces problĂšmes, on utilise, dans la littĂ©rature, diffĂ©rents algorithmes d'appariement. MalgrĂ© leur nouveautĂ©, ces algorithmes sont pour la plupart inefficaces et ne peuvent pas fonctionner correctement dans les situations d’images bruitĂ©es. Dans ce mĂ©moire, nous rĂ©solvons la plupart des problĂšmes de ces mĂ©thodes en utilisant un algorithme fiable pour segmenter la carte des contours image, appelĂ©e carte des CCPs, et une nouvelle mĂ©thode d'appariement. Dans notre algorithme, nous utilisons un descripteur local qui est rapide Ă  calculer, est invariant aux transformations affines et est fiable pour des objets non rigides et des situations d’occultation. AprĂšs avoir trouvĂ© le meilleur appariement pour chaque contour, nous devons vĂ©rifier si ces derniers sont correctement appariĂ©s. Pour ce faire, nous utilisons l'approche « Weighted Graph Transformation Matching » (WGTM), qui est capable d'Ă©liminer les appariements aberrants en fonction de leur proximitĂ© et de leurs relations gĂ©omĂ©triques. WGTM fonctionne correctement pour les objets Ă  la fois rigides et non rigides et est robuste aux distorsions importantes. Pour Ă©valuer notre mĂ©thode, le jeu de donnĂ©es ETHZ comportant cinq classes diffĂ©rentes d'objets (bouteilles, cygnes, tasses, girafes, logos Apple) est utilisĂ©. Enfin, notre mĂ©thode est comparĂ©e Ă  plusieurs mĂ©thodes cĂ©lĂšbres proposĂ©es par d'autres chercheurs dans la littĂ©rature. Bien que notre mĂ©thode donne un rĂ©sultat comparable Ă  celui des mĂ©thodes de rĂ©fĂ©rence en termes du rappel et de la prĂ©cision de localisation des frontiĂšres, elle amĂ©liore significativement la prĂ©cision moyenne pour toutes les catĂ©gories du jeu de donnĂ©es ETHZ.Matching and detecting similarity or dissimilarity between images is a fundamental problem in image processing. Different matching algorithms are used in literature to solve this fundamental problem. Despite their novelty, these algorithms are mostly inefficient and cannot perform properly in noisy situations. In this thesis, we solve most of the problems of previous methods by using a reliable algorithm for segmenting image contour map, called CCP Map, and a new matching method. In our algorithm, we use a local shape descriptor that is very fast, invariant to affine transform, and robust for dealing with non-rigid objects and occlusion. After finding the best match for the contours, we need to verify if they are correctly matched. For this matter, we use the Weighted Graph Transformation Matching (WGTM) approach, which is capable of removing outliers based on their adjacency and geometrical relationships. WGTM works properly for both rigid and non-rigid objects and is robust to high order distortions. For evaluating our method, the ETHZ dataset including five diverse classes of objects (bottles, swans, mugs, giraffes, apple-logos) is used. Finally, our method is compared to several famous methods proposed by other researchers in the literature. While our method shows a comparable result to other benchmarks in terms of recall and the precision of boundary localization, it significantly improves the average precision for all of the categories in the ETHZ dataset

    Object localization using deformable templates

    Get PDF
    Object localization refers to the detection, matching and segmentation of objects in images. The localization model presented in this paper relies on deformable templates to match objects based on shape alone. The shape structure is captured by a prototype template consisting of hand-drawn edges and contours representing the object to be localized. A multistage, multiresolution algorithm is utilized to reduce the computational intensity of the search. The first stage reduces the physical search space dimensions using correlation to determine the regions of interest where a match it likely to occur. The second stage finds approximate matches between the template and target image at progressively finer resolutions, by attracting the template to salient image features using Edge Potential Fields. The third stage entails the use of evolutionary optimization to determine control point placement for a Local Weighted Mean warp, which deforms the template to fit the object boundaries. Results are presented for a number of applications, showing the successful localization of various objects. The algorithm’s invariance to rotation, scale, translation and moderate shape variation of the target objects is clearly illustrated

    Modal matching : a method for describing, comparing, and manipulating digital signals

    Get PDF
    Thesis (Ph. D.)--Massachusetts Institute of Technology, Program in Media Arts & Sciences, 1995.Includes bibliographical references (leaves 134-144).by Stanley Edward Sclaroff.Ph.D

    Using contour information and segmentation for object registration, modeling and retrieval

    Get PDF
    This thesis considers different aspects of the utilization of contour information and syntactic and semantic image segmentation for object registration, modeling and retrieval in the context of content-based indexing and retrieval in large collections of images. Target applications include retrieval in collections of closed silhouettes, holistic w ord recognition in handwritten historical manuscripts and shape registration. Also, the thesis explores the feasibility of contour-based syntactic features for improving the correspondence of the output of bottom-up segmentation to semantic objects present in the scene and discusses the feasibility of different strategies for image analysis utilizing contour information, e.g. segmentation driven by visual features versus segmentation driven by shape models or semi-automatic in selected application scenarios. There are three contributions in this thesis. The first contribution considers structure analysis based on the shape and spatial configuration of image regions (socalled syntactic visual features) and their utilization for automatic image segmentation. The second contribution is the study of novel shape features, matching algorithms and similarity measures. Various applications of the proposed solutions are presented throughout the thesis providing the basis for the third contribution which is a discussion of the feasibility of different recognition strategies utilizing contour information. In each case, the performance and generality of the proposed approach has been analyzed based on extensive rigorous experimentation using as large as possible test collections

    Symbolic and Visual Retrieval of Mathematical Notation using Formula Graph Symbol Pair Matching and Structural Alignment

    Get PDF
    Large data collections containing millions of math formulae in different formats are available on-line. Retrieving math expressions from these collections is challenging. We propose a framework for retrieval of mathematical notation using symbol pairs extracted from visual and semantic representations of mathematical expressions on the symbolic domain for retrieval of text documents. We further adapt our model for retrieval of mathematical notation on images and lecture videos. Graph-based representations are used on each modality to describe math formulas. For symbolic formula retrieval, where the structure is known, we use symbol layout trees and operator trees. For image-based formula retrieval, since the structure is unknown we use a more general Line of Sight graph representation. Paths of these graphs define symbol pairs tuples that are used as the entries for our inverted index of mathematical notation. Our retrieval framework uses a three-stage approach with a fast selection of candidates as the first layer, a more detailed matching algorithm with similarity metric computation in the second stage, and finally when relevance assessments are available, we use an optional third layer with linear regression for estimation of relevance using multiple similarity scores for final re-ranking. Our model has been evaluated using large collections of documents, and preliminary results are presented for videos and cross-modal search. The proposed framework can be adapted for other domains like chemistry or technical diagrams where two visually similar elements from a collection are usually related to each other

    On Visualizing Branched Surface: an Angle/Area Preserving Approach

    Get PDF
    The techniques of surface deformation and mapping are useful tools for the visualization of medical surfaces, especially for highly undulated or branched surfaces. In this thesis, two algorithms are presented for flattened visualizations of multi-branched medical surfaces, such as vessels. The first algorithm is an angle preserving approach, which is based on conformal analysis. The mapping function is obtained by minimizing two Dirichlet functionals. On a triangulated representation of vessel surfaces, this algorithm can be implemented efficiently using a finite element method. The second algorithm adjusts the result from conformal mapping to produce a flattened representation of the original surface while preserving areas. It employs the theory of optimal mass transport via a gradient descent approach. A new class of image morphing algorithms is also considered based on the theory of optimal mass transport. The mass moving energy functional is revised by adding an intensity penalizing term, in order to reduce the undesired "fading" effects. It is a parameter free approach. This technique has been applied on several natural and medical images to generate in-between image sequences.Ph.D.Allen Tannenbaum Committee Chair Anthony J. Yezzi, Committee Member; James Gruden, Committee Member; May D. Wang, Committee Member; Oskar Skrinjar, Committee Membe
    • 

    corecore