Search CORE

1,549 research outputs found

A Neural Multi-sequence Alignment TeCHnique (NeuMATCH)

Author: Dogan Pelin
Gross Markus
Li Boyang
Sigal Leonid
Publication venue
Publication date: 09/04/2018
Field of study

The alignment of heterogeneous sequential data (video to text) is an important and challenging problem. Standard techniques for this task, including Dynamic Time Warping (DTW) and Conditional Random Fields (CRFs), suffer from inherent drawbacks. Mainly, the Markov assumption implies that, given the immediate past, future alignment decisions are independent of further history. The separation between similarity computation and alignment decision also prevents end-to-end training. In this paper, we propose an end-to-end neural architecture where alignment actions are implemented as moving data between stacks of Long Short-term Memory (LSTM) blocks. This flexible architecture supports a large variety of alignment tasks, including one-to-one, one-to-many, skipping unmatched elements, and (with extensions) non-monotonic alignment. Extensive experiments on semi-synthetic and real datasets show that our algorithm outperforms state-of-the-art baselines.Comment: Accepted at CVPR 2018 (Spotlight). arXiv file includes the paper and the supplemental materia

arXiv.org e-Print Archive

Repository for Publications and Research Data

Crossref

Intelligent Combination of Structural Analysis Algorithms: Application to Mathematical Expression Recognition

Author: Pillay Amit
Publication venue: RIT Scholar Works
Publication date: 01/06/2014
Field of study

Structural analysis is an important step in many document based recognition problem. Structural analysis is performed to associate elements in a document and assign meaning to their association. Handwritten mathematical expression recognition is one such problem which has been studied and researched for long. Many techniques have been researched to build a system that produce high performance mathematical expression recognition. We have presented a novel method to combine multiple structural recognition algorithms in which the combined result shows better performance than each individual recognition algorithms. In our experiment we have applied our method to combine multiple mathematical expression recognition parsers called DRACULAE. We have used Graph Transformation Network (GTN) which is a network of function based systems in which each system takes graphs as input, apply function and produces a graph as output. GTN is used to combine multiple DRACULAE parsers and its parameter are tuned using gradient based learning. It has been shown that such a combination method can be used to accentuate the strength of individual algorithms in combination to produce better combination result which higher recognition performance. In our experiment we were able to obtain a highest recognition rate of 74% as compared to best recognition result of 70% from individual DRACULAE parsers. Our experiment also resulted into a maximum of 20% reduction of parent recognition errors and maximum 37% reduction in relation recognition errors between symbols in expressions

RIT Scholar Works

Preprocessing for Images Captured by Cameras

Author: Fan Kuo-Chin
Lue Hsin-Te
Wen Ming-Gang
Yu Chih-Chang
Publication venue: 'IntechOpen'
Publication date: 07/11/2012
Field of study

IntechOpen

Advances in Character Recognition

Author
Publication venue: 'IntechOpen'
Publication date: 20/04/2021
Field of study

This book presents advances in character recognition, and it consists of 12 chapters that cover wide range of topics on different aspects of character recognition. Hopefully, this book will serve as a reference source for academic research, for professionals working in the character recognition field and for all interested in the subject

Directory of Open Access Books (DOAB)

Topological inference in graphs and images

Author: Vandaele Robin
Publication venue: Universiteit Gent. Faculteit Ingenieurswetenschappen en Architectuur
Publication date: 01/01/2020
Field of study

Ghent University Academic Bibliography