2,957 research outputs found
An On-line Handwritten Text Search Method based on Directional Feature Matching
Abstract-In this paper, we describe a method of retrieving online handwritten text based on directional feature matching. Although text search into the character recognition candidate lattice has been elaborated, the character recognition based approach does not support languages which are not assumed. The proposed method is liberated from this constraint. It first hypothetically segments on-line handwritten text into character pattern blocks and prepares the object text patterns by combining the character pattern blocks. On the other hand, it employs handwritten text as a query pattern or prepares a query pattern by combining character ink patterns from query character codes. Then, it extracts directional features from the object text patterns and the query pattern, and the dimensionalities of those features are further reduced by Fisher linear discriminate analysis (FDA). Finally, the similarity is measured between the object text patterns and the query pattern by block-shift matching. This paper discusses the retrieval performance in comparison with our previous character recognition based method
Advances in Character Recognition
This book presents advances in character recognition, and it consists of 12 chapters that cover wide range of topics on different aspects of character recognition. Hopefully, this book will serve as a reference source for academic research, for professionals working in the character recognition field and for all interested in the subject
Linking Image and Text with 2-Way Nets
Linking two data sources is a basic building block in numerous computer
vision problems. Canonical Correlation Analysis (CCA) achieves this by
utilizing a linear optimizer in order to maximize the correlation between the
two views. Recent work makes use of non-linear models, including deep learning
techniques, that optimize the CCA loss in some feature space. In this paper, we
introduce a novel, bi-directional neural network architecture for the task of
matching vectors from two data sources. Our approach employs two tied neural
network channels that project the two views into a common, maximally correlated
space using the Euclidean loss. We show a direct link between the
correlation-based loss and Euclidean loss, enabling the use of Euclidean loss
for correlation maximization. To overcome common Euclidean regression
optimization problems, we modify well-known techniques to our problem,
including batch normalization and dropout. We show state of the art results on
a number of computer vision matching tasks including MNIST image matching and
sentence-image matching on the Flickr8k, Flickr30k and COCO datasets.Comment: 14 pages, 2 figures, 6 table
Shape-Based Plagiarism Detection for Flowchart Figures in Texts
Plagiarism detection is well known phenomenon in the academic arena. Copying
other people is considered as serious offence that needs to be checked. There
are many plagiarism detection systems such as turn-it-in that has been
developed to provide this checks. Most, if not all, discard the figures and
charts before checking for plagiarism. Discarding the figures and charts
results in look holes that people can take advantage. That means people can
plagiarized figures and charts easily without the current plagiarism systems
detecting it. There are very few papers which talks about flowcharts plagiarism
detection. Therefore, there is a need to develop a system that will detect
plagiarism in figures and charts. This paper presents a method for detecting
flow chart figure plagiarism based on shape-based image processing and
multimedia retrieval. The method managed to retrieve flowcharts with ranked
similarity according to different matching sets.Comment: 12 page
Off-line Arabic Character-Based Writer Identification – a Survey
Off-line writer identification requires transferring the text under consideration into an image file. This represents the only available solution to bring the printed materials to the electronic media. However, the transferring process causes the system to lose the temporal information of that text, which it can be gathered in on-line writer identification. Various techniques have been implemented to achieve high identification rates. These techniques have tackled different aspects of the identification system. Importance of writer identification system is to help mainly in forensic fields, historical document analysis and handwriting recognition system enhancement. Unfortunately, the Arabic writer identification system not achieves a satisfaction rate yet whereas certain process of features and classification still not recognized
- …