2,957 research outputs found

    An On-line Handwritten Text Search Method based on Directional Feature Matching

    Get PDF
    Abstract-In this paper, we describe a method of retrieving online handwritten text based on directional feature matching. Although text search into the character recognition candidate lattice has been elaborated, the character recognition based approach does not support languages which are not assumed. The proposed method is liberated from this constraint. It first hypothetically segments on-line handwritten text into character pattern blocks and prepares the object text patterns by combining the character pattern blocks. On the other hand, it employs handwritten text as a query pattern or prepares a query pattern by combining character ink patterns from query character codes. Then, it extracts directional features from the object text patterns and the query pattern, and the dimensionalities of those features are further reduced by Fisher linear discriminate analysis (FDA). Finally, the similarity is measured between the object text patterns and the query pattern by block-shift matching. This paper discusses the retrieval performance in comparison with our previous character recognition based method

    Online Handwritten Chinese/Japanese Character Recognition

    Get PDF

    Advances in Character Recognition

    Get PDF
    This book presents advances in character recognition, and it consists of 12 chapters that cover wide range of topics on different aspects of character recognition. Hopefully, this book will serve as a reference source for academic research, for professionals working in the character recognition field and for all interested in the subject

    Linking Image and Text with 2-Way Nets

    Full text link
    Linking two data sources is a basic building block in numerous computer vision problems. Canonical Correlation Analysis (CCA) achieves this by utilizing a linear optimizer in order to maximize the correlation between the two views. Recent work makes use of non-linear models, including deep learning techniques, that optimize the CCA loss in some feature space. In this paper, we introduce a novel, bi-directional neural network architecture for the task of matching vectors from two data sources. Our approach employs two tied neural network channels that project the two views into a common, maximally correlated space using the Euclidean loss. We show a direct link between the correlation-based loss and Euclidean loss, enabling the use of Euclidean loss for correlation maximization. To overcome common Euclidean regression optimization problems, we modify well-known techniques to our problem, including batch normalization and dropout. We show state of the art results on a number of computer vision matching tasks including MNIST image matching and sentence-image matching on the Flickr8k, Flickr30k and COCO datasets.Comment: 14 pages, 2 figures, 6 table

    Shape-Based Plagiarism Detection for Flowchart Figures in Texts

    Full text link
    Plagiarism detection is well known phenomenon in the academic arena. Copying other people is considered as serious offence that needs to be checked. There are many plagiarism detection systems such as turn-it-in that has been developed to provide this checks. Most, if not all, discard the figures and charts before checking for plagiarism. Discarding the figures and charts results in look holes that people can take advantage. That means people can plagiarized figures and charts easily without the current plagiarism systems detecting it. There are very few papers which talks about flowcharts plagiarism detection. Therefore, there is a need to develop a system that will detect plagiarism in figures and charts. This paper presents a method for detecting flow chart figure plagiarism based on shape-based image processing and multimedia retrieval. The method managed to retrieve flowcharts with ranked similarity according to different matching sets.Comment: 12 page

    Off-line Arabic Character-Based Writer Identification – a Survey

    Get PDF
    Off-line writer identification requires transferring the text under consideration into an image file. This represents the only available solution to bring the printed materials to the electronic media. However, the transferring process causes the system to lose the temporal information of that text, which it can be gathered in  on-line writer identification. Various techniques have been implemented to achieve high identification rates. These techniques have tackled different aspects of the identification system. Importance of writer identification system is to help mainly in forensic fields, historical document analysis and  handwriting recognition system enhancement. Unfortunately, the Arabic writer identification system not achieves a satisfaction rate yet whereas certain process of features and classification still not recognized
    corecore