Search CORE

320 research outputs found

Online Handwritten Chinese/Japanese Character Recognition

Author: Nakagawa Masaki
Zhu Bilan
Publication venue: 'IntechOpen'
Publication date: 07/11/2012
Field of study

Special Radical Detection by Statistical Classification for On-line Handwritten Chinese Character Recognition

Author: Delaye Adrien
Liu Cheng-Lin
Long-Long Ma
Publication venue: HAL CCSD
Publication date: 16/11/2010
Field of study

International audienceThe hierarchical nature of Chinese characters has inspired radical-based recognition, but radical segmentation from characters remains a challenge. We previously proposed a radical-based approach for on-line handwritten Chinese character recognition, which incorporates character structure knowledge into integrated radical segmentation and recognition, and performs well on characters of left-right and up-down structures (non-special structures). In this paper, we propose a statistical-classification-based method for detecting special radicals from special-structure characters. We design 19 binary classifiers for classifying candidate radicals (groups of strokes) hypothesized from the input character. Characters with special radicals detected are recognized using special-structure models, while those without special radicals are recognized using the models for non-special structures. We applied the recognition framework to 6,763 character classes, and achieved promising recognition performance in experiments

HAL Descartes

Hal-Diderot

HAL-Rennes 1

Zone Segmentation and Thinning based Algorithm for Segmentation of Devnagari Text

Author: Er. Japneet Kaur, Er. Suppandeep Kaur, D
Publication venue: 'Auricle Technologies, Pvt., Ltd.'
Publication date: 30/11/2015
Field of study

Character segmentation of handwritten documents is an challenging research topic due to its diverse application environment.OCR can be used for automated processing and handling of forms, old corrupted reports, bank cheques, postal codes and structures. Now Segmentation of a word into characters is one of the major challenge in optical character recognition. This is even more challenging when we segment characters in an offline handwritten document and the next hurdle is presence of broken ,touching and overlapped characters in devnagari script. So, in this paper we have introduced an algorithm that will segment both broken as well as touching characters in devnagari script. Now to segment these characters the algorithm uses both zone segmentation and thinning based techniques. We have used 85 words each for isolated, broken, touching and both broken as well as touching characters individually. Results achieved while segmentation of broken as well as touching are 96.2 % on an average

International Journal on Recent and Innovation Trends in Computing and Communication

Advances in Character Recognition

Author
Publication venue: 'IntechOpen'
Publication date: 20/04/2021
Field of study

This book presents advances in character recognition, and it consists of 12 chapters that cover wide range of topics on different aspects of character recognition. Hopefully, this book will serve as a reference source for academic research, for professionals working in the character recognition field and for all interested in the subject

Directory of Open Access Books (DOAB)

A comprehensive survey of handwritten document benchmarks: structure, usage and evaluation

Author
Publication venue: Springer
Publication date: 24/12/2015
Field of study

Springer - Publisher Connector

A limited-size ensemble of homogeneous CNN/LSTMs for high-performance word classification

Author: A Graves
A Graves
A Graves
A Vinciarelli
B Oommen
B Peleg
B Shi
B Stuner
C Wells
C-L Liu
CE Shannon
D Asonov
DM Ford
DR Hardoon
E Okafor
G Seni
G Seni
GK Zipf
J Almazán
J Sueiras
JP Van Oosten
JT Favata
M Côté
M Stehlìk
MA Youssef Bassil
NQ Emlen
PE Bramall
R Ptucha
RA Wagner
RC Angell
RJ Plamondon
RT Schuh
S Günter
S Günter
S He
S Hochreiter
SC Chantal Amrhein
T Van der Zant
T Van der Zant
TK Ho
U-V Marti
VI Levenshtein
VP Romesh Ranawana
Y LeCun
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/07/2021
Field of study

The strength of long short-term memory neural networks (LSTMs) that have been applied is more located in handling sequences of variable length than in handling geometric variability of the image patterns. In this paper, an end-to-end convolutional LSTM neural network is used to handle both geometric variation and sequence variability. The best results for LSTMs are often based on large-scale training of an ensemble of network instances. We show that high performances can be reached on a common benchmark set by using proper data augmentation for just five such networks using a proper coding scheme and a proper voting scheme. The networks have similar architectures (convolutional neural network (CNN): five layers, bidirectional LSTM (BiLSTM): three layers followed by a connectionist temporal classification (CTC) processing step). The approach assumes differently scaled input images and different feature map sizes. Three datasets are used: the standard benchmark RIMES dataset (French); a historical handwritten dataset KdK (Dutch); the standard benchmark George Washington (GW) dataset (English). Final performance obtained for the word-recognition test of RIMES was 96.6%, a clear improvement over other state-of-the-art approaches which did not use a pre-trained network. On the KdK and GW datasets, our approach also shows good results. The proposed approach is deployed in the Monk search engine for historical-handwriting collections

Proceedings - University of Groningen

Crossref

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen