Search CORE

158 research outputs found

Curriculum Learning for Handwritten Text Line Recognition

Author: Kermorvant Christopher
Louradour Jérôme
Publication venue
Publication date: 05/12/2013
Field of study

Recurrent Neural Networks (RNN) have recently achieved the best performance in off-line Handwriting Text Recognition. At the same time, learning RNN by gradient descent leads to slow convergence, and training times are particularly long when the training database consists of full lines of text. In this paper, we propose an easy way to accelerate stochastic gradient descent in this set-up, and in the general context of learning to recognize sequences. The principle is called Curriculum Learning, or shaping. The idea is to first learn to recognize short sequences before training on all available training sequences. Experiments on three different handwritten text databases (Rimes, IAM, OpenHaRT) show that a simple implementation of this strategy can significantly speed up the training of RNN for Text Recognition, and even significantly improve performance in some cases

arXiv.org e-Print Archive

Crossref

A comprehensive survey of handwritten document benchmarks: structure, usage and evaluation

Author
Publication venue: Springer
Publication date: 24/12/2015
Field of study

Springer - Publisher Connector

Handwriting word recognition using windowed Bernoulli HMMs

Author: Adrià Giménez
Alfons Juan
Bianne-Bernard
Dehghan
Günter
Ihab Khoury
Jesús Andrés-Ferrer
Marti
Rabiner
Publication venue: 'Elsevier BV'
Publication date: 01/01/2014
Field of study

[EN] Hidden Markov Models (HMMs) are now widely used for off-line handwriting recognition in many lan- guages. As in speech recognition, they are usually built from shared, embedded HMMs at symbol level, where state-conditional probability density functions in each HMM are modeled with Gaussian mixtures. In contrast to speech recognition, however, it is unclear which kind of features should be used and, indeed, very different features sets are in use today. Among them, we have recently proposed to directly use columns of raw, binary image pixels, which are directly fed into embedded Bernoulli (mixture) HMMs, that is, embedded HMMs in which the emission probabilities are modeled with Bernoulli mix- tures. The idea is to by-pass feature extraction and to ensure that no discriminative information is filtered out during feature extraction, which in some sense is integrated into the recognition model. In this work, column bit vectors are extended by means of a sliding window of adequate width to better capture image context at each horizontal position of the word image. Using these windowed Bernoulli mixture HMMs, good results are reported on the well-known IAM and RIMES databases of Latin script, and in particular, state-of-the-art results are provided on the IfN/ENIT database of Arabic handwritten words.Giménez Pastor, A.; Alkhoury, I.; Andrés Ferrer, J.; Juan Císcar, A. (2014). Handwriting word recognition using windowed Bernoulli HMMs. Pattern Recognition Letters. 35:149-156. doi:10.1016/j.patrec.2012.09.002S1491563

Crossref

RiuNet

Window repositioning for Printed Arabic Recognition

Author: Alkhoury Ihab
Andrés Ferrer Jesús
Giménez Pastor Adrián
Juan Alfons
Publication venue: 'Elsevier BV'
Publication date: 01/01/2015
Field of study

[EN] Bernoulli HMMs are conventional HMMs in which the emission probabilities are modeled with Bernoulli mixtures. They have recently been applied, with good results, in off-line text recognition in many languages, in particular, Arabic. A key idea that has proven to be very effective in this application of Bernoulli HMMs is the use of a sliding window of adequate width for feature extraction. This idea has allowed us to obtain very competitive results in the recognition of both Arabic handwriting and printed text. Indeed, a system based on it ranked first at the ICDAR 2011 Arabic recognition competition on the Arabic Printed Text Image (APTI) database. More recently, this idea has been refined by using repositioning techniques for extracted windows, leading to further improvements in Arabic handwriting recognition. In the case of printed text, this refinement led to an improved system which ranked second at the ICDAR 2013 second competition on APTI, only at a marginal distance from the best system. In this work, we describe the development of this improved system. Following evaluation protocols similar to those of the competitions on APTI, exhaustive experiments are detailed from which state-of-the-art results are obtained.The research leading to these results has received funding from the European Union Seventh Framework Programme (FP7/ICT-287755) under grant agreement no. 287755. The research is also supported by the Spanish Government (Plan E, iTrans2 TIN2009-14511 and AECID 2011/2012 grant).Alkhoury, I.; Giménez Pastor, A.; Juan, A.; Andrés Ferrer, J. (2015). Window repositioning for Printed Arabic Recognition. Pattern Recognition Letters. 51:86-93. https://doi.org/10.1016/j.patrec.2014.08.009S86935

RiuNet

Comparison of Bernoulli and Gaussian HMMs using a vertical repositioning technique for off-line handwriting recognition

Author: Andrés Ferrer Jesús
Doetsch Patrick
Giménez Pastor Adrián
Hamdani Mahdi
Juan Císcar Alfonso
Ney Hermann
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2012
Field of study

—In this paper a vertical repositioning method based on the center of gravity is investigated for handwriting recognition systems and evaluated on databases containing Arabic and French handwriting. Experiments show that vertical distortion in images has a large impact on the performance of HMM based handwriting recognition systems. Recently good results were obtained with Bernoulli HMMs (BHMMs) using a preprocessing with vertical repositioning of binarized images. In order to isolate the effect of the preprocessing from the BHMM model, experiments were conducted with Gaussian HMMs and the LSTM-RNN tandem HMM approach with relative improvements of 33% WER on the Arabic and up to 62% on the French database.Doetsch, P.; Hamdani, M.; Ney, H.; Giménez Pastor, A.; Andrés Ferrer, J.; Juan Císcar, A. (2012). Comparison of Bernoulli and Gaussian HMMs using a vertical repositioning technique for off-line handwriting recognition. En 2012 International Conference on Frontiers in Handwriting Recognition ICFHR 2012. Institute of Electrical and Electronics Engineers (IEEE). 3-7. doi:10.1109/ICFHR.2012.194S3

Crossref

RiuNet

Publikationsserver der RWTH Aachen University