158 research outputs found
Curriculum Learning for Handwritten Text Line Recognition
Recurrent Neural Networks (RNN) have recently achieved the best performance
in off-line Handwriting Text Recognition. At the same time, learning RNN by
gradient descent leads to slow convergence, and training times are particularly
long when the training database consists of full lines of text. In this paper,
we propose an easy way to accelerate stochastic gradient descent in this
set-up, and in the general context of learning to recognize sequences. The
principle is called Curriculum Learning, or shaping. The idea is to first learn
to recognize short sequences before training on all available training
sequences. Experiments on three different handwritten text databases (Rimes,
IAM, OpenHaRT) show that a simple implementation of this strategy can
significantly speed up the training of RNN for Text Recognition, and even
significantly improve performance in some cases
Handwriting word recognition using windowed Bernoulli HMMs
[EN] Hidden Markov Models (HMMs) are now widely used for off-line handwriting recognition in many lan-
guages. As in speech recognition, they are usually built from shared, embedded HMMs at symbol level,
where state-conditional probability density functions in each HMM are modeled with Gaussian mixtures.
In contrast to speech recognition, however, it is unclear which kind of features should be used and,
indeed, very different features sets are in use today. Among them, we have recently proposed to directly
use columns of raw, binary image pixels, which are directly fed into embedded Bernoulli (mixture)
HMMs, that is, embedded HMMs in which the emission probabilities are modeled with Bernoulli mix-
tures. The idea is to by-pass feature extraction and to ensure that no discriminative information is filtered
out during feature extraction, which in some sense is integrated into the recognition model. In this work,
column bit vectors are extended by means of a sliding window of adequate width to better capture image
context at each horizontal position of the word image. Using these windowed Bernoulli mixture HMMs,
good results are reported on the well-known IAM and RIMES databases of Latin script, and in particular,
state-of-the-art results are provided on the IfN/ENIT database of Arabic handwritten words.Giménez Pastor, A.; Alkhoury, I.; Andrés Ferrer, J.; Juan CÃscar, A. (2014). Handwriting word recognition using windowed Bernoulli HMMs. Pattern Recognition Letters. 35:149-156. doi:10.1016/j.patrec.2012.09.002S1491563
Window repositioning for Printed Arabic Recognition
[EN] Bernoulli HMMs are conventional HMMs in which the emission probabilities are modeled with Bernoulli mixtures. They have recently been applied, with good results, in off-line text recognition in many languages, in particular, Arabic. A key idea that has proven to be very effective in this application of Bernoulli HMMs is the use of a sliding window of adequate width for feature extraction. This idea has allowed us to obtain very competitive results in the recognition of both Arabic handwriting and printed text. Indeed, a system based on it ranked first at the ICDAR 2011 Arabic recognition competition on the Arabic Printed Text Image (APTI) database. More recently, this idea has been refined by using repositioning techniques for extracted windows, leading to further improvements in Arabic handwriting recognition. In the case of printed text, this refinement led to an improved system which ranked second at the ICDAR 2013 second competition on APTI, only at a marginal distance from the best system. In this work, we describe the development of this improved system. Following evaluation protocols similar to those of the competitions on APTI, exhaustive experiments are detailed from which state-of-the-art results are obtained.The research leading to these results has received funding from the European Union Seventh Framework Programme (FP7/ICT-287755) under grant agreement no. 287755. The research is also supported by the Spanish Government (Plan E, iTrans2 TIN2009-14511 and AECID 2011/2012 grant).Alkhoury, I.; Giménez Pastor, A.; Juan, A.; Andrés Ferrer, J. (2015). Window repositioning for Printed Arabic Recognition. Pattern Recognition Letters. 51:86-93. https://doi.org/10.1016/j.patrec.2014.08.009S86935
Comparison of Bernoulli and Gaussian HMMs using a vertical repositioning technique for off-line handwriting recognition
—In this paper a vertical repositioning method
based on the center of gravity is investigated for handwriting
recognition systems and evaluated on databases containing
Arabic and French handwriting. Experiments show that vertical
distortion in images has a large impact on the performance
of HMM based handwriting recognition systems. Recently good
results were obtained with Bernoulli HMMs (BHMMs) using a
preprocessing with vertical repositioning of binarized images.
In order to isolate the effect of the preprocessing from the
BHMM model, experiments were conducted with Gaussian
HMMs and the LSTM-RNN tandem HMM approach with
relative improvements of 33% WER on the Arabic and up
to 62% on the French database.Doetsch, P.; Hamdani, M.; Ney, H.; Giménez Pastor, A.; Andrés Ferrer, J.; Juan CÃscar, A. (2012). Comparison of Bernoulli and Gaussian HMMs using a vertical repositioning technique for off-line handwriting recognition. En 2012 International Conference on Frontiers in Handwriting Recognition ICFHR 2012. Institute of Electrical and Electronics Engineers (IEEE). 3-7. doi:10.1109/ICFHR.2012.194S3
- …