Search CORE

29,184 research outputs found

Handwritten Document Analysis for Automatic Writer Recognition

Author: Bensefia Ameur
Heutte Laurent
Paquet Thierry
Publication venue: 'Universitat Autonoma de Barcelona'
Publication date: 01/01/2005
Field of study

In this paper, we show that both the writer identification and the writer verification tasks can be carried out using local features such as graphemes extracted from the segmentation of cursive handwriting. We thus enlarge the scope of the possible use of these two tasks which have been, up to now, mainly evaluated on script handwritings. A textual based Information Retrieval model is used for the writer identification stage. This allows the use of a particular feature space based on feature frequencies. Image queries are handwritten documents projected in this feature space. The approach achieves 95% correct identification on the PSI_DataBase and 86% on the IAM_DataBase. Then writer hypothesis retrieved are analysed during a verification phase. We call upon a mutual information criterion to verify that two documents may have been produced by the same writer or not. Hypothesis testing is used for this purpose. The proposed method is first scaled on the PSI_DataBase then evaluated on the IAM_DataBase. On both databases, similar performance of nearly 96% correct verification is reported, thus making the approach general and very promising for large scale applications in the domain of handwritten document querying and writer verification

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Directory of Open Access Journals

Electronic Letters on Computer Vision and Image Analysis (ELCVIA - Universitat Autònoma de Barcelona)

Diposit Digital de Documents de la UAB

A Comprehensive Study of ImageNet Pre-Training for Historical Document Image Analysis

Author: Alberti Michele
Fischer Andreas
Goktepe Pinar
Ingold Rolf
Kolonko Thomas
Liwicki Marcus
Pondenkandath Vinaychandran
Studer Linda
Publication venue
Publication date: 22/05/2019
Field of study

Automatic analysis of scanned historical documents comprises a wide range of image analysis tasks, which are often challenging for machine learning due to a lack of human-annotated learning samples. With the advent of deep neural networks, a promising way to cope with the lack of training data is to pre-train models on images from a different domain and then fine-tune them on historical documents. In the current research, a typical example of such cross-domain transfer learning is the use of neural networks that have been pre-trained on the ImageNet database for object recognition. It remains a mostly open question whether or not this pre-training helps to analyse historical documents, which have fundamentally different image properties when compared with ImageNet. In this paper, we present a comprehensive empirical survey on the effect of ImageNet pre-training for diverse historical document analysis tasks, including character recognition, style classification, manuscript dating, semantic segmentation, and content-based retrieval. While we obtain mixed results for semantic segmentation at pixel-level, we observe a clear trend across different network architectures that ImageNet pre-training has a positive effect on classification as well as content-based retrieval

arXiv.org e-Print Archive

Crossref

Hes-so: ArODES Open Archive (University of Applied Sciences and Arts Western Switzerland / Haute école spécialisée de Suisse occidentale / FH Westschweiz)

ACCESSING REFERENTIAL INFORMATION DURING TEXT COMPOSITION : WHEN AND WHY ?

Author: Alamargot Denis
Dansac Christophe
Publication venue: Amsterdam University Press
Publication date: 01/01/1999
Field of study

When composing a text, writers have to continually shift between content planning and content translating. This continuous shifting gives the writing activity its cyclic nature. The first section of this paper will analyse the writing process as a hierarchical cyclic activity. A methodological paradigm will be proposed for the investigation of the writing process. In the second section, we will partially present two experiments that were conducted independently, with this paradigm. Both give a coherent and interesting picture of what happens with content while the writer is planning. The characteristics of cycles depend both on the nature of the content information being recovered and on the complexity of the processes applied to this content

CogPrints Cognitive Sciences Eprint Archive

Sparse Radial Sampling LBP for Writer Identification

Author: Bagdanov Andrew D.
Karatzas Dimosthenis
Liwicki Marcus
Nicolaou Anguelos
Publication venue
Publication date: 23/04/2015
Field of study

In this paper we present the use of Sparse Radial Sampling Local Binary Patterns, a variant of Local Binary Patterns (LBP) for text-as-texture classification. By adapting and extending the standard LBP operator to the particularities of text we get a generic text-as-texture classification scheme and apply it to writer identification. In experiments on CVL and ICDAR 2013 datasets, the proposed feature-set demonstrates State-Of-the-Art (SOA) performance. Among the SOA, the proposed method is the only one that is based on dense extraction of a single local feature descriptor. This makes it fast and applicable at the earliest stages in a DIA pipeline without the need for segmentation, binarization, or extraction of multiple features.Comment: Submitted to the 13th International Conference on Document Analysis and Recognition (ICDAR 2015

arXiv.org e-Print Archive

Crossref

Deep Adaptive Learning for Writer Identification based on Single Handwritten Word Images

Author: He Sheng
Schomaker Lambert
Publication venue: 'Elsevier BV'
Publication date: 28/09/2018
Field of study

There are two types of information in each handwritten word image: explicit information which can be easily read or derived directly, such as lexical content or word length, and implicit attributes such as the author's identity. Whether features learned by a neural network for one task can be used for another task remains an open question. In this paper, we present a deep adaptive learning method for writer identification based on single-word images using multi-task learning. An auxiliary task is added to the training process to enforce the emergence of reusable features. Our proposed method transfers the benefits of the learned features of a convolutional neural network from an auxiliary task such as explicit content recognition to the main task of writer identification in a single procedure. Specifically, we propose a new adaptive convolutional layer to exploit the learned deep features. A multi-task neural network with one or several adaptive convolutional layers is trained end-to-end, to exploit robust generic features for a specific main task, i.e., writer identification. Three auxiliary tasks, corresponding to three explicit attributes of handwritten word images (lexical content, word length and character attributes), are evaluated. Experimental results on two benchmark datasets show that the proposed deep adaptive learning method can improve the performance of writer identification based on single-word images, compared to non-adaptive and simple linear-adaptive approaches.Comment: Under view of Pattern Recognitio

arXiv.org e-Print Archive

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen

HMM-based Offline Recognition of Handwritten Words Crossed Out with Different Kinds of Strokes

Author: Likforman-Sulem L.
Vinciarelli A.
Publication venue
Publication date: 01/01/2008
Field of study

In this work, we investigate the recognition of words that have been crossed-out by the writers and are thus degraded. The degradation consists of one or more ink strokes that span the whole word length and simulate the signs that writers use to cross out the words. The simulated strokes are superimposed to the original clean word images. We considered two types of strokes: wave-trajectory strokes created with splines curves and line-trajectory strokes generated with the delta-lognormal model of rapid line movements. The experiments have been performed using a recognition system based on hidden Markov models and the results show that the performance decrease is moderate for single writer data and light strokes, but severe for multiple writer data

Enlighten