28 research outputs found
Automatic Dating of Historical Documents
With the growing number of digitized documents available to researchers it is becoming possible to answer scientific questions by simply analyzing the image content. In this article, a new approach for the automatic dating of historical documents is proposed. It is based on an approach only recently proposed for scribe identification. It uses local RootSIFT descriptors which are encoded using VLAD. The method is evaluated using a dataset consisting of context areas of medieval papal charters covering around 150 years from 1049 to 1198 AD. Experimental results show very promising mean absolute errors of about 17 years
Sparse Radial Sampling LBP for Writer Identification
In this paper we present the use of Sparse Radial Sampling Local Binary
Patterns, a variant of Local Binary Patterns (LBP) for text-as-texture
classification. By adapting and extending the standard LBP operator to the
particularities of text we get a generic text-as-texture classification scheme
and apply it to writer identification. In experiments on CVL and ICDAR 2013
datasets, the proposed feature-set demonstrates State-Of-the-Art (SOA)
performance. Among the SOA, the proposed method is the only one that is based
on dense extraction of a single local feature descriptor. This makes it fast
and applicable at the earliest stages in a DIA pipeline without the need for
segmentation, binarization, or extraction of multiple features.Comment: Submitted to the 13th International Conference on Document Analysis
and Recognition (ICDAR 2015
Dissimilarity Gaussian Mixture Models for Efficient Offline Handwritten Text-Independent Identification using SIFT and RootSIFT Descriptors
Handwriting biometrics is the science of identifying the behavioural aspect of an individual’s writing style and exploiting it to develop automated writer identification and verification systems. This paper presents an efficient handwriting identification system which combines Scale Invariant Feature Transform (SIFT) and RootSIFT descriptors in a set of Gaussian mixture models (GMM). In particular, a new concept of similarity and dissimilarity Gaussian mixture models (SGMM and DGMM) is introduced. While a SGMM is constructed for every writer to describe the intra-class similarity that is exhibited between the handwritten texts of the same writer, a DGMM represents the contrast or dissimilarity that exists between the writer’s style on one hand and other different handwriting styles on the other hand. Furthermore, because the handwritten text is described by a number of key point descriptors where each descriptor generates a SGMM/DGMM score, a new weighted histogram method is proposed to derive the intermediate prediction score for each writer’s GMM. The idea of weighted histogram exploits the fact that handwritings from the same writer should exhibit more similar textual patterns than dissimilar ones, hence, by penalizing the bad scores with a cost function, the identification rate can be significantly enhanced. Our proposed system has been extensively assessed using six different public datasets (including three English, two Arabic and one hybrid language) and the results have shown the superiority of the proposed system over state-of-the-art techniques
GR-RNN:Global-Context Residual Recurrent Neural Networks for Writer Identification
This paper presents an end-to-end neural network system to identify writers
through handwritten word images, which jointly integrates global-context
information and a sequence of local fragment-based features. The global-context
information is extracted from the tail of the neural network by a global
average pooling step. The sequence of local and fragment-based features is
extracted from a low-level deep feature map which contains subtle information
about the handwriting style. The spatial relationship between the sequence of
fragments is modeled by the recurrent neural network (RNN) to strengthen the
discriminative ability of the local fragment features. We leverage the
complementary information between the global-context and local fragments,
resulting in the proposed global-context residual recurrent neural network
(GR-RNN) method. The proposed method is evaluated on four public data sets and
experimental results demonstrate that it can provide state-of-the-art
performance. In addition, the neural networks trained on gray-scale images
provide better results than neural networks trained on binarized and contour
images, indicating that texture information plays an important role for writer
identification.
The source code will be available:
\url{https://github.com/shengfly/writer-identification}.Comment: To appear: Pattern Recognitio