Full Page Handwriting Recognition via Image to Sequence Extraction

Karayev, Sergey; Singh, Sumeet S.

Authors: Sergey Karayev, Sumeet S. Singh
Publication date: 10 March 2021
Abstract

We present a neural-network-based Handwritten Text Recognition (HTR) model architecture that can be trained to recognize full pages of handwritten or printed text without image segmentation. Because it is based on an image-to-sequence architecture, it can be trained to extract the text present in an image and sequence it correctly without imposing any constraints on language, shape of characters, or orientation and layout of text and non-text. The model can also be trained to generate auxiliary markup related to formatting, layout and content. We use a character-level token vocabulary, thereby supporting proper nouns and terminology of any subject. The model achieves a new state of the art in full-page recognition on the IAM dataset. When evaluated on scans of real-world handwritten free-form test answers (a dataset beset with curved and slanted lines, drawings, tables, math, chemistry and other symbols), it performs better than all commercially available HTR APIs. It is deployed in production as part of a commercial web application.

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2103.06450

Last time updated on 13/03/2021