Search CORE

3,700 research outputs found

An Anthropocentric Approach to Text Extraction from WWW Images

Author: Antonacopoulos Apostolos
Karatzas Dimosthenis
Publication venue
Publication date: 01/01/2000
Field of study

There is a significant need to analyse the text in images on WWW pages, both for effective indexing and for presentation by non-visual means (e.g., audio). This paper argues that the extraction of text from such images benefits from an anthropocentric approach in the distinction between colour regions. The novelty of the idea is the use of a human perspective of colour perception in preference to RGB colour space analysis. This enables the extraction of text in complex situations such as in the presence of varying colour and texture (characters and background). More precisely, characters are extracted as distinct regions with separate chromaticity and/or luminance by performing a layer decomposition of the image. The method described here is the first in our systematic approach to approximate the human colour perception characteristics for the identification of character regions. In this instance, the image is decomposed by performing histogram analysis of Hue and Luminance and merging in the HLS colour space

Southampton (e-Prints Soton)

Visual Representation of Text in Web Documents and Its Interpretation

Author: Antonacopoulos Apostolos
Karatzas Dimosthenis
Publication venue: 'Elsevier BV'
Publication date: 01/01/2004
Field of study

This paper examines the uses of text and its representation on Web documents in terms of the challenges in its interpretation. Particular attention is paid to the significant problem of non-uniform representation of text. This non-uniformity is mainly due to the presence of semantically important text in image form as opposed to the standard encoded text. The issues surrounding text representation in Web documents are discussed in the context of colour perception and spatial representation. The characteristics of the representation of text in image form are examined and research towards interpreting these images of text is briefly described

Crossref

Southampton (e-Prints Soton)

University of Salford Institutional Repository

A fine-grained approach to scene text script identification

Author: Gomez Lluis
Karatzas Dimosthenis
Publication venue
Publication date: 24/02/2016
Field of study

This paper focuses on the problem of script identification in unconstrained scenarios. Script identification is an important prerequisite to recognition, and an indispensable condition for automatic text understanding systems designed for multi-language environments. Although widely studied for document images and handwritten documents, it remains an almost unexplored territory for scene text images. We detail a novel method for script identification in natural images that combines convolutional features and the Naive-Bayes Nearest Neighbor classifier. The proposed framework efficiently exploits the discriminative power of small stroke-parts, in a fine-grained classification framework. In addition, we propose a new public benchmark dataset for the evaluation of joint text detection and script identification in natural scenes. Experiments done in this new dataset demonstrate that the proposed method yields state of the art results, while it generalizes well to different datasets and variable number of scripts. The evidence provided shows that multi-lingual scene text recognition in the wild is a viable proposition. Source code of the proposed method is made available online

arXiv.org e-Print Archive

Crossref

Object Proposals for Text Extraction in the Wild

Author: Gomez Lluis
Karatzas Dimosthenis
Publication venue
Publication date: 08/09/2015
Field of study

Object Proposals is a recent computer vision technique receiving increasing interest from the research community. Its main objective is to generate a relatively small set of bounding box proposals that are most likely to contain objects of interest. The use of Object Proposals techniques in the scene text understanding field is innovative. Motivated by the success of powerful while expensive techniques to recognize words in a holistic way, Object Proposals techniques emerge as an alternative to the traditional text detectors. In this paper we study to what extent the existing generic Object Proposals methods may be useful for scene text understanding. Also, we propose a new Object Proposals algorithm that is specifically designed for text and compare it with other generic methods in the state of the art. Experiments show that our proposal is superior in its ability of producing good quality word proposals in an efficient way. The source code of our method is made publicly available.Comment: 13th International Conference on Document Analysis and Recognition (ICDAR 2015

arXiv.org e-Print Archive

Crossref

Text Extraction from Web Images Based on A Split-and-Merge Segmentation Method Using Color Perception

Author: Antonacopoulos Apostolos
Karatzas Dimosthenis
Publication venue
Publication date: 01/01/2004
Field of study

This paper describes a complete approach to the segmentation and extraction of text from Web images for subsequent recognition, to ultimately achieve both effective indexing and presentation by non-visual means (e.g., audio). The method described here (the first in the authors’ systematic approach to exploit human colour perception) enables the extraction of text in complex situations such as in the presence of varying colour (characters and background). More precisely, in addition to using structural features, the segmentation follows a split-and-merge strategy based on the Hue-Lightness- Saturation (HLS) representation of colour as a first approximation of an anthropocentric expression of the differences in chromaticity and lightness. Character-like components are then extracted as forming textlines in a number of orientations and along curves

Southampton (e-Prints Soton)

Visual Representation of Text in Web Documents and Its Interpretation

Author: Antonacopoulos Apostolos
Karatzas Dimosthenis
Publication venue: 'Elsevier BV'
Publication date: 01/01/2004
Field of study

Southampton (e-Prints Soton)

Semantics-Based Content Extraction in Typewritten Historical Documents

Author: Antonacopoulos Apostolos
Karatzas Dimosthenis
Publication venue
Publication date: 01/01/2005
Field of study

This paper presents a flexible approach to extracting content from scanned historical documents using semantic information. The final electronic document is the result of a "digital historical document lifecycle" process, where the expert knowledge of the historian/archivist user is incorporated at different stages. Results show that such a conversion strategy aided by (expert) user-specified semantic information and which enables the processing of individual parts of the document in a specialised way, produces superior (in a variety of significant ways) results than document analysis and understanding techniques devised for contemporary documents

Southampton (e-Prints Soton)

A Display Calibration Technique based on Invariant Human Colour Mechanisms

Author: Karatzas Dimosthenis
Meyer Georg
Wuerger Sophie
Publication venue
Publication date: 01/01/2005
Field of study

When human observers are asked to adjust a coloured light such that it appears neither red nor green, or such that it appears neither yellow nor blue, most colour-normal observers have no difficulty in making these adjustments. These hue judgements are not significantly influenced by language or age [Saunders and van Brakel 1997] and individual differences in colour sensitivity are not reflected in the unique-hue settings [Webster et al. 2000]. The human colour system seems to be able to calibrate itself so that there is a remarkable agreement across observers in relation to these unique-hue judgements. Here we show how we can use the invariance of these unique-hue judgements to develop a colour calibration technique for display devices, which eliminates the need for an external calibration standard or a measurement device

Southampton (e-Prints Soton)