Search CORE

8,763 research outputs found

Scatteract: Automated extraction of data from scatter plots

Author: JD Hunter
L Yang
MA Fischler
N Siegel
W Huang
WR Shadish
X Lu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 21/04/2017
Field of study

Charts are an excellent way to convey patterns and trends in data, but they do not facilitate further modeling of the data or close inspection of individual data points. We present a fully automated system for extracting the numerical values of data points from images of scatter plots. We use deep learning techniques to identify the key components of the chart, and optical character recognition together with robust regression to map from pixels to the coordinate system of the chart. We focus on scatter plots with linear scales, which already have several interesting challenges. Previous work has done fully automatic extraction for other types of charts, but to our knowledge this is the first approach that is fully automatic for scatter plots. Our method performs well, achieving successful data extraction on 89% of the plots in our test set.Comment: Submitted to ECML PKDD 2017 proceedings, 16 page

arXiv.org e-Print Archive

Crossref

Linking Text and Image with SVG

Author: Cayless Hugh A.
Publication venue: 'Antibodypedia'
Publication date: 01/01/2009
Field of study

Annotation and linking (or referring) have been described as "scholarly primitives", basic methods used in scholarly research and publication of all kinds. The online publication of manuscript images is one basic use case where the need for linking and annotation is very clear. High resolution images are of great use to scholars and transcriptions of texts provide for search and browsing, so the ideal method for the digital publication of manuscript works is the presentation of page images plus a transcription of the text therein. This has become a standard method, but leaves open the questions of how deeply the linkages can be done and how best to handle the annotation of sections of the image. This paper presents a new method (named img2xml) for connecting text and image using an XML-based tracing of the text on the page image. The tracing method was developed as part of a series of experiments in text and image linking beginning in the summer of 2008 and will continue under a grant funded by the National Endowment for the Humanities. It employs Scalable Vector Graphics (SVG) to represent the text in an image of a manuscript page in a referenceable form and enables linking and annotation of the page image in a variety of ways. The paper goes on to discuss the scholarly requirements for tools that will be developed around the tracing method, and explores some of the issues raised by the img2xml method

Kölner UniversitätsPublikationsServer

Visual Representation of Text in Web Documents and Its Interpretation

Author: Antonacopoulos Apostolos
Karatzas Dimosthenis
Publication venue: 'Elsevier BV'
Publication date: 01/01/2004
Field of study

This paper examines the uses of text and its representation on Web documents in terms of the challenges in its interpretation. Particular attention is paid to the significant problem of non-uniform representation of text. This non-uniformity is mainly due to the presence of semantically important text in image form as opposed to the standard encoded text. The issues surrounding text representation in Web documents are discussed in the context of colour perception and spatial representation. The characteristics of the representation of text in image form are examined and research towards interpreting these images of text is briefly described

Southampton (e-Prints Soton)

Content-Based Video Retrieval in Historical Collections of the German Broadcasting Archive

Author: B Zhou
D Albertson
G Marchionini
J Matas
M Mühling
PN Belhumeur
R Salakhutdinov
T Ahonen
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 13/02/2017
Field of study

The German Broadcasting Archive (DRA) maintains the cultural heritage of radio and television broadcasts of the former German Democratic Republic (GDR). The uniqueness and importance of the video material stimulates a large scientific interest in the video content. In this paper, we present an automatic video analysis and retrieval system for searching in historical collections of GDR television recordings. It consists of video analysis algorithms for shot boundary detection, concept classification, person recognition, text recognition and similarity search. The performance of the system is evaluated from a technical and an archival perspective on 2,500 hours of GDR television recordings.Comment: TPDL 2016, Hannover, Germany. Final version is available at Springer via DO

arXiv.org e-Print Archive

Crossref

Visual Representation of Text in Web Documents and Its Interpretation

Author: Antonacopoulos Apostolos
Karatzas Dimosthenis
Publication venue: 'Elsevier BV'
Publication date: 01/01/2004
Field of study

Southampton (e-Prints Soton)

University of Salford Institutional Repository

Interactive Machine Learning (IML) Markup of OCR Generated Text by Exploiting Domain Knowledge: A Biodiversity Case Study

Author: Heidorn P. Bryan
Wei Qin
Publication venue
Publication date: 28/02/2008
Field of study

Illinois Digital Environment for Access to Learning and Scholarship Repository

A Fuzzy Approach to Text Segmentation in Web Images Based on Human Colour Perception

Author: Antonacopoulos Apostolos
Karatzas Dimosthenis
Publication venue: World Scientific Publishing Company
Publication date: 01/12/2003
Field of study

This chapter describes a new approach for the segmentation of text in images on Web pages. In the same spirit as the authors’ previous work on this subject, this approach attempts to model the ability of humans to differentiate between colours. In this case, pixels of similar colour are first grouped using a colour distance defined in a perceptually uniform colour space (as opposed to the commonly used RGB). The resulting colour connected components are then grouped to form larger (character-like) regions with the aid of a propinquity measure, which is the output of a fuzzy inference system. This measure expresses the likelihood for merging two components based on two features. The first feature is the colour distance between the components, in the L*a*b* colour space. The second feature expresses the topological relationship of two components. The results of the method indicate a better performance than previous methods devised by the authors and possibly better (a direct comparison is not really possible due to the differences in application domain characteristics between this and previous methods) performance to other existing methods

Southampton (e-Prints Soton)