Search CORE

4,804 research outputs found

On-the-fly Historical Handwritten Text Annotation

Author: Hast Anders
Vats Ekta
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 06/09/2017
Field of study

The performance of information retrieval algorithms depends upon the availability of ground truth labels annotated by experts. This is an important prerequisite, and difficulties arise when the annotated ground truth labels are incorrect or incomplete due to high levels of degradation. To address this problem, this paper presents a simple method to perform on-the-fly annotation of degraded historical handwritten text in ancient manuscripts. The proposed method aims at quick generation of ground truth and correction of inaccurate annotations such that the bounding box perfectly encapsulates the word, and contains no added noise from the background or surroundings. This method will potentially be of help to historians and researchers in generating and correcting word labels in a document dynamically. The effectiveness of the annotation method is empirically evaluated on an archival manuscript collection from well-known publicly available datasets

arXiv.org e-Print Archive

Crossref

Bridging the Semantic Gap in Multimedia Information Retrieval: Top-down and Bottom-up approaches

Author: Enser Peter G.B.
Hare Jonathon S.
Lewis Paul H.
Martinez Kirk
Sandom Christine J.
Sinclair Patrick A. S.
Publication venue
Publication date: 01/01/2006
Field of study

Semantic representation of multimedia information is vital for enabling the kind of multimedia search capabilities that professional searchers require. Manual annotation is often not possible because of the shear scale of the multimedia information that needs indexing. This paper explores the ways in which we are using both top-down, ontologically driven approaches and bottom-up, automatic-annotation approaches to provide retrieval facilities to users. We also discuss many of the current techniques that we are investigating to combine these top-down and bottom-up approaches

CiteSeerX

Southampton (e-Prints Soton)

Markerless Motion Capture in the Crowd

Author: Bregler Christoph
Huston Thomas
Spiro Ian
Publication venue
Publication date: 01/01/2012
Field of study

This work uses crowdsourcing to obtain motion capture data from video recordings. The data is obtained by information workers who click repeatedly to indicate body configurations in the frames of a video, resulting in a model of 2D structure over time. We discuss techniques to optimize the tracking task and strategies for maximizing accuracy and efficiency. We show visualizations of a variety of motions captured with our pipeline then apply reconstruction techniques to derive 3D structure.Comment: Presented at Collective Intelligence conference, 2012 (arXiv:1204.2991

arXiv.org e-Print Archive

CiteSeerX

READ-BAD: A New Dataset and Evaluation Scheme for Baseline Detection in Archival Documents

Author: Diem Markus
Fiel Stefan
Grüning Tobias
Kleber Florian
Labahn Roger
Publication venue
Publication date: 11/12/2017
Field of study

Text line detection is crucial for any application associated with Automatic Text Recognition or Keyword Spotting. Modern algorithms perform good on well-established datasets since they either comprise clean data or simple/homogeneous page layouts. We have collected and annotated 2036 archival document images from different locations and time periods. The dataset contains varying page layouts and degradations that challenge text line segmentation methods. Well established text line segmentation evaluation schemes such as the Detection Rate or Recognition Accuracy demand for binarized data that is annotated on a pixel level. Producing ground truth by these means is laborious and not needed to determine a method's quality. In this paper we propose a new evaluation scheme that is based on baselines. The proposed scheme has no need for binarization and it can handle skewed as well as rotated text lines. The ICDAR 2017 Competition on Baseline Detection and the ICDAR 2017 Competition on Layout Analysis for Challenging Medieval Manuscripts used this evaluation scheme. Finally, we present results achieved by a recently published text line detection algorithm.Comment: Submitted to DAS201

arXiv.org e-Print Archive

Crossref

An examination of automatic video retrieval technology on access to the contents of an historical video archive

Author: Auld Dan
Petrelli Daniela
Publication venue: 'Emerald'
Publication date: 01/01/2008
Field of study

Purpose – This paper aims to provide an initial understanding of the constraints that historical video collections pose to video retrieval technology and the potential that online access offers to both archive and users. Design/methodology/approach – A small and unique collection of videos on customs and folklore was used as a case study. Multiple methods were employed to investigate the effectiveness of technology and the modality of user access. Automatic keyframe extraction was tested on the visual content while the audio stream was used for automatic classification of speech and music clips. The user access (search vs browse) was assessed in a controlled user evaluation. A focus group and a survey provided insight on the actual use of the analogue archive. The results of these multiple studies were then compared and integrated (triangulation). Findings – The amateur material challenged automatic techniques for video and audio indexing, thus suggesting that the technology must be tested against the material before deciding on a digitisation strategy. Two user interaction modalities, browsing vs searching, were tested in a user evaluation. Results show users preferred searching, but browsing becomes essential when the search engine fails in matching query and indexed words. Browsing was also valued for serendipitous discovery; however the organisation of the archive was judged cryptic and therefore of limited use. This indicates that the categorisation of an online archive should be thought of in terms of users who might not understand the current classification. The focus group and the survey showed clearly the advantage of online access even when the quality of the video surrogate is poor. The evidence gathered suggests that the creation of a digital version of a video archive requires a rethinking of the collection in terms of the new medium: a new archive should be specially designed to exploit the potential that the digital medium offers. Similarly, users' needs have to be considered before designing the digital library interface, as needs are likely to be different from those imagined. Originality/value – This paper is the first attempt to understand the advantages offered and limitations held by video retrieval technology for small video archives like those often found in special collections

Crossref

Sheffield Hallam University Research Archive

White Rose Research Online

From Handwritten Manuscripts to Linked Data

Author: Plaat Aske
Stork Lise
van den Herik Jaap
Verbeek Fons
Weber Andreas
Wolstencroft Katherine
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

Computer Systems, Imagery and Medi

Crossref

Leiden University Scholary Publications

University of Twente Research Information

Rethinking authenticity in digital art preservation

Author: Innocenti P.
Publication venue
Publication date: 01/01/2012
Field of study

In this paper I am discussing the repositioning of traditional conservation concepts of historicity, authenticity and versioning in relation to born digital artworks, upon findings from my research on preservation of computer-based artifacts. Challenges for digital art preservation and previous work in this area are described, followed by an analysis of digital art as a process of components interaction, as performance and in terms of instantiations. The concept of dynamic authenticity is proposed, and it is argued that our approach to digital artworks preservation should be variable and digital object responsive, with a level of variability tolerance to match digital art intrinsic variability and dynamic authenticity

Enlighten