Illustrations Segmentation in Digitized Documents Using Local Correlation Features
In this paper we propose an approach for Document Layout Analysis based on local correlation features. We identify and extract illustrations in digitized documents by learning the discriminative patterns of textual and pictorial regions. The proposal has been demonstrated to be effective on historical datasets and to outperform the state of the art in the presence of challenging documents with a large variety of pictorial elements.
A survey of comics research in computer science
Graphical novels such as comics and manga are well known all over the world. The digital transition has started to change the way people read comics: more and more on smartphones and tablets, and less and less on paper. In recent years, a wide variety of research about comics has been proposed and might change the way comics are created, distributed and read in the coming years. Early work focused on low-level document image analysis: comic books are indeed complex, containing text, drawings, balloons, panels, onomatopoeia, etc. Different fields of computer science, such as multimedia, artificial intelligence and human-computer interaction, have covered research about user interaction and content generation, each with different sets of values. In this paper we review previous research about comics in computer science, state what has been done, and give some insights about the main outlooks.
You Actually Look Twice At it (YALTAi): using an object detection approach instead of region segmentation within the Kraken engine
Layout Analysis (the identification of zones and their classification) is the first step, along with line segmentation, in Optical Character Recognition and similar tasks. The ability to distinguish the main body of text from marginal text or running titles makes the difference between extracting the full text of a digitized book and producing noisy output. We show that most segmenters focus on pixel classification and that polygonization of this output has not been used as a target in the latest competitions on historical documents (ICDAR 2017 and onwards), despite being the focus in the early 2010s. We propose to shift, for efficiency, the task from pixel-classification-based polygonization to object detection using isothetic rectangles. We compare the output of Kraken and YOLOv5 in terms of segmentation and show that the latter significantly outperforms the former on small datasets (1,110 samples and below). We release two datasets for training and evaluation on historical documents as well as a new package, YALTAi, which injects YOLOv5 into the segmentation pipeline of Kraken 4.1.
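The shift described above can be illustrated with a small sketch (not from the paper itself): instead of predicting a pixel mask and then polygonizing it, an object detector predicts an isothetic (axis-aligned) rectangle per zone. The function below shows what that rectangular representation amounts to, by reducing a pixel-level region mask to its isothetic bounding rectangle; the function name and mask are illustrative only.

```python
# Sketch: reducing a pixel-level region mask to an isothetic
# (axis-aligned) rectangle, the representation an object-detection
# approach predicts directly instead of polygonizing a
# pixel-classification map.

def isothetic_rectangle(mask):
    """Return (x_min, y_min, x_max, y_max) of the True/1 pixels in a
    2-D mask given as a list of rows, or None if the mask is empty."""
    coords = [(x, y) for y, row in enumerate(mask)
                     for x, v in enumerate(row) if v]
    if not coords:
        return None
    xs = [x for x, _ in coords]
    ys = [y for _, y in coords]
    return (min(xs), min(ys), max(xs), max(ys))

# A toy "text zone" mask: the rectangle covers every foreground pixel.
mask = [
    [0, 0, 0, 0, 0],
    [0, 1, 1, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 0, 0, 0],
]
print(isothetic_rectangle(mask))  # (1, 1, 3, 2)
```

A rectangle discards the exact region outline that a polygon would keep, which is precisely the efficiency trade-off the abstract proposes.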
Speculative practices : utilizing InfoVis to explore untapped literary collections
Funding: Canadian Social Sciences and Humanities Research Council. In this paper we exemplify how information visualization supports speculative thinking, hypothesis testing, and preliminary interpretation processes as part of literary research. While InfoVis has become a buzz topic in the digital humanities, skepticism remains about how effectively it integrates into and expands on traditional humanities research approaches. From an InfoVis perspective, we lack case studies that show the specific design challenges that make literary studies and humanities research at large a unique application area for information visualization. We examine these questions through our case study of the Speculative W@nderverse, a visualization tool that was designed to enable the analysis and exploration of an untapped literary collection consisting of thousands of science fiction short stories. We present the results of two empirical studies that involved general-interest readers and literary scholars who used the evolving visualization prototype as part of their research for over a year. Our findings suggest a design space for visualizing literary collections that is defined by (1) their academic and public relevance, (2) the tension between qualitative vs. quantitative methods of interpretation, (3) result- vs. process-driven approaches to InfoVis, and (4) the unique material and visual qualities of cultural collections. Through the Speculative W@nderverse we demonstrate how visualization can bridge these sometimes contradictory perspectives by cultivating curiosity and providing entry points into literary collections while, at the same time, supporting multiple aspects of humanities research processes.
Developing an Image-Based Classifier for Detecting Poetic Content in Historic Newspaper Collections
This paper details and analyzes the first stage of work by the Image Analysis for Archival Discovery project team. Our team is investigating the use of image analysis to identify poetic content in historic newspapers. The project seeks both to augment the study of literary history by drawing attention to the magnitude of poetry published in newspapers and by making the poetry more readily available for study, and to advance work on the use of digital images in facilitating discovery in digital libraries and other digitized collections. We have recently completed the process of training our classifier for identifying poetic content, and as we prepare to move into the deployment stage, we are making available our methods for classification and testing in order to promote further research and discussion. The precision and recall values achieved during the training (90.58%; 79.4%) and testing (74.92%; 61.84%) stages are encouraging. In addition to discussing why such an approach is needed and relevant and situating our project alongside related work, this paper analyzes preliminary results, which support the feasibility and viability of our approach to detecting poetic content in historic newspaper collections.
Academic digital libraries of the future : an environment scan
Libraries are attempting to face a future in which almost every fixed point has disappeared. Users are changing; content is changing; research is taking new forms. Indeed, the very need for libraries is being questioned in some quarters. This paper explores the nature of the changes and challenges facing higher education libraries and suggests key areas of strength and core activities which should be exploited to secure their future.
Text Line Segmentation of Historical Documents: a Survey
There is a huge amount of historical documents in libraries and in various National Archives that have not been exploited electronically. Although automatic reading of complete pages remains, in most cases, a long-term objective, tasks such as word spotting, text/image alignment, authentication and extraction of specific fields are in use today. For all these tasks, a major step is document segmentation into text lines. Because of the low quality and the complexity of these documents (background noise, artifacts due to aging, interfering lines), automatic text line segmentation remains an open research field. The objective of this paper is to present a survey of existing methods, developed during the last decade, and dedicated to documents of historical interest.
Comment: 25 pages, submitted version. To appear in the International Journal on Document Analysis and Recognition; online version available at http://www.springerlink.com/content/k2813176280456k3
Transfer Learning for OCRopus Model Training on Early Printed Books
A method is presented that significantly reduces the character error rates for OCR text obtained from OCRopus models trained on early printed books when only small amounts of diplomatic transcriptions are available. This is achieved by building on already existing models during training instead of starting from scratch. To overcome the discrepancies between the character set of the pretrained model and the additional ground truth, the OCRopus code is adapted to allow for alphabet expansion or reduction: characters can now be flexibly added to or deleted from the pretrained alphabet when an existing model is loaded. For our experiments we use a self-trained mixed model on early Latin prints and the two standard OCRopus models on modern English and German Fraktur texts. The evaluation on seven early printed books showed that training from the Latin mixed model reduces the average number of errors by 43% and 26% compared to training from scratch with 60 and 150 lines of ground truth, respectively. Furthermore, it is shown that even building on mixed models trained on data unrelated to the newly added training and test data can lead to significantly improved recognition results.
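The alphabet expansion and reduction described above can be sketched conceptually (this is not the actual OCRopus code; the function name and sample alphabets are hypothetical): characters shared with the pretrained model keep their slots, unseen ground-truth characters are appended, and pretrained characters absent from the new data can optionally be dropped.

```python
# Conceptual sketch of alphabet expansion/reduction when loading a
# pretrained OCR model: shared characters are preserved, new
# ground-truth characters appended, unused ones optionally removed.

def adapt_alphabet(pretrained, ground_truth, drop_unused=False):
    """Return the adapted alphabet as a list of characters."""
    gt_chars = set(ground_truth)
    if drop_unused:
        # Reduction: keep only pretrained characters seen in the new data.
        kept = [c for c in pretrained if c in gt_chars]
    else:
        kept = list(pretrained)
    # Expansion: append ground-truth characters the model does not know.
    new = [c for c in sorted(gt_chars) if c not in kept]
    return kept + new

pretrained = ["a", "b", "c", "ſ"]   # toy alphabet with long s, as in Latin prints
ground_truth = "abd"                # new transcriptions lack "c" and "ſ", add "d"
print(adapt_alphabet(pretrained, ground_truth))                    # ['a', 'b', 'c', 'ſ', 'd']
print(adapt_alphabet(pretrained, ground_truth, drop_unused=True))  # ['a', 'b', 'd']
```

In a real system the output layer's weights would be re-indexed to match the adapted alphabet, so that slots for retained characters carry over from the pretrained model.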
Historical Document Digitization through Layout Analysis and Deep Content Classification
Document layout segmentation and recognition is an important task in the creation of digitized document collections, especially when dealing with historical documents.
This paper presents a hybrid approach to layout segmentation as well as a strategy to classify document regions, applied to the digitization of a historical encyclopedia. Our layout analysis method merges a classic top-down approach and a bottom-up classification process based on local geometrical features, while regions are classified by means of features extracted with a Convolutional Neural Network and fed into a Random Forest classifier. Experiments are conducted on the first volume of the ``Enciclopedia Treccani'', a large dataset containing 999 manually annotated pages from the historical Italian encyclopedia.
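The second stage of the pipeline above, classifying region feature vectors with a Random Forest, can be sketched as follows. This is a minimal illustration, not the authors' implementation: the random arrays stand in for CNN activations, and the feature dimension and class labels are invented for the example.

```python
# Sketch: classifying per-region feature vectors (random stand-ins for
# CNN activations here) with a Random Forest, as in the hybrid
# layout-classification strategy described above.
import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)
n_regions, n_features = 200, 64              # e.g. pooled CNN activations
X = rng.normal(size=(n_regions, n_features)) # one feature vector per region
y = rng.integers(0, 3, size=n_regions)       # toy labels: 0=text, 1=image, 2=table

clf = RandomForestClassifier(n_estimators=50, random_state=0)
clf.fit(X, y)
pred = clf.predict(X[:5])
print(pred.shape)
```

Using a forest on top of fixed CNN features avoids retraining the network for every new region taxonomy, which is one plausible motivation for this kind of hybrid design.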
Library as Agent of [Re]Contextualization
Paper presented at the Digital Humanities 2009 conference in College Park, Maryland