Search CORE

4 research outputs found

La Diplomatica e il digitale. Il Fondo della Biblioteca della Società Napoletana di Storia Patria online

Author: Antonella Ambrosio
Publication venue
Publication date: 01/01/2020
Field of study

Archivio della ricerca - Università degli studi di Napoli Federico II

Named entity recognition applied on a data base of Medieval Latin charters. The case of chartae burgundiae

Author: Chastang Pierre
Tannier Xavier
Torres Aguilar Sergio
Publication venue: HAL CCSD
Publication date: 01/01/2016
Field of study

International audienceThe work on the named entity recognition (NER) in databases of historical texts has been placed among the most promising new ways to implement best recovery and managements tools for exploring mass data. In this paper, we describe the application processing NER through a modelling with CRF on an annotated database of Burgundy collection of charters from the tenth to thirteenth centuries. The aim is to generate a model for automatic recognition of named entities in historical sources. We discuss the nature of historical documents in the corpus and extraction of rules, and we expose adaptation to the processing algorithm and the most common problems encountered in Medio Latin texts using diplomatic formularies, which is an atypical case within the NER studies

Open Repository and Bibliography - Luxembourg

HAL UVSQ

Named entity recognition applied on a data base of Medieval Latin charters. The case of chartae burgundiae

Author: A Jatowt
J Preiser-Kapeller
M Düring
Publication venue
Publication date: 24/04/2020
Field of study

Abstract The work on the named entity recognition (NER) in databases of historical texts has been placed among the most promising new ways to implement best recovery and managements tools for exploring mass data. In this paper, we describe the application processing NER through a modelling with CRF on an annotated database of Burgundy collection of charters from the tenth to thirteenth centuries. The aim is to generate a model for automatic recognition of named entities in historical sources. We discuss the nature of historical documents in the corpus and extraction of rules, and we expose adaptation to the processing algorithm and the most common problems encountered in Medio Latin texts using diplomatic formularies, which is an atypical case within the NER studies

CiteSeerX

Machine Learning Algorithm for the Scansion of Old Saxon Poetry

Author: Alessandro Torcinovich
Gianluca Lebani
Irene Miani
Marina Buzzoni
Publication venue: place:Siena
Publication date: 01/01/2023
Field of study

Several scholars designed tools to perform the automatic scansion of poetry in many languages, but none of these tools deal with Old Saxon or Old English. This project aims to be a first attempt to create a tool for these languages. We implemented a Bidirectional Long Short-Term Memory (BiLSTM) model to perform the automatic scansion of Old Saxon and Old English poems. Since this model uses supervised learning, we manually annotated the Heliand manuscript, and we used the resulting corpus as labeled dataset to train the model. The evaluation of the performance of the algorithm reached a 97% for the accuracy and a 99% of weighted average for precision, recall and F1 Score. In addition, we tested the model with some verses from the Old Saxon Genesis and some from The Battle of Brunanburh, and we observed that the model predicted almost all Old Saxon metrical patterns correctly misclassified the majority of the Old English input verses

Archivio istituzionale della ricerca - Università degli Studi di Venezia Ca' Foscari