Domain-specific named entity disambiguation in historical memoirs

Author: Goy Anna
Nanni Federico
Ponzetto Simone Paolo
Rovera Marco
Publication venue: RWTH
Publication date: 01/01/2017

This paper presents the results of extracting named entities from a collection of historical memoirs about the Italian Resistance during World War II. It discusses the methodology followed for the extraction and disambiguation task, as well as its evaluation. For the semantic annotation of the dataset, we developed a pipeline based on established practices for extracting and disambiguating named entities. This was necessary given the poor performance of the out-of-the-box Named Entity Recognition and Disambiguation (NERD) tools tested in the initial phase of this work.

This article presents the named entity extraction carried out on a collection of memoirs relating to the period of the Italian Resistance in World War II. It discusses the methodology developed for the extraction and disambiguation of named entities, as well as its evaluation. Implementing a lookup-based extraction and disambiguation methodology proved necessary given the poor performance of Named Entity Recognition and Disambiguation (NERD) systems, as shown by the discussion in the first part of this work.

Crossref

MAnnheim DOCument Server

OpenEdition

Domain-specific Named Entity Disambiguation in Historical Memoirs

Author: Federico Nanni
Goy Annamaria
Rovera Marco
Simone Paolo Ponzetto
Publication venue: CEUR
Publication date: 01/01/2017

Institutional Research Information System University of Turin

Event-based Access to Historical Italian War Memoirs

Author: Nanni Federico
Ponzetto Simone Paolo
Rovera Marco
Publication venue: Association for Computing Machinery (ACM)
Publication date: 01/01/2021

The progressive digitization of historical archives provides new, often domain-specific, textual resources that report on facts and events of the past; among these, memoirs are a very common type of primary source. In this paper, we present an approach for extracting information from Italian historical war memoirs and turning it into structured knowledge. The approach is based on the semantic notions of events, participants and roles. We quantitatively evaluate each of its key steps and provide a graph-based representation of the extracted knowledge, which allows readers to move between a Close and a Distant Reading of the collection.

Comment: 23 pages, 6 figures

arXiv.org e-Print Archive

MAnnheim DOCument Server

Event-based Access to Historical Italian War Memoirs

Author: Rovera M
Publication date: 01/01/2021

Institutional Research Information System University of Turin

Knowledge Extraction for Art History: the Case of Vasari’s The Lives of The Artists (1568)

Author: Bruns Oleksandra
Posthumus Etienne
Sack Harald
Santini Cristian
Tan Mary Ann
Tietz Tabea
Publication venue: Aachen, Germany : RWTH Aachen
Publication date: 01/01/2022

Knowledge Extraction (KE) techniques are used to convert unstructured information present in texts into Knowledge Graphs (KGs), which can be queried and explored. Despite their potential for cultural heritage domains such as Art History, these techniques often encounter limitations when applied to domain-specific data. In this paper we present the main challenges that KE faces on art-historical texts, using Giorgio Vasari's The Lives of The Artists as a case study. The paper discusses the following NLP tasks for art-historical texts: entity recognition and linking, coreference resolution, time extraction, motif extraction and artwork extraction. Several strategies to annotate art-historical data for these tasks and to evaluate NLP models are also proposed.

Repositorium für Naturwissenschaften und Technik

Knowledge Extraction for Art History: the Case of Vasari’s The Lives of The Artists (1568)

Author: Bruns Oleksandra
Posthumus Etienne
Sack Harald
Santini Cristian
Tan Mary Ann
Tietz Tabea
Publication venue: CEUR-WS.org
Publication date: 01/01/2022

KITopen

Repositorium für Naturwissenschaften und Technik

Entity-Centric Text Mining for Historical Documents

Author: Coll Ardanuy Maria
Publication date: 07/07/2017

Georg-August-University Göttingen

Musical Meetups: a Knowledge Graph approach for Historical Social Network Analysis

Author: Carvalho Jason
Daga Enrico
Morales Tirado Alba
Mulholland Paul
Publication date: 28/05/2023

The large-scale analysis of historical event data makes it possible to trace key points of cultural and social exchange in history. Existing research has focused on facilitating the integration and interpretation of events from heterogeneous sources (such as memoirs, books, and biographies), mainly treating events as a sequence of spatiotemporal objects. However, exploring and discovering new connections (e.g., collaborations, interactions) between people requires characterising those events along dimensions that are relevant to the scholarly enquiry, such as the actual participants and the nature of the event. This paper describes the concept of the historical meetup, which represents encounters (for instance, collaborations, exchanges, links) between personalities of European history, and formalises its constituent parts as an ontology. Furthermore, we report on preliminary work undertaken to generate a Knowledge Graph of historical meetups extracted from encyclopedic sources, i.e. biographies collected from Wikipedia. We discuss our results and illustrate the challenges of extracting this type of knowledge from biographical sources. The current experimental setting explores historical meetups in European musical culture between 1800 and 1945. Our work sketches the basis for applying event knowledge graphs to cultural and social history research, supporting the analysis and exchange of ideas and practices.

Open Research Online