3 research outputs found

    A Tool to Explore the Population of a CIDOC-CRM Ontology

    This paper presents a visualisation tool for exploring the population of an ontology obtained through automatic migration and text information extraction. It was developed in the context of the EPISA project, an R&D project that aims to represent the records of the Portuguese National Archives in CIDOC-CRM, an ontology developed for museums. The tool lets developers of the migration process visualise instances and their properties and debug both the migration process and the migration representation model; it also lets end users explore the Archives. It uses the OWL API for modelling and reasoning, together with SPARQL-DL queries, to obtain the exploration results.

    A strategy for archives metadata representation on CIDOC-CRM and knowledge discovery

    This paper presents a strategy for the semantic migration of Portuguese National Archives records into the CIDOC-CRM standard, an ontology developed for museums, within the context of the EPISA project. The approach to automatically populating CIDOC-CRM is based on Mapping Description Rules that semantically translate the archives' descriptive information into a CIDOC-CRM representation. Compliance with the CIDOC-CRM model recommendations ensures that the populated ontology of archival descriptive information is interoperable and can be linked and integrated with other populated CIDOC-CRM ontologies. The information modelling takes into account requirements on the mapping representation that arise from two goals: automatically extracting information from natural language metadata text fields, and interpreting natural language queries. The OWL API was used to interpret the Mapping Description Rules automatically and obtain the set of assertions that represent the information in the target ontology; two datasets with migration examples are available. The knowledge representation is explored through Description Logic queries that highlight the advantages of this new representation of the National Archives. The resulting representation can be evaluated automatically, proving its correctness for metadata that has a direct representation in CIDOC-CRM.
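    As a rough illustration of rule-based migration, the sketch below turns a flat metadata record into subject-property-object assertions. It is not the EPISA Mapping Description Rules format and does not use the OWL API: the `Rule` structure, the `migrate` function, the field names `title` and `reference_code`, and the identifier scheme are all hypothetical, and the choice of CIDOC-CRM classes and properties (E31 Document, E35 Title, E42 Identifier, P102 has title, P1 is identified by) is only an assumption about what such a mapping might look like.

```python
from typing import NamedTuple


class Rule(NamedTuple):
    """Hypothetical mapping rule: one source field becomes one CIDOC-CRM node."""
    field: str         # source metadata field name (assumed, not from the paper)
    prop: str          # CIDOC-CRM property linking the record to the new node
    target_class: str  # CIDOC-CRM class asserted for the new node


# Illustrative rules only; the real project rules are not reproduced here.
RULES = [
    Rule("title", "P102_has_title", "E35_Title"),
    Rule("reference_code", "P1_is_identified_by", "E42_Identifier"),
]


def migrate(record_id, record, rules=RULES):
    """Return (subject, predicate, object) assertions for one archival record."""
    subject = f"record/{record_id}"
    # Every record is typed as a document in this sketch.
    triples = [(subject, "rdf:type", "E31_Document")]
    for rule in rules:
        value = record.get(rule.field)
        if not value:
            continue  # skip fields that are missing or empty in the source
        node = f"{subject}/{rule.field}"
        triples.append((subject, rule.prop, node))
        triples.append((node, "rdf:type", rule.target_class))
        triples.append((node, "rdfs:label", value))
    return triples
```

    Calling `migrate("1", {"title": "Letter"})` yields one class assertion for the record plus three assertions for the title node. Keeping the rules as data rather than code is one plausible way to let the same interpreter serve every record type.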

    EPISA Project: Semantic Migration from DigitArq to CIDOC CRM

    This thesis presents a strategy for the semantic migration of Portuguese National Archives records from the ISAD(G) standard into the CIDOC-CRM standard, and a strategy for extracting valuable information from these records. Both research activities were developed within the context of the EPISA project, part of the ongoing renewal of the existing data infrastructure of the Direção-Geral do Livro, dos Arquivos e das Bibliotecas (DGLAB). The semantic migration was performed using the Migration Mapping Rules, a set of rules that semantically translate the archives' descriptive information into a CIDOC-CRM representation. These rules are implemented with the OWL API, a Java library for generating and manipulating ontologies. Information extraction was performed by applying Natural Language Processing (NLP) techniques, such as Named Entity Recognition (NER), together with a set of pattern matching rules implemented in JAPE. The process is managed by GATE, a framework and graphical development environment for NLP tools. The analysis differs depending on the type of record, which is selected at the beginning of the process by a multiclass classification implemented as a Decision Tree. The resulting Knowledge Base can be explored with the Query Ontology Interface, an application developed with Spring that allows domain expert users from DGLAB to evaluate the results of the migration and extraction processes.