Search CORE

535 research outputs found

A Lime-Flavored REST API for Alignment Services

Author: Armando Stellato
Manuel Fiorelli
Publication venue: European Language Resources Association
Publication date: 01/05/2020
Field of study

A practical alignment service should be flexible enough to handle the varied alignment scenarios that arise in the real world, while minimizing the need for manual configuration. MAPLE, an orchestration framework for ontology alignment, supports this goal by coordinating a few loosely coupled actors, which communicate and cooperate to solve a matching task using explicit metadata about the input ontologies, other available resources and the task itself. The alignment task is thus summarized by a report listing its characteristics and suggesting alignment strategies. The schema of the report is based on several metadata vocabularies, among which the Lime module of the OntoLex-Lemon model is particularly important, summarizing the lexical content of the input ontologies and describing external language resources that may be exploited for performing the alignment. In this paper, we propose a REST API that enables the participation of downstream alignment services in the process orchestrated by MAPLE, helping them self-adapt in order to handle heterogeneous alignment tasks and scenarios. The realization of this alignment orchestration effort has been performed through two main phases: we first described its API as an OpenAPI specification (a la API-first), which we then exploited to generate server stubs and compliant client libraries. Finally, we switched our focus to the integration of existing alignment systems, with one fully integrated system and an additional one being worked on, in the effort to propose the API as a valuable addendum to any system being developed

Marvin: Semantic annotation using multiple knowledge sources

Author: Milosevic Nikola
Publication venue
Publication date: 01/02/2016
Field of study

People are producing more written material then anytime in the history. The increase is so high that professionals from the various fields are no more able to cope with this amount of publications. Text mining tools can offer tools to help them and one of the tools that can aid information retrieval and information extraction is semantic text annotation. In this report we present Marvin, a text annotator written in Java, which can be used as a command line tool and as a Java library. Marvin is able to annotate text using multiple sources, including WordNet, MetaMap, DBPedia and thesauri represented as SKOS.Comment: 9 pages, 4 figures, keywords: Semantic annotation, text normalization, semantic web, linked data, information management, text mining, information extraction, data curatio

arXiv.org e-Print Archive

RNeXML: a package for reading and writing richly annotated phylogenetic, character, and trait data in R

Author: Boettiger Carl
Chamberlain Scott
Lapp Hilmar
Vos Rutger
Publication venue
Publication date: 08/06/2015
Field of study

NeXML is a powerful and extensible exchange standard recently proposed to better meet the expanding needs for phylogenetic data and metadata sharing. Here we present the RNeXML package, which provides users of the R programming language with easy-to-use tools for reading and writing NeXML documents, including rich metadata, in a way that interfaces seamlessly with the extensive library of phylogenetic tools already available in the R ecosystem

arXiv.org e-Print Archive

eScholarship - University of California

HILT : High-Level Thesaurus Project. Phase IV and Embedding Project Extension : Final Report

Author: Joseph Anu
McCulloch Emma
Nicholson Dennis
Publication venue: University of Strathclyde
Publication date: 01/01/2009
Field of study

Ensuring that Higher Education (HE) and Further Education (FE) users of the JISC IE can find appropriate learning, research and information resources by subject search and browse in an environment where most national and institutional service providers - usually for very good local reasons - use different subject schemes to describe their resources is a major challenge facing the JISC domain (and, indeed, other domains beyond JISC). Encouraging the use of standard terminologies in some services (institutional repositories, for example) is a related challenge. Under the auspices of the HILT project, JISC has been investigating mechanisms to assist the community with this problem through a JISC Shared Infrastructure Service that would help optimise the value obtained from expenditure on content and services by facilitating subject-search-based resource sharing to benefit users in the learning and research communities. The project has been through a number of phases, with work from earlier phases reported, both in published work elsewhere, and in project reports (see the project website: http://hilt.cdlr.strath.ac.uk/). HILT Phase IV had two elements - the core project, whose focus was 'to research, investigate and develop pilot solutions for problems pertaining to cross-searching multi-subject scheme information environments, as well as providing a variety of other terminological searching aids', and a short extension to encompass the pilot embedding of routines to interact with HILT M2M services in the user interfaces of various information services serving the JISC community. Both elements contributed to the developments summarised in this report

Outside The Box: Building a Digital Asset Management Ecosystem for Preservation and Access

Author: Krewer Drew
Richardson Matthew
Scott Bethany
Washington Anne
Watkins Sean
Weidner Andrew
Publication venue
Publication date: 01/04/2017
Field of study

The University of Houston (UH) Libraries made an institutional commitment in late 2015 to migrate the data for its digitized cultural heritage collections to open source systems for preservation and access: Hydra-in-a-Box, Archivematica, and ArchivesSpace. This article describes the work that the UH Libraries implementation team has completed to date, including open source tools for streamlining digital curation workflows, minting and resolving identifiers, and managing SKOS vocabularies. These systems, workflows, and tools, collectively known as the Bayou City Digital Asset Management System (BCDAMS), represent a novel effort to solve common issues in the digital curation lifecycle and may serve as a model for other institutions seeking to implement flexible and comprehensive systems for digital preservation and access.Librarie

Directory of Open Access Journals

Developing a RDF4J frontend

Author: Oliveira Francisco José Moreira
Publication venue
Publication date: 06/04/2021
Field of study

Dissertação de mestrado integrado em Engenharia InformáticaA few years ago, data was not shared and kept isolated, preventing communication between datasets. Currently, we have more significant data volumes, and in a world where everything is connected, our data is now also following this trend. Data model focus changed from a square structure like the relational model to a model centered on the relations. Knowledge graphs are the new paradigm to represent and manage this new kind of information structure. Along with the new paradigm, graph databases emerged to support the new requirements. Despite the increasing interest in the field, only a few native solutions are available. Most are under a commercial license, and the open-source options have very basic or outdated interfaces, and because of that, they are a little distant for most end-users. In this thesis, we introduce the Open Web Ontobud and discuss its design and develop ment. Ontobud is a Web Application aimed at improving the interface for one of the most fascinating and influential frameworks in this area: RDF4J. RDF4J is a Java framework to deal with RDF triple storage, management, and query. Open Web Ontobud is an open-source RDF4J web frontend created to reduce the gap between end-users and the RDF4J backend. We created a web interface that enables users with a basic knowledge of OWL and SPARQL to explore ontologies via resource tables or graphs and extract information from them with SPARQL queries. The interface aims to remain intuitive, providing tooltips and help when needed, as well as some statistical data in a readily available form. Despite the frontend being the main focus, a backend and two databases are also used for a total of four components in the framework. For the best deployment experience, Docker was used for its simplicity, allowing deployment in just a few commands. Each component has a dedicated image, following a modular design and allowing them to be executed on separate machines if desired.No passado, dados não era partilhada e permanecia isolada, impedindo comunicação entre datasets. Atualmente, temos maiores volumes de dados e num mundo onde tudo está interligado, os nossos dados também seguem essa tendência. O foco de modelo de dados alterou de uma estrutura quadrada, como o modelo relacional, para um modelo centrado em relações. Grafos de Conhecimento são o novo paradigma para a representação e manipulação desta nova estrutura de dados. Com o novo paradigma, bases de dados de grafos emergiram para suportar as novas necessidades. Apesar do aumento de interesse neste campo, apenas algumas soluções nativas estão disponíveis. A maioria requere uma licença comercial, e as opções open-source são interfaces básicas ou desatualizadas, e por consequência, distantes a muitos utilizadores. Nesta tese introduzimos o Open Web Ontobud e discutimos o seu design e desenvolvi mento. O Ontobud é uma aplicação Web direcionada ao melhoramento da interface de uma das mais fascinantes e influentes frameworks nesta área: o RDF4J. O RDF4J é uma framework em Java para guardar, manipular e inquirir grafos RDF. Open Web Ontobud é um open-source web frontend para o RDF4J criado para diminuir a separação entre os utilizadores e o RDF4J backend. Nós criamos uma interface web que permite utilizadores com conhecimento básico de OWL e SPARQL explorar ontologias através de tabelas de recursos ou grafos, e inquirir informação com queries SPARQL. O objetivo da interface é ser intuitiva, com tooltips e ajuda quando necessário, bem como alguma informação de estatísticas numa forma facilmente acessível. Apesar do frontend ser o foco principal, o backend e duas bases de dados também são utilizadas, para um total de quatro componentes nesta framework. Para a melhor experiência de inicialização utilizamos Docker pela sua simplicidade, permitindo inicialização em poucos comandos. Cada componente tem uma imagem dedicada, seguindo um design modular e permitindo cada componente ser executada em máquinas separadas se necessário

Adaptation of NLP Techniques to Cultural Heritage Research and Documentation

Author: Guenther Goerz
Martin Scholz
Publication venue: 'University of Zagreb - University Computing Centre'
Publication date: 01/01/2010
Field of study

The WissKI system provides a framework for ontology based science communication and cultural heritage documentation. In many cases, the documentation consists of semi-structured data records with free text fields. Most references in the texts comprise of person and place names, as well as time specifications. We present the WissKI tools for semantic annotation using controlled vocabularies and formal ontologies derived from CIDOC Conceptual Reference Model (CRM). Current research deals with the annotations as building blocks for event recognition. Finally, we outline how the CRM helps to build bridges between documentation in different scientific disciplines

Spacialist – A Virtual Research Environment for the Spatial Humanities

Author: Derntl Michael
Glissmann Benjamin
Kirschenheuter David
Lang Matthias
Rosenkranz Vinzenz
Seidensticker Dirk
Publication venue: Universität Tübingen
Publication date: 01/01/2020
Field of study

Many archaeological research projects generate data and tools that are unusable or abandoned after the funding period ends. To counter this unsustainable practice, the Spacialist project was tasked to create a virtual research environment that offers an integrated, web-based user interface to record, browse, analyze, and visualize all spatial, graphical, textual and statistical data from archaeological or cultural heritage research projects. Spacialist is developed as an open-source software platform composed of modules providing the required functionality to end-users. It builds on controlled multi-language vocabularies and an abstract, extensible data model to facilitate data recording and analysis, as well as interoperability with other projects and infrastructures. Development of Spacialist is driven by an interdisciplinary team in collaboration with various pilot projects in different areas of archaeology. To support the complete research lifecycle, the platform is being integrated with the University’s research-data archive, guaranteeing long-term availability of project data

Publikationsserver der Universität Tübingen

Connecting medical educational resources to the Linked Data cloud: the mEducator RDF Schema, store and API

Author: Bamidis Panagiotis
Bratsas Charalampos
Dietze Stefan
Evangelia Mitsopoulou
Giordano Daniela
Taibi Davide
Woodham Luke
Yu Hong Qing
Publication venue
Publication date: 01/01/2011
Field of study

The existence of non-interoperable metadata schemas and limited use of shared vocabularies means that retrieving and processing educational resources across the Web represents a challenge. The emerging Linked Data paradigm has provided the tools and methods to share and expose metadata in a more unified and well interlinked manner, permitting both humans and machines to process Web data. The availability of vast amounts of RDF-based Linked Data is offering a worthwhile alternative to the isolated and heterogeneous data silos which previously dominated the Web. In the ECfunded project mEducator a standardised approach is proposed to describing and exposing medical educational resources. In this paper are described firstly the design considerations, and conceptual model upon which mEducator’s metadata scheme approach was based. Afterwards, follows the description of the serialization of the scheme in RDF/XML, and in turn follows an example showing how medical educational resources are exposed on the Web using an RDF endpoint, and discuss the potential advantages of this approach