162 research outputs found
La família humana: Perspectives multidisciplinàries de la investigació en Ciències Humanes i Socials
Recull part de les ponències presentades en les XXVII Jornades de
Foment de la Investigació en Ciències Humanes i Socials, celebrades en Castelló el 6 de
maig de 2022 organitzades per la Universitat Jaume
Recent developments for the linguistic linked open data infrastructure
In this paper we describe the contributions made by the European H2020 project “Pret-a-LLOD” (‘Ready-to-use Multilingual Linked Language Data for Knowledge Services across Sectors’) to the further development of the Linguistic Linked Open Data (LLOD) infrastructure. Pret-a-LLOD aims to develop a new methodology for building data value chains applicable to a wide range of sectors and applications and based around language resources and language technologies that can be integrated by means of semantic technologies. We describe the methods implemented for increasing the number of language data sets in the LLOD. We also present the approach for ensuring interoperability and for porting LLOD data sets and services to other infrastructures, as well as the contribution of the projects to existing standards
The ACoLi dictionary graph
In this paper, we report the release of the ACoLi Dictionary Graph, a large-scale collection of multilingual open source dictionaries available in two machine-readable formats, a graph representation in RDF, using the OntoLex-Lemon vocabulary, and a simple tabular data format to facilitate their use in NLP tasks, such as translation inference across dictionaries. We describe the mapping and harmonization of the underlying data structures into a unified representation, its serialization in RDF and TSV, and the release of a massive and coherent amount of lexical data under open licenses
Historical lexicography of Old French and linked open data: transforming the resources of the Dictionnaire étymologique de l'ancien français with OntoLex-Lemon
The adaptation of novel techniques and standards in computational lexicography is taking place at an accelerating pace, as manifested by
recent extensions beyond the traditional XML-based paradigm of electronic publication. One important area of activity in this regard is the transformation of lexicographic resources into (Linguistic) Linked Open Data ([L]LOD), and the application of the OntoLex-Lemon
vocabulary to electronic editions of dictionaries. At the moment, however, these activities focus on machine-readable dictionaries,
natural language processing and modern languages and found only limited resonance in philology in general and in historical language
stages in particular. This paper presents an endeavor to transform the resources of a comprehensive dictionary of Old French into LOD
using OntoLex-Lemon and it sketches the difficulties of modeling particular aspects that are due to the medieval stage of the language
Inducing discourse marker inventories from lexical knowledge graphs
Discourse marker inventories are important tools for the development of both discourse parsers and corpora with discourse annotations. In this paper we explore the potential of massively multilingual lexical knowledge graphs to induce multilingual discourse marker lexicons using concept propagation methods as previously developed in the context of translation inference across dictionaries. Given one or multiple source languages with discourse marker inventories that discourse relations as senses of potential discourse markers, as well as a large number of bilingual dictionaries that link them – directly or indirectly – with the target language, we specifically study to what extent discourse marker induction can benefit from the integration of information from different sources, the impact of sense granularity and what limiting factors may need to be considered. Our study uses discourse marker inventories from nine European languages normalized against the discourse relation inventory of the Penn Discourse Treebank (PDTB), as well as three collections of machine-readable dictionaries with different characteristics, so that the interplay of a large number of factors can be studied
- …