6 research outputs found
Variability of the Facet Values in the VLO -a Case for Metadata Curation
Abstract In this paper we propose a strategy for metadata curation especially with respect to the variability of the values encountered in the metadata records and hence in the facets of the main CLARIN metadata catalogue, the VLO. The approach concentrates on measures on the side of the infrastructure and on the interaction between human curators and the automatic processes
When linguistics meets web technologies. Recent advances in modelling linguistic linked data
This article provides an up-to-date and comprehensive survey of models (including vocabularies, taxonomies and ontologies) used for representing linguistic linked data (LLD). It focuses on the latest developments in the area and both builds upon and complements previous works covering similar territory. The article begins with an overview of recent trends which have had an impact on linked data models and vocabularies, such as the growing influence of the FAIR guidelines, the funding of several major projects in which LLD is a key component, and the increasing importance of the relationship of the digital humanities with LLD. Next, we give an overview of some of the most well known vocabularies and models in LLD. After this we look at some of the latest developments in community standards and initiatives such as OntoLex-Lemon as well as recent work which has been in carried out in corpora and annotation and LLD including a discussion of the LLD metadata vocabularies META-SHARE and lime and language identifiers. In the following part of the paper we look at work which has been realised in a number of recent projects and which has a significant impact on LLD vocabularies and models
Terminologie numérique : conception, représentation et gestion
Cet ouvrage se consacre Ă la notion de terminologie numĂ©rique considĂ©rĂ©e comme une approche de la discipline impliquant la reprĂ©sentation numĂ©rique dâinformations conceptuelles et linguistiques dâun domaine spĂ©cifique. Lâobjectif est lâillustration des Ă©tapes de conception et dâimplĂ©mentation de base de donnĂ©es terminologiques multilingues permettant le respect des meilleures pratiques dans la gestion des donnĂ©es terminologiques du numĂ©rique. Pour ce faire, lâouvrage met en exergue les nouvelles compĂ©tences du terminologue Ă lâĂšre numĂ©rique. Celles-ci trouvent leur vĂ©ritable essence dans lâesprit interdisciplinaire et collaboratif de la recherche
Experiences with the ISOcat Data Category Registry
The ISOcat Data Category Registry has been a joint project of both ISO TC 37 and the European CLARIN infrastructure. In this paper the experiences of using ISOcat in CLARIN are described and evaluated. This evaluation clarifies the requirements of CLARIN with regard to a semantic registry to support its semantic interoperability needs. A simpler model based on concepts instead of data categories and a simpler workflow based on community recommendations will address these needs better and offer the required flexibility
Experiences with the ISOcat Data Category Registry
The ISOcat Data Category Registry has been a joint project of both ISO TC 37 and the European CLARIN infrastructure. In this paper the experiences of using ISOcat in CLARIN are described and evaluated. This evaluation clarifies the requirements of CLARIN with regard to a semantic registry to support its semantic interoperability needs. A simpler model based on concepts instead of data cate-gories and a simpler workflow based on community recommendations will address these needs better and offer the required flexibility.status: publishe