5 research outputs found

Flexible RDF data extraction from Wiktionary - Leveraging the power of community build linguistic wikis

Author: Brekle Jonas
Publication date: 26/02/2018

We present a declarative approach, implemented in a comprehensive open-source framework (based on DBpedia), to extract lexical-semantic resources (an ontology about language use) from Wiktionary. The data currently includes language, part of speech, senses, definitions, synonyms, taxonomies (hyponyms, hyperonyms, synonyms, antonyms) and translations for each lexical word. The main focus is on flexibility towards the loose schema and configurability towards the differing language editions of Wiktionary. This is achieved by a declarative mediator/wrapper approach. The goal is to allow the addition of languages by configuration alone, without the need for programming, thus enabling the swift and resource-conserving adaptation of wrappers by domain experts. The extracted data is as fine-grained as the source data in Wiktionary and additionally follows the lemon model. It enables use cases like disambiguation or machine translation. By offering a linked data service, we hope to extend DBpedia’s central role in the LOD infrastructure to the world of Open Linguistics.

Qucosa - Publikationsserver der Universität Leipzig

Flexible RDF data extraction from Wiktionary - Leveraging the power of community build linguistic wikis

Author: Brekle Jonas
Publication date: 26/02/2018

Qucosa

HSSS - Hochschulschriftenserver der SLUB

Qucosa - Publikationsserver der Universität Leipzig

The Working Group for Open Data in Linguistics

Authors: Brekle Jonas; Chiarcos Christian; Cimiano Philipp; Eckle-Kohler Judith; Gurevych Iryna; Hartmann Silvana; Hellmann Sebastian; Littauer Richard; Matuschek Michael; McCrae John; Meyer Christian M.; Nordhoff Sebastian
Publication date: 01/03/2012

TUbiblio