Combining multiple resources to build reliable wordnets

Author: Fišer Darja
Sagot Benoît
Publication venue: HAL CCSD
Publication date: 01/01/2008

This paper compares automatically generated sets of synonyms in French and Slovene wordnets with respect to the resources used in the construction process. Polysemous words were disambiguated via a five-language word-alignment of the SEERA.NET parallel corpus, a subcorpus of the JRC Acquis. The extracted multilingual lexicon was disambiguated with the existing wordnets for these languages. On the other hand, a bilingual approach sufficed to acquire equivalents for monosemous words. Bilingual lexicons were extracted from different resources, including Wikipedia, Wiktionary and the EUROVOC thesaurus. A representative sample of the generated synsets was evaluated against the gold standards.

INRIA a CCSD electronic archive server

Hal-Diderot

Right or wrong: combining lexical resources in the EuroWordNet project

Author: Vossen P.J.T.M.
Publication venue: 'Vrije Universiteit Amsterdam Faculty of Law'
Publication date: 01/01/1996

VU Research Portal

International Migration, Integration and Social Cohesion online publications

Grouping Synonyms by Definitions

Author: Falk Ingrid
Gardent Claire
Jacquey Evelyne
Venant Fabienne
Publication date: 14/09/2009

We present a method for grouping the synonyms of a lemma according to its dictionary senses. The senses are defined by a large machine-readable dictionary for French, the TLFi (Trésor de la langue française informatisé), and the synonyms are given by 5 synonym dictionaries (also for French). To evaluate the proposed method, we manually constructed a gold standard in which, for each (word, definition) pair and given the set of synonyms defined for that word by the 5 synonym dictionaries, 4 lexicographers specified the set of synonyms they judged adequate. While inter-annotator agreement on that task ranges from 67% to at best 88%, depending on the annotator pair and on the synonym dictionary being considered, the automatic procedure we propose achieves a precision of 67% and a recall of 71%. The proposed method is compared with related work, namely word sense disambiguation, synonym lexicon acquisition and WordNet construction.
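As a rough illustration of the evaluation measure named in this abstract, the sketch below computes precision and recall for one (word, definition) pair by comparing a predicted synonym set with a gold-standard set. The function name and the sample words are hypothetical and are not taken from the paper; how the authors aggregate scores over pairs and annotators is not stated here.

```python
def precision_recall(predicted, gold):
    """Return (precision, recall) of a predicted synonym set against a gold set."""
    predicted, gold = set(predicted), set(gold)
    true_positives = len(predicted & gold)
    # Guard against empty sets so the ratios are always defined.
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall


# Hypothetical synonym sets for a single (word, definition) pair.
predicted_synonyms = {"maison", "demeure", "logis"}
gold_synonyms = {"demeure", "logis", "habitation", "domicile"}
precision, recall = precision_recall(predicted_synonyms, gold_synonyms)
```

With these made-up sets, two of the three predicted synonyms are in the gold set, and two of the four gold synonyms are recovered.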

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

Creating the Open Wordnet Bahasa

Author: Bond Francis
Noor Nurril Hirfana Bte Mohamed
Sapuan Suerya
Publication venue: Institute of Digital Enhancement of Cognitive Processing, Waseda University
Publication date: 01/01/2011

Waseda University Repository

Building a free French wordnet from multilingual resources

Author: Fišer Darja
Sagot Benoît
Publication venue: HAL CCSD
Publication date: 31/05/2008

This paper describes the automatic construction of a freely available wordnet for French (WOLF) based on Princeton WordNet (PWN) using various multilingual resources. Polysemous words were handled with an approach in which a parallel corpus for five languages was word-aligned and the extracted multilingual lexicon was disambiguated with the existing wordnets for these languages. On the other hand, a bilingual approach sufficed to acquire equivalents for monosemous words. Bilingual lexicons were extracted from Wikipedia and thesauri. The results obtained from each resource were merged and ranked according to the number of resources yielding the same literal. Automatic evaluation of the merged wordnet was performed against the French WordNet (FREWN). Manual evaluation was also carried out on a sample of the generated synsets. The precision obtained suggests that the approach is promising, and applications using the created wordnet are already planned.
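The merge-and-rank step mentioned in this abstract could look roughly like the sketch below, which counts how many resources propose the same (synset, literal) pair and sorts by that count. This is only an assumption about the general idea; the data layout, the function name, the synset identifier and the sample literals are all hypothetical and do not come from the paper.

```python
from collections import defaultdict


def merge_and_rank(candidates_by_resource):
    """Merge (synset_id, literal) candidates and rank them by resource support.

    candidates_by_resource maps a resource name to an iterable of
    (synset_id, literal) pairs proposed by that resource.
    """
    support = defaultdict(set)
    for resource, pairs in candidates_by_resource.items():
        for synset_id, literal in pairs:
            support[(synset_id, literal)].add(resource)
    # Most widely supported candidates first; ties broken alphabetically.
    ranked = sorted(support.items(), key=lambda item: (-len(item[1]), item[0]))
    return [(sid, lit, len(resources)) for (sid, lit), resources in ranked]


# Hypothetical candidates for one synset from three resources.
candidates = {
    "wikipedia": [("synset-1", "chien")],
    "wiktionary": [("synset-1", "chien"), ("synset-1", "cabot")],
    "thesaurus": [("synset-1", "chien")],
}
ranking = merge_and_rank(candidates)
```

Using a set per candidate means a resource that proposes the same literal twice is still counted once.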

CiteSeerX

INRIA a CCSD electronic archive server

Hal-Diderot

Experiments on applying relaxation labeling to map multilingual hierarchies

Author: Daude Ventura Jordi
Padró Lluís
Rigau Claramunt German
Publication date: 01/01/1999

This paper explores the automatic construction of a multilingual Lexical Knowledge Base from preexisting lexical resources. It presents a new approach for linking already existing hierarchies: the relaxation labeling algorithm is used to select, among all the candidate connections proposed by a bilingual dictionary, the right connection for each node in the taxonomy.

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UPCommons. Portal del coneixement obert de la UPC

The Dutch Wordnet

Author: Bloksma L.
Boersma P.
Vossen P.J.T.M.
Publication venue: Amsterdam: Vrije Universiteit
Publication date: 01/01/1999

VU Research Portal