Search CORE

22,082 research outputs found

Towards a Universal Wordnet by Learning from Combined Evidenc

Author: de Melo G.
Weikum G.
Publication venue: Max-Planck-Institut für Informatik
Publication date: 01/01/2009
Field of study

Lexical databases are invaluable sources of knowledge about words and their meanings, with numerous applications in areas like NLP, IR, and AI. We propose a methodology for the automatic construction of a large-scale multilingual lexical database where words of many languages are hierarchically organized in terms of their meanings and their semantic relations to other words. This resource is bootstrapped from WordNet, a well-known English-language resource. Our approach extends WordNet with around 1.5 million meaning links for 800,000 words in over 200 languages, drawing on evidence extracted from a variety of resources including existing (monolingual) wordnets, (mostly bilingual) translation dictionaries, and parallel corpora. Graph-based scoring functions and statistical learning techniques are used to iteratively integrate this information and build an output graph. Experiments show that this wordnet has a high level of precision and coverage, and that it can be useful in applied tasks such as cross-lingual text classification

MPG.PuRe

Light-front quark distributions in the nucleon and nucleon electromagnetic form factors

Author: Anselmino
Brodsky
de Araujo
de Melo
de Melo
E. Pace
Frederico
G. Salmè
J.P.B.C. de Melo
Jacob
Lai
Mandelstam
Pace
S. Pisano
T. Frederico
Publication venue: 'Elsevier BV'
Publication date: 01/01/2010
Field of study

Longitudinal and transverse quark momentum distributions in the nucleon are calculated from a phenomenological quark-nucleon vertex function obtained through an investigation of the nucleon electromagnetic form factors within a light-front framework.Comment: 6 pages, 11 figs. proceedings of LC2009, to appear in Nucl. Phys.

arXiv.org e-Print Archive

Crossref

ART

Open Access Repository

Using Multi-Sense Vector Embeddings for Reverse Dictionaries

Author: de Melo G.
Hedderich M.
Klakow D.
Yates A.
Publication venue
Publication date: 01/01/2019
Field of study

Popular word embedding methods such as word2vec and GloVe assign a single vector representation to each word, even if a word has multiple distinct meanings. Multi-sense embeddings instead provide different vectors for each sense of a word. However, they typically cannot serve as a drop-in replacement for conventional single-sense embeddings, because the correct sense vector needs to be selected for each word. In this work, we study the effect of multi-sense embeddings on the task of reverse dictionaries. We propose a technique to easily integrate them into an existing neural network architecture using an attention mechanism. Our experiments demonstrate that large improvements can be obtained when employing multi-sense embeddings both in the input sequence as well as for the target representation. An analysis of the sense distributions and of the learned attention is provided as well

MPG.PuRe