Context-dependent multilingual lexical lookup for under-resourced languages

Enya, Kong Tang; Lay-Ki, Soon; Lian, Tze Lim; Ranaivo-Malançon, Bali; Tek, Yong Lim

Context-dependent multilingual lexical lookup for under-resourced languages

Authors: Kong Tang Enya
Soon Lay-Ki
Tze Lim Lian
Bali Ranaivo-Malançon
Yong Lim Tek
Publication date: 1 January 2013
Publisher

Abstract

Current approaches for word sense disambiguation and translation selection typically require lexical resources or large bilingual corpora with rich information fields and annotations, which are often infeasible for under-resourced languages. We extract translation context knowledge from a bilingual comparable corpora of a richer-resourced language pair, and inject it into a multilingual lexicon. The multilingual lexicon can then be used to perform context-dependent lexical lookup on texts of any language, including under-resourced ones. Evaluations on a prototype lookup tool, trained on a English-Malay bilingual Wikipedia corpus, show a precision score of 0.65 (baseline 0.55) and mean reciprocal rank score of 0.81 (baseline 0.771). Based on the early encouraging results, the context-dependent lexical lookup tool may be developed further into an intelligent reading aid, to help users grasp the gist of a second or foreign language text

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Unimas Institutional Repository

oai:ir.unimas.my:16527

Last time updated on 18/04/2020