Semantic Oriented Extension for Machine Translation Corpora

Abstract

Corpora play a crucial role in the development and improvement of automatic translation systems. There are currently many corpora used in the machine translation (MT) domain. However, exploiting and using these corpora are still challenging and limited because of some reasons, of which the main reason is that most corpora are in terms of raw texts or linked to other different kinds of data such as audio, images, graphs.... But they are not organized into semantic layers. Therefore, in this paper, we want to propose an idea of extending and enlarging corpora by adding to them a semantic layer so that the performance of corpus exploitation systems will be much improved

    Similar works

    Full text

    thumbnail-image

    Available Versions

    Last time updated on 09/10/2022