research

Parallel corpora based translation resources extraction

Abstract

This paper describes NATools, a toolkit to process, analyze and extract translation resources from Parallel Corpora. It includes tools like a sentence-aligner, a probabilistic translation dictionaries extractor, word-aligner, a corpus server, a set of tools to query corpora and dictionaries, as well as a set of tools to extract bilingual resources.Alberto Simoes has a scholarship from Fundacao para a Computacao Cientifica Nacional and the work reported here has been partially funded by Fundacao para a Ciencia e Tecnologia through project POSI/PLP/43931/2001, co-financed by POSI, and by POSC project POSC/339/1.3//NAC

    Similar works