Second Language Acquisition from Aligned Corpora

Abstract

The paper describes a system for automatic aligning and searching for translation equivalents in large bilingual corpora. This implementation was developed to facilitate our tasks in GLOSSER #343 Copernicus '94 Joint Research Project, where Linguistic Modeling Laboratory was charged especially with preparation of bilingual material. The GaleChurch algorithm is chosen as aligning procedure for parallel texts. The main functional characteristics of our system MARK ALISTeR (MARKing, ALIgning and Searching TRanslation equivalents ) are described. The program package can be used under MS Windows as an autonomous procedure of second language acquisition and CALL instrument. Evaluation of the results and the alignment errors of the algorithm and the tool is presented for different types of texts. 1 Introduction Large corpora are a well recognized basic resource for linguistic knowledge acquisition. When they are parallel and aligned, their role of text support for second language learning i..

    Similar works

    Full text

    thumbnail-image

    Available Versions