An Algorithm for Aligning Sentences in Bilingual Corpora Using Lexical Information
Abstract
In this paper we describe an algorithm for aligning sentences with their translations in a bilingual corpus using lexical information about the languages. Existing efficient algorithms ignore word identities and consider only sentence lengths. Our algorithm gives results comparable to those of the existing algorithms in most cases, and does better in cases where statistical algorithms do not give good results.
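To make the contrast between length-based and lexical evidence concrete, here is a minimal sketch. It is not the algorithm described in this paper: the toy dictionary `TOY_DICT` and the functions `length_score` and `lexical_score` are hypothetical names introduced only for illustration, and a real system would use a proper bilingual lexicon and a search over whole-document alignments.

```python
# Hypothetical toy English -> French dictionary (an assumption for illustration).
TOY_DICT = {
    "the": {"le", "la"},
    "red": {"rouge"},
    "house": {"maison"},
    "cat": {"chat"},
}


def length_score(src: str, tgt: str) -> float:
    """Length-only similarity: ratio of the shorter to the longer character count."""
    a, b = len(src), len(tgt)
    if max(a, b) == 0:
        return 1.0
    return min(a, b) / max(a, b)


def lexical_score(src: str, tgt: str, dictionary=TOY_DICT) -> float:
    """Fraction of source words that have a dictionary translation present in tgt."""
    src_words = src.lower().split()
    tgt_words = set(tgt.lower().split())
    if not src_words:
        return 0.0
    hits = sum(1 for w in src_words if dictionary.get(w, set()) & tgt_words)
    return hits / len(src_words)


source = "the red house"
candidates = ["le chat rouge", "la maison rouge"]

# Pick the best candidate under each scoring function.
best_by_length = max(candidates, key=lambda t: length_score(source, t))
best_by_lexicon = max(candidates, key=lambda t: lexical_score(source, t))
```

In this toy case the length-only score prefers "le chat rouge", which happens to have the same character count as the source, while the lexical score prefers the actual translation "la maison rouge". This is the kind of situation in which word identities can help where sentence lengths alone are misleading.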