Source-side context-informed hypothesis alignment for combining outputs from machine translation systems

Du, Jinhua; Ma, Yanjun; Way, Andy

research

Source-side context-informed hypothesis alignment for combining outputs from machine translation systems

Authors: Jinhua Du
Yanjun Ma
Andy Way
Publication date: 1 January 2009
Publisher

Abstract

This paper presents a new hypothesis alignment method for combining outputs of multiple machine translation (MT) systems. Traditional hypothesis alignment algorithms such as TER, HMM and IHMM do not directly utilise the context information of the source side but rather address the alignment issues via the output data itself. In this paper, a source-side context-informed (SSCI) hypothesis alignment method is proposed to carry out the word alignment and word reordering issues. First of all, the source–target word alignment links are produced as the hidden variables by exporting source phrase spans during the translation decoding process. Secondly, a mapping strategy and normalisation model are employed to acquire the 1- to-1 alignment links and build the confusion network (CN). The source-side context-based method outperforms the state-of-the-art TERbased alignment model in our experiments on the WMT09 English-to-French and NIST Chinese-to-English data sets respectively. Experimental results demonstrate that our proposed approach scores consistently among the best results across different data and language pair conditions

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Name not available

oai:doras.dcu.ie:15163

Last time updated on 09/02/2018

Irish Universities

Last time updated on 30/12/2017

DCU Online Research Access Service

oai:doras.dcu.ie:15163

Last time updated on 10/07/2013