Robust Estimation of Feature Weights in Statistical Machine Translation

España Bonet, Cristina; Màrquez Villodre, Lluís

Authors: Cristina España Bonet
Lluís Màrquez Villodre
Publication date: 1 January 2010

Abstract

Weights of the various components in a standard Statistical Machine Translation model are usually estimated via Minimum Error Rate Training (MERT). This procedure finds the weight values that are optimal on a development set, with the expectation that they generalise well to other test sets. However, this is not always the case when domains differ. This work uses a perceptron algorithm to learn more robust weights that can be used on out-of-domain corpora without the need for specialised data. For an Arabic-to-English translation system, the generalised weights yield an improvement of more than 2 BLEU points over the MERT baseline using the same information.

Peer reviewed. Postprint (published version).
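The idea of learning feature weights with a perceptron can be illustrated with a minimal sketch. The abstract does not describe the authors' actual procedure, so everything below is an assumption made for illustration: the training data is taken to be n-best lists of candidate translations, each candidate carries a feature vector and a sentence-level quality score (for example a sentence-level BLEU approximation), and the update is the standard averaged perceptron. The names `perceptron_rerank`, `dot` and `Hypothesis` are hypothetical.

```python
from typing import List, Sequence, Tuple

# Hypothetical representation of one candidate translation:
# (feature vector, sentence-level quality score; higher is better).
Hypothesis = Tuple[Sequence[float], float]


def dot(w: Sequence[float], f: Sequence[float]) -> float:
    """Model score of a candidate: weighted sum of its feature values."""
    return sum(wi * fi for wi, fi in zip(w, f))


def perceptron_rerank(
    nbest_lists: List[List[Hypothesis]],
    n_features: int,
    epochs: int = 5,
    lr: float = 1.0,
) -> List[float]:
    """Averaged perceptron over n-best lists (illustrative assumption,
    not the procedure from the paper)."""
    w = [0.0] * n_features
    total = [0.0] * n_features  # running sum of weights for averaging
    steps = 0
    for _ in range(epochs):
        for nbest in nbest_lists:
            # Candidate the current weights would choose.
            predicted = max(nbest, key=lambda h: dot(w, h[0]))
            # Candidate with the best quality score (the "oracle").
            oracle = max(nbest, key=lambda h: h[1])
            if predicted[1] < oracle[1]:
                # Move the weights towards the oracle's features.
                for i in range(n_features):
                    w[i] += lr * (oracle[0][i] - predicted[0][i])
            for i in range(n_features):
                total[i] += w[i]
            steps += 1
    # Averaging the weights over all steps reduces sensitivity to the
    # order and idiosyncrasies of the training sentences.
    return [t / steps for t in total] if steps else w


# Toy data: two source sentences, two candidates each.
toy_nbest = [
    [([1.0, 0.0], 0.2), ([0.0, 1.0], 0.9)],
    [([2.0, 0.5], 0.1), ([0.5, 2.0], 0.8)],
]
weights = perceptron_rerank(toy_nbest, n_features=2)
```

On this toy data the learned weights favour the second feature, so reranking with them selects the higher-quality candidate in both lists. Averaging is one common way to make perceptron weights less dependent on a particular training set; whether the paper uses it is not stated in the abstract.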

Available Versions

UPCommons. Portal del coneixement obert de la UPC

oai:upcommons.upc.edu:2117/104...

Last time updated on 16/06/2016