Word- and sentence-level confidence measures for machine translation

Langlois, David; Lavecchia, Caroline; Raybaud, Sylvain; Smaïli, Kamel

Authors: David Langlois, Caroline Lavecchia, Sylvain Raybaud, Kamel Smaïli
Publication date: 14 May 2009
Publisher: HAL CCSD

Abstract

A machine-translated sentence is seldom completely correct. Confidence measures are designed to detect incorrect words, phrases or sentences, or to provide an estimate of the probability of correctness. In this article we describe several word- and sentence-level confidence measures relying on different features: mutual information between words, n-gram and backward n-gram language models, and linguistic features. We also try different combinations of these measures. Their accuracy is evaluated on a classification task. We achieve a 17% error rate (0.84 F-measure) at the word level and a 31% error rate (0.71 F-measure) at the sentence level.

Available Versions

INRIA a CCSD electronic archive server

oai:HAL:inria-00417541v1

Last time updated on 09/11/2016