Optimising Multiple Metrics with MERT

Schwenk, Holger; Servan, Christophe

Optimising Multiple Metrics with MERT

Authors: Holger Schwenk
Christophe Servan
Publication date: 1 January 2011
Publisher: HAL CCSD

Abstract

International audienceThe main metric used for SMT systems evaluation an optimisation is BLEU score but this metric is questioned about its relevance to human evaluation. Some other metrics already exist but none of them are in perfect harmony with human evaluation. On the other hand, most evaluations use multiple metrics (BLEU, TER, METEOR, etc.). Systems can optimise toward other metrics than BLEU. But optimisation with other metrics tends to decrease BLEU score. As Machine Translation evaluations still use BLEU as main metric, it is important to min-imise the decrease of BLEU. We propose to optimise toward a metric combination like BLEU-TER. This proposition includes two new open source scorers for MERT, the SMT optimisation tool. The first one is a TER scorer that allows us to optimise toward TER; the second one is a combination scorer. The latter one enables the combination of two or more metrics for the optimisation process. This paper also presents some experiments on the MERT optimisation in the Statistical Machine Translation system Moses with the TER and the BLEU metrics and some metric combinations

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

HAL Descartes

oai:HAL:hal-01157949v1

Last time updated on 14/04/2021

Archive Ouverte en Sciences de l'Information et de la Communication

oai:HAL:hal-01157949v1

Last time updated on 13/10/2017