Phrase-Based Language Model in Statistical Machine Translation

Ben Hamadou, AbdelMajid; Ben Romdhane, Achraf; Jamoussi, Salma; Smaïli, Kamel

Phrase-Based Language Model in Statistical Machine Translation

Authors: AbdelMajid Ben Hamadou
Achraf Ben Romdhane
Salma Jamoussi
Kamel Smaïli
Publication date: 8 December 2016
Publisher: Alexander Gelbukh

Abstract

La date de publication ne nous a pas encore été communiquéeInternational audienceAs one of the most important modules in statistical machine translation (SMT), language model measures whether one translation hypothesis is more grammatically correct than other hypotheses. Currently the state-of-the-art SMT systems use standard word n-gram models, whereas the translation model is phrase-based. In this paper, the idea is to use a phrase-based language model. For that, target portion of the translation table are retrieved and used to rewrite the training corpus and to calculate a phrase n-gram language model. In this work, weperform experiments with two language models word-based (WBLM) and phrase-based (PBLM). The different SMT are trained with threeoptimization algorithms MERT, MIRA and PRO. Thus, the PBLM systems are compared to the baseline system in terms of BLUE and TER.The experimental results show that the use of a phrase-based language model in SMT can improve results and is especially able to reduce theerror rate

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

INRIA a CCSD electronic archive server

oai:HAL:hal-01336485v1

Last time updated on 09/11/2016

Archive Ouverte en Sciences de l'Information et de la Communication

oai:HAL:hal-01336485v1

Last time updated on 08/03/2017