Exploiting source similarity for SMT using context-informed features

Stroppa, Nicolas; van den Bosch, Antal; Way, Andy

Publication date: 1 January 2007

Abstract

In this paper, we introduce context-informed features in a log-linear phrase-based SMT framework; these features enable us to exploit source similarity in addition to the target similarity modeled by the language model. We present a memory-based classification framework that enables the estimation of these features while avoiding sparseness problems. We evaluate the performance of our approach on Italian-to-English and Chinese-to-English translation tasks using a state-of-the-art phrase-based SMT system, and report significant improvements in both BLEU and NIST scores when adding the context-informed features.

Available Versions

DCU Online Research Access Service

oai:doras.dcu.ie:15226
