Article thumbnail
Location of Repository

Statistical Phrase-based Post-editing

By Michel Simard, Cyril Goutte and Pierre Isabelle

Abstract

We propose to use a statistical phrase-based machine translation system in a post-editing task: the system takes as input raw machine translation output (from a commercial rule-based MT system), and produces post-edited target-language text. We report on experiments that were performed on data collected in precisely such a setting: pairs of raw MT output and their manually post-edited versions. In our evaluation, the output of our automatic post-editing (APE) system is not only better quality than the rule-based MT (both in terms of the BLEU and TER metrics), it is also better than the output of a state-of-the-art phrase-based MT system used in standalone translation mode. These results indicate that automatic post-editing constitutes a simple and efficient way of combining rule-based and statistical MT technologies

Topics: Computational Linguistics, Machine Learning, Artificial Intelligence
Year: 2007
OAI identifier: oai:cogprints.org:5627

Suggested articles

Citations

  1. (2006). A Study of Translation Edit Rate with Targeted Human An-notation.
  2. (1994). Automated Postediting of Documents.
  3. (2004). Case study: Implementing MT forthe translation of pre-sales marketing and post-salessoftware deployment documentation.
  4. (2006). Phrasetable Smoothing for Statistical MachineTranslation.
  5. (1993). The Mathematicsof Statistical Machine Translation: Parameter Estimation.
  6. (2006). Transformation-based correctionsof rule-based MT.

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.