Skip to main content
Article thumbnail
Location of Repository

Phrase Linguistic Classification and Generalization for Improving Statistical Machine Translation

By Adrià De Gispert

Abstract

In this paper a method to incorporate linguistic information regarding single-word and compound verbs is proposed, as a first step towards an SMT model based on linguistically-classified phrases. By substituting these verb structures by the base form of the head verb, we achieve a better statistical word alignment performance, and are able to better estimate the translation model and generalize to unseen verb forms during translation. Preliminary experiments for the English- Spanish language pair are performed, and future research lines are detailed

Year: 2005
OAI identifier: oai:CiteSeerX.psu:10.1.1.134.2220
Provided by: CiteSeerX
Download PDF:
Sorry, we are unable to provide the full text but you may find it at the following location(s):
  • http://citeseerx.ist.psu.edu/v... (external link)
  • http://mi.eng.cam.ac.uk/~ad465... (external link)
  • Suggested articles


    To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.