String Transduction with Target Language Models and Insertion Handling

Authors: Grzegorz Kondrak, Saeed Najafi, Garrett Nicolai
Publication date: 19/09/2018

Many character-level tasks can be framed as sequence-to-sequence transduction, where the target is a word from a natural language. We show that leveraging target language models derived from unannotated target corpora, combined with a precise alignment of the training data, yields state-of-the-art results on cognate projection, inflection generation, and phoneme-to-grapheme conversion.

Comment: 8 pages + 1 page Appendix, 4 figures and 8 tables, plus an additional 1 figure and 2 tables in appendix; to appear at SIGMORPHON 2018, October, 201

arXiv.org e-Print Archive