1 research outputs found

    String Transduction with Target Language Models and Insertion Handling

    Full text link
    Many character-level tasks can be framed as sequence-to-sequence transduction, where the target is a word from a natural language. We show that leveraging target language models derived from unannotated target corpora, combined with a precise alignment of the training data, yields state-of-the art results on cognate projection, inflection generation, and phoneme-to-grapheme conversion.Comment: 8 pages + 1 page Appendix, 4 figures and 8 tables, plus an additional 1 figure and 2 tables in appendix; to appear at SIGMORPHON 2018, October, 201
    corecore