Search CORE

16 research outputs found

Adapting Sequence Models for Sentence Correction

Author: Kim Yoon
Rush Alexander M.
Schmaltz Allen
Shieber Stuart M.
Publication venue
Publication date: 01/01/2017
Field of study

In a controlled experiment of sequence-to-sequence approaches for the task of sentence correction, we find that character-based models are generally more effective than word-based models and models that encode subword information via convolutions, and that modeling the output data as a series of diffs improves effectiveness over standard approaches. Our strongest sequence-to-sequence model improves over our strongest phrase-based statistical machine translation model, with access to the same data, by 6 M2 (0.5 GLEU) points. Additionally, in the data environment of the standard CoNLL-2014 setup, we demonstrate that modeling (and tuning against) diffs yields similar or better M2 scores with simpler models and/or significantly less data than previous sequence-to-sequence approaches.Comment: EMNLP 201

arXiv.org e-Print Archive

Crossref

Retrieve and Refine: Improved Sequence Generation Models For Dialogue

Author: Dinan Emily
Miller Alexander H.
Weston Jason
Publication venue
Publication date: 01/01/2018
Field of study

Sequence generation models for dialogue are known to have several problems: they tend to produce short, generic sentences that are uninformative and unengaging. Retrieval models on the other hand can surface interesting responses, but are restricted to the given retrieval set leading to erroneous replies that cannot be tuned to the specific context. In this work we develop a model that combines the two approaches to avoid both their deficiencies: first retrieve a response and then refine it -- the final sequence generator treating the retrieval as additional context. We show on the recent CONVAI2 challenge task our approach produces responses superior to both standard retrieval and generation models in human evaluations

arXiv.org e-Print Archive

Crossref