Simple and Effective Curriculum Pointer-Generator Networks for Reading Comprehension over Long Narratives
This paper tackles the problem of reading comprehension over long narratives
where documents easily span over thousands of tokens. We propose a curriculum
learning (CL) based Pointer-Generator framework for reading/sampling over large
documents, enabling diverse training of the neural model based on the notion of
alternating contextual difficulty. This can be interpreted as a form of domain
randomization and/or generative pretraining during training. To this end, the
usage of the Pointer-Generator softens the requirement of having the answer
within the context, enabling us to construct diverse training samples for
learning. Additionally, we propose a new Introspective Alignment Layer (IAL),
which reasons over decomposed alignments using block-based self-attention. We
evaluate our proposed method on the NarrativeQA reading comprehension
benchmark, achieving state-of-the-art performance with relative improvements
over existing baselines on both BLEU-4 and Rouge-L. Extensive ablations confirm
the effectiveness of our proposed IAL and CL components. Comment: Accepted to ACL 2019
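As a rough illustration of the block-based self-attention idea described in the abstract, the sketch below soft-aligns a long context to the query, builds decomposed comparison features (concatenation, difference, product), and then applies self-attention within fixed-size blocks so the cost stays linear in context length. The shapes, block size, and fusion projection are assumptions for illustration, not the paper's exact IAL formulation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class BlockAlignSelfAttention(nn.Module):
    """Align context to query, fuse decomposed comparison features, then
    apply self-attention inside fixed-size blocks (O(L * block) cost)."""
    def __init__(self, d, block=64):
        super().__init__()
        self.block = block
        self.proj = nn.Linear(4 * d, d)  # fuse [ctx, aligned, ctx-aligned, ctx*aligned]

    def forward(self, context, query):
        # context: (B, Lc, d), query: (B, Lq, d)
        scores = torch.bmm(context, query.transpose(1, 2))          # (B, Lc, Lq)
        aligned = torch.bmm(F.softmax(scores, dim=-1), query)       # (B, Lc, d)
        x = self.proj(torch.cat(
            [context, aligned, context - aligned, context * aligned], dim=-1))
        B, L, d = x.shape
        pad = (-L) % self.block                                      # pad L to a block multiple
        x = F.pad(x, (0, 0, 0, pad))
        n_blocks = x.shape[1] // self.block
        blocks = x.reshape(B * n_blocks, self.block, d)
        attn = F.softmax(torch.bmm(blocks, blocks.transpose(1, 2)) / d ** 0.5, dim=-1)
        out = torch.bmm(attn, blocks).reshape(B, n_blocks * self.block, d)
        return out[:, :L]                                            # drop padding

# Example: a 2,048-token narrative context attended within 64-token blocks.
layer = BlockAlignSelfAttention(d=128, block=64)
out = layer(torch.randn(2, 2048, 128), torch.randn(2, 30, 128))      # -> (2, 2048, 128)
```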
Paraphrase Generation with Deep Reinforcement Learning
Automatic generation of paraphrases from a given sentence is an important yet
challenging task in natural language processing (NLP), and plays a key role in
a number of applications such as question answering, search, and dialogue. In
this paper, we present a deep reinforcement learning approach to paraphrase
generation. Specifically, we propose a new framework for the task, which
consists of a \textit{generator} and an \textit{evaluator}, both of which are
learned from data. The generator, built as a sequence-to-sequence learning
model, can produce paraphrases given a sentence. The evaluator, constructed as
a deep matching model, can judge whether two sentences are paraphrases of each
other. The generator is first trained by deep learning and then further
fine-tuned by reinforcement learning in which the reward is given by the
evaluator. For the learning of the evaluator, we propose two methods based on
supervised learning and inverse reinforcement learning respectively, depending
on the type of available training data. Empirical study shows that the learned
evaluator can guide the generator to produce more accurate paraphrases.
Experimental results demonstrate that the proposed models (the generators)
outperform the state-of-the-art methods in paraphrase generation in both
automatic and human evaluation. Comment: EMNLP 2018
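The described fine-tuning stage lends itself to a simple REINFORCE-style sketch: the evaluator's matching score serves as the reward for paraphrases sampled from the generator. The `generator.sample` and `evaluator.match_score` interfaces below are hypothetical placeholders standing in for the seq2seq generator and deep matching evaluator, not the authors' implementation.

```python
import torch

def rl_finetune_step(generator, evaluator, optimizer, src_batch):
    """One REINFORCE update: the evaluator's matching score is the reward."""
    # Sample paraphrases plus per-token log-probabilities from the seq2seq generator.
    samples, log_probs = generator.sample(src_batch)          # (B, T) token ids, (B, T) log p
    with torch.no_grad():
        reward = evaluator.match_score(src_batch, samples)    # (B,) paraphrase score
        baseline = reward.mean()                              # simple variance-reduction baseline
    # Maximize expected reward  <=>  minimize -(r - b) * sum_t log p(y_t)
    loss = -((reward - baseline).unsqueeze(1) * log_probs).sum(dim=1).mean()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item(), reward.mean().item()
```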
