Simple and Effective Curriculum Pointer-Generator Networks for Reading Comprehension over Long Narratives
This paper tackles the problem of reading comprehension over long narratives
where documents easily span over thousands of tokens. We propose a curriculum
learning (CL) based Pointer-Generator framework for reading/sampling over large
documents, enabling diverse training of the neural model based on the notion of
alternating contextual difficulty. This can be interpreted as a form of domain
randomization and/or generative pretraining during training. To this end, the
usage of the Pointer-Generator softens the requirement of having the answer
within the context, enabling us to construct diverse training samples for
learning. Additionally, we propose a new Introspective Alignment Layer (IAL),
which reasons over decomposed alignments using block-based self-attention. We
evaluate our proposed method on the NarrativeQA reading comprehension
benchmark, achieving state-of-the-art performance and improving over existing
baselines by a relative margin on both BLEU-4 and Rouge-L. Extensive ablations
confirm the effectiveness of our proposed IAL and CL components.
Comment: Accepted to ACL 2019
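Since the abstract highlights block-based self-attention inside the proposed Introspective Alignment Layer, a minimal sketch of block-restricted self-attention may help make the idea concrete. This is an illustration only, not the paper's implementation: the function name block_self_attention, the parameter block_size, and the assumption that the sequence length divides evenly into blocks are all mine.

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def block_self_attention(q, k, v, block_size=64):
    """Self-attention restricted to fixed-size local blocks.

    q, k, v: (seq_len, d) arrays; for simplicity this sketch
    assumes seq_len is a multiple of block_size.
    """
    seq_len, d = q.shape
    n_blocks = seq_len // block_size
    out = np.empty_like(v)
    for b in range(n_blocks):
        s = slice(b * block_size, (b + 1) * block_size)
        # Scaled dot-product attention computed within the block only.
        scores = q[s] @ k[s].T / np.sqrt(d)   # (block, block)
        out[s] = softmax(scores) @ v[s]       # (block, d)
    return out

# Toy usage: a 256-token "document" with 32-dim hidden states.
rng = np.random.default_rng(0)
x = rng.standard_normal((256, 32))
y = block_self_attention(x, x, x, block_size=64)
print(y.shape)  # (256, 32)
```

The point of blocking is cost: full self-attention over a sequence of length n is O(n^2), while block-restricted attention is O(n * block_size), which is what makes attention-style reasoning plausible over narratives spanning thousands of tokens.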
Teaching Machines to Read and Comprehend
Teaching machines to read natural language documents remains an elusive
challenge. Machine reading systems can be tested on their ability to answer
questions posed on the contents of documents that they have seen, but until now
large scale training and test datasets have been missing for this type of
evaluation. In this work we define a new methodology that resolves this
bottleneck and provides large scale supervised reading comprehension data. This
allows us to develop a class of attention based deep neural networks that learn
to read real documents and answer complex questions with minimal prior
knowledge of language structure.
Comment: Appears in: Advances in Neural Information Processing Systems 28
(NIPS 2015). 14 pages, 13 figures
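The attention-based readers this paper introduces score each encoded document token against an encoded query and read out a weighted summary. The sketch below follows the general shape of that idea (a query vector attends over document states, and the attended summary is combined with the query); the weight matrices here are random placeholders and the helper names are illustrative, so treat it as a toy, not the paper's model.

```python
import numpy as np

def softmax(x):
    x = x - x.max()
    e = np.exp(x)
    return e / e.sum()

def attentive_read(doc_states, query_vec, params):
    """One attention step: the query scores each document token
    state, and the weighted sum of states is fused with the query
    into a joint representation used to predict the answer."""
    W_ym, W_um, w, W_rg, W_ug = params
    # Token-wise match between document states and the query.
    m = np.tanh(doc_states @ W_ym.T + query_vec @ W_um.T)  # (T, h)
    s = softmax(m @ w)                                     # (T,) attention weights
    r = s @ doc_states                                     # attended document summary
    g = np.tanh(W_rg @ r + W_ug @ query_vec)               # joint doc/query embedding
    return g, s

# Toy usage with random parameters (d = state size, h = attention size).
rng = np.random.default_rng(1)
d, h, T = 16, 8, 50
params = (rng.standard_normal((h, d)), rng.standard_normal((h, d)),
          rng.standard_normal(h), rng.standard_normal((d, d)),
          rng.standard_normal((d, d)))
doc = rng.standard_normal((T, d))   # encoded document token states
query = rng.standard_normal(d)      # encoded question
g, attn = attentive_read(doc, query, params)
print(g.shape, attn.sum())          # (16,) and attention weights summing to ~1
```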