Search CORE

24,263 research outputs found

Neural Generative Question Answering

Author: Jiang Xin
Li Hang
Li Xiaoming
Lu Zhengdong
Shang Lifeng
Yin Jun
Publication venue
Publication date: 01/01/2016
Field of study

This paper presents an end-to-end neural network model, named Neural Generative Question Answering (GENQA), that can generate answers to simple factoid questions, based on the facts in a knowledge-base. More specifically, the model is built on the encoder-decoder framework for sequence-to-sequence learning, while equipped with the ability to enquire the knowledge-base, and is trained on a corpus of question-answer pairs, with their associated triples in the knowledge-base. Empirical study shows the proposed model can effectively deal with the variations of questions and answers, and generate right and natural answers by referring to the facts in the knowledge-base. The experiment on question answering demonstrates that the proposed model can outperform an embedding-based QA model as well as a neural dialogue model trained on the same data.Comment: Accepted by IJCAI 201

arXiv.org e-Print Archive

Crossref

Open-Retrieval Conversational Question Answering

Author: Chen Y.
Chuklin A.
Clark C.
Das R.
Devlin J.
Dhingra B.
Dunn M.
Garg S.
Huang H.-Y.
Johnson J.
Kwiatkowski T.
Lan Z.-Z.
Nguyen T.
Reddy S.
Shrivastava A.
Thomas P.
Trippas J. R.
Trischler A.
Vaswani A.
Voorhees E. M.
Wang M.
Wang S.
Wu Y.
Yang L.
Yang W.
Yatskar M.
Zhang Y.
Zhu C.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 22/05/2020
Field of study

Conversational search is one of the ultimate goals of information retrieval. Recent research approaches conversational search by simplified settings of response ranking and conversational question answering, where an answer is either selected from a given candidate set or extracted from a given passage. These simplifications neglect the fundamental role of retrieval in conversational search. To address this limitation, we introduce an open-retrieval conversational question answering (ORConvQA) setting, where we learn to retrieve evidence from a large collection before extracting answers, as a further step towards building functional conversational search systems. We create a dataset, OR-QuAC, to facilitate research on ORConvQA. We build an end-to-end system for ORConvQA, featuring a retriever, a reranker, and a reader that are all based on Transformers. Our extensive experiments on OR-QuAC demonstrate that a learnable retriever is crucial for ORConvQA. We further show that our system can make a substantial improvement when we enable history modeling in all system components. Moreover, we show that the reranker component contributes to the model performance by providing a regularization effect. Finally, further in-depth analyses are performed to provide new insights into ORConvQA.Comment: Accepted to SIGIR'2

arXiv.org e-Print Archive

Crossref

Learning to Rank Question Answer Pairs with Holographic Dual LSTM Architecture

Author: Chang Ming-Wei
Heilman Michael
Hu Baotian
Luu Anh Tuan
Mikolov Tomas
Nickel Maximilian
Plate Tony
Qiu Xipeng
Robertson Stephen E.
Wang Di
Wang Mengqiu
Yao Xuchen
Zhou Guangyou
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 20/07/2017
Field of study

We describe a new deep learning architecture for learning to rank question answer pairs. Our approach extends the long short-term memory (LSTM) network with holographic composition to model the relationship between question and answer representations. As opposed to the neural tensor layer that has been adopted recently, the holographic composition provides the benefits of scalable and rich representational learning approach without incurring huge parameter costs. Overall, we present Holographic Dual LSTM (HD-LSTM), a unified architecture for both deep sentence modeling and semantic matching. Essentially, our model is trained end-to-end whereby the parameters of the LSTM are optimized in a way that best explains the correlation between question and answer representations. In addition, our proposed deep learning architecture requires no extensive feature engineering. Via extensive experiments, we show that HD-LSTM outperforms many other neural architectures on two popular benchmark QA datasets. Empirical studies confirm the effectiveness of holographic composition over the neural tensor layer.Comment: SIGIR 2017 Full Pape

arXiv.org e-Print Archive

Crossref

Answer Sequence Learning with Neural Networks for Answer Selection in Community Question Answering

Author: Chen Qingcai
Hu Baotian
Tang Buzhou
Wang Xiaolong
Zhou Xiaoqiang
Publication venue
Publication date: 01/01/2015
Field of study

In this paper, the answer selection problem in community question answering (CQA) is regarded as an answer sequence labeling task, and a novel approach is proposed based on the recurrent architecture for this problem. Our approach applies convolution neural networks (CNNs) to learning the joint representation of question-answer pair firstly, and then uses the joint representation as input of the long short-term memory (LSTM) to learn the answer sequence of a question for labeling the matching quality of each answer. Experiments conducted on the SemEval 2015 CQA dataset shows the effectiveness of our approach.Comment: 6 page

arXiv.org e-Print Archive

Crossref