Search CORE

47,226 research outputs found

Crowdsourcing Question-Answer Meaning Representations

Author: Dagan Ido
He Luheng
Michael Julian
Stanovsky Gabriel
Zettlemoyer Luke
Publication venue
Publication date: 15/11/2017
Field of study

We introduce Question-Answer Meaning Representations (QAMRs), which represent the predicate-argument structure of a sentence as a set of question-answer pairs. We also develop a crowdsourcing scheme to show that QAMRs can be labeled with very little training, and gather a dataset with over 5,000 sentences and 100,000 questions. A detailed qualitative analysis demonstrates that the crowd-generated question-answer pairs cover the vast majority of predicate-argument relationships in existing datasets (including PropBank, NomBank, QA-SRL, and AMR) along with many previously under-resourced ones, including implicit arguments and relations. The QAMR data and annotation code is made publicly available to enable future work on how best to model these complex phenomena.Comment: 8 pages, 6 figures, 2 table

arXiv.org e-Print Archive

Crossref

On the Generation of Medical Question-Answer Pairs

Author: Du Nan
Fan Wei
Ge Shen
Li Yaliang
Liang Xingzheng
Shen Sheng
Wang Kai
Wu Xian
Xie Yusheng
Yang Tao
Publication venue
Publication date: 06/12/2019
Field of study

Question answering (QA) has achieved promising progress recently. However, answering a question in real-world scenarios like the medical domain is still challenging, due to the requirement of external knowledge and the insufficient quantity of high-quality training data. In the light of these challenges, we study the task of generating medical QA pairs in this paper. With the insight that each medical question can be considered as a sample from the latent distribution of questions given answers, we propose an automated medical QA pair generation framework, consisting of an unsupervised key phrase detector that explores unstructured material for validity, and a generator that involves a multi-pass decoder to integrate structural knowledge for diversity. A series of experiments have been conducted on a real-world dataset collected from the National Medical Licensing Examination of China. Both automatic evaluation and human annotation demonstrate the effectiveness of the proposed method. Further investigation shows that, by incorporating the generated QA pairs for training, significant improvement in terms of accuracy can be achieved for the examination QA system.Comment: AAAI 202

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Cross Temporal Recurrent Networks for Ranking Question Answer Pairs

Author: Hui Siu Cheung
Tay Yi
Tuan Luu Anh
Publication venue
Publication date: 21/11/2017
Field of study

Temporal gates play a significant role in modern recurrent-based neural encoders, enabling fine-grained control over recursive compositional operations over time. In recurrent models such as the long short-term memory (LSTM), temporal gates control the amount of information retained or discarded over time, not only playing an important role in influencing the learned representations but also serving as a protection against vanishing gradients. This paper explores the idea of learning temporal gates for sequence pairs (question and answer), jointly influencing the learned representations in a pairwise manner. In our approach, temporal gates are learned via 1D convolutional layers and then subsequently cross applied across question and answer for joint learning. Empirically, we show that this conceptually simple sharing of temporal gates can lead to competitive performance across multiple benchmarks. Intuitively, what our network achieves can be interpreted as learning representations of question and answer pairs that are aware of what each other is remembering or forgetting, i.e., pairwise temporal gating. Via extensive experiments, we show that our proposed model achieves state-of-the-art performance on two community-based QA datasets and competitive performance on one factoid-based QA dataset.Comment: Accepted to AAAI201

arXiv.org e-Print Archive

Crossref

Association for the Advancement of Artificial Intelligence: AAAI Publications

Question/answer congruence and the semantics of wh-phrases

Author: Reich Ingo
Publication venue
Publication date: 14/11/2013
Field of study

This paper is about the semantics of wh-phrases. It is argued that wh-phrases should not be analyzed as indefinites as, for example, Karttunen (1977) and many others have done, but as functional expressions with an indefinite core -their function being to restrict possible focus/background structures in direct or congruent answers. This will be argued for on the basis of observations made with respect to the distribution of term answers in well-formed question/answer sequences. This claim having been established, it will be integrated in a categorial variant of Schwarzschild's (1999) information-theoretic approach to F-marking and accent placement, and – second – its consequences with respect to the focus/background structure of wh-questions will be outlined

Hochschulschriftenserver - Universität Frankfurt am Main

Word Embedding based Correlation Model for Question/Answer Matching

Author: Jiang Nan
Peng Baolin
Rong Wenge
Shen Yikang
Tang Jie
Xiong Zhang
Publication venue
Publication date: 25/11/2016
Field of study

With the development of community based question answering (Q&A) services, a large scale of Q&A archives have been accumulated and are an important information and knowledge resource on the web. Question and answer matching has been attached much importance to for its ability to reuse knowledge stored in these systems: it can be useful in enhancing user experience with recurrent questions. In this paper, we try to improve the matching accuracy by overcoming the lexical gap between question and answer pairs. A Word Embedding based Correlation (WEC) model is proposed by integrating advantages of both the translation model and word embedding, given a random pair of words, WEC can score their co-occurrence probability in Q&A pairs and it can also leverage the continuity and smoothness of continuous space word representation to deal with new pairs of words that are rare in the training parallel text. An experimental study on Yahoo! Answers dataset and Baidu Zhidao dataset shows this new method's promising potential.Comment: 8 pages, 2 figure

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Generating Factoid Questions With Recurrent Neural Networks: The 30M Factoid Question-Answer Corpus

Author: Ahn Sungjin
Bengio Yoshua
Chandar Sarath
Courville Aaron
García-Durán Alberto
Gulcehre Caglar
Serban Iulian Vlad
Publication venue
Publication date: 01/01/2016
Field of study

Over the past decade, large-scale supervised learning corpora have enabled machine learning researchers to make substantial advances. However, to this date, there are no large-scale question-answer corpora available. In this paper we present the 30M Factoid Question-Answer Corpus, an enormous question answer pair corpus produced by applying a novel neural network architecture on the knowledge base Freebase to transduce facts into natural language questions. The produced question answer pairs are evaluated both by human evaluators and using automatic evaluation metrics, including well-established machine translation and sentence similarity metrics. Across all evaluation criteria the question-generation model outperforms the competing template-based baseline. Furthermore, when presented to human evaluators, the generated questions appear comparable in quality to real human-generated questions.Comment: 13 pages, 1 figure, 7 table

arXiv.org e-Print Archive

Crossref

PolyPublie