Search CORE

9 research outputs found

Tree-Structured Neural Topic Model

Author: Bollegala Danushka
Isonuma Masaru
Linguist Assoc Computat
Mori Junichiro
Sakata Ichiro
Publication venue
Publication date: 01/01/2020
Field of study

University of Liverpool Repository

Crossref

Entity-Enriched Neural Models for Clinical Question Answering

Author: Linguist Assoc Computat
Min So Yeon
Raghavan Preethi
Rawat Bhanu Pratap Singh
Szolovits Peter
Weng Wei-Hung
Publication venue
Publication date: 26/01/2021
Field of study

We explore state-of-the-art neural models for question answering on electronic medical records and improve their ability to generalize better on previously unseen (paraphrased) questions at test time. We enable this by learning to predict logical forms as an auxiliary task along with the main task of answer span detection. The predicted logical forms also serve as a rationale for the answer. Further, we also incorporate medical entity information in these models via the ERNIE architecture. We train our models on the large-scale emrQA dataset and observe that our multi-task entity-enriched models generalize to paraphrased questions ~5% better than the baseline BERT model

arXiv.org e-Print Archive

DSpace@MIT

Recommended from our members

Improving Bilingual Lexicon Induction with Unsupervised Post-Processing of Monolingual Word Vector Spaces

Author: Glavas Goran
Korhonen Anna
Linguist Assoc Computat
Vulic Ivan
Publication venue: 5TH WORKSHOP ON REPRESENTATION LEARNING FOR NLP (REPL4NLP-2020)
Publication date: 01/01/2020
Field of study

Work on projection-based induction of cross-lingual word embedding spaces (CLWEs) predominantly focuses on the improvement of the projection (i.e., mapping) mechanisms. In this work, in contrast, we show that a simple method for post-processing monolingual embedding spaces facilitates learning of the cross-lingual alignment and, in turn, substantially improves bilingual lexicon induction (BLI). The post-processing method we examine is grounded in the generalisation of first- and second-order monolingual similarities to the nth-order similarity. By post-processing monolingual spaces before the cross-lingual alignment, the method can be coupled with any projection-based method for inducing CLWE spaces. We demonstrate the effectiveness of this simple monolingual post-processing across a set of 15 typologically diverse languages (i.e., 15*14 BLI setups), and in combination with two different projection methods

Apollo (Cambridge)

Recommended from our members

Will-They-Won't-They: A Very Large Dataset for Stance Detection on Twitter

Author: Berndt Jakob
Collier Nigel
Conforti Costanza
Giannitsarou Chryssi
Linguist Assoc Computat
Pilehvar Mohammad Taher
Toxvaerd Flavio
Publication venue: 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020)
Publication date: 01/01/2020
Field of study

We present a new challenging stance detection dataset, called Will-They-Won’t-They (WT--WT), which contains 51,284 tweets in English, making it by far the largest available dataset of the type. All the annotations are carried out by experts; therefore, the dataset constitutes a high-quality and reliable benchmark for future research in stance detection. Our experiments with a wide range of recent state-of-the-art stance detection systems show that the dataset poses a strong challenge to existing models in this domain.Keynes Fund, Cambridg

Apollo (Cambridge)

CD2 CR: Co-reference Resolution Across Documents and Domains:16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021)

Author: Caftan A
Clare A
Dagan I
Liakata M
Linguist Assoc Computat
Ravenscroft J
Publication venue
Publication date: 01/01/2021
Field of study

Cross-document co-reference resolution (CDCR) is the task of identifying and linking mentions to entities and concepts across many text documents. Current state-of-the-art models for this task assume that all documents are of the same type (e.g. news articles) or fall under the same theme. However, it is also desirable to perform CDCR across different domains (type or theme). A particular use case we focus on in this paper is the resolution of entities mentioned across scientific work and newspaper articles that discuss them. Identifying the same entities and corresponding concepts in both scientific articles and news can help scientists understand how their work is represented in mainstream media. We propose a new task and English language dataset for cross-document cross-domain co-reference resolution (CD2 CR). The task aims to identify links between entities across heterogeneous document types. We show that in this cross-domain, cross-document setting, existing CDCR models do not perform well and we provide a baseline model that outperforms current state-of-the-art CDCR models on CD2 CR. Our data set, annotation tool and guidelines as well as our model for cross-document cross-domain co-reference are all supplied as open access open source resources

Aberystwyth Research Portal

Recommended from our members

Span-ConveRT: Few-shot Span Extraction for Dialog with Pretrained Conversational Representations

Author: Coope Sam
Farghly Tyler
Gerz Daniela
Henderson Matthew
Linguist Assoc Computat
Vulic Ivan
Publication venue: 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020)
Publication date: 01/01/2020
Field of study

We introduce Span-ConveRT, a light-weight model for dialog slot-filling which frames the task as a turn-based span extraction task. This formulation allows for a simple integration of conversational knowledge coded in large pretrained conversational models such as ConveRT (Henderson et al., 2019). We show that leveraging such knowledge in Span-ConveRT is especially useful for few-shot learning scenarios: we report consistent gains over 1) a span extractor that trains representations from scratch in the target domain, and 2) a BERT-based span extractor. In order to inspire more work on span extraction for the slot-filling task, we also release RESTAURANTS-8K, a new challenging data set of 8,198 utterances, compiled from actual conversations in the restaurant booking domain

Apollo (Cambridge)

Entity-Enriched Neural Models for Clinical Question Answering

Author: Linguist Assoc Computat
Min So Yeon
Raghavan Preethi
Rawat Bhanu Pratap Singh
Szolovits Peter
Weng Wei-Hung
Publication venue
Publication date: 26/01/2021
Field of study

DSpace@MIT

An empirical investigation of neural methods for content scoring of science explanations

Author: Bichler Sarah
Bradford Allison
Chen Jennifer King
Gerard Libby
Linguist Assoc Computat
Linn Marcia C
Riordan Brian
Wiley Korah
Publication venue: eScholarship, University of California
Publication date: 01/01/2020
Field of study

With the widespread adoption of the Next Generation Science Standards (NGSS), science teachers and online learning environments face the challenge of evaluating students' integration of different dimensions of science learning. Recent advances in representation learning in natural language processing have proven effective across many natural language processing tasks, but a rigorous evaluation of the relative merits of these methods for scoring complex constructed response formative assessments has not previously been carried out. We present a detailed empirical investigation of feature-based, recurrent neural network, and pre-trained transformer models on scoring content in real-world formative assessment data. We demonstrate that recent neural methods can rival or exceed the performance of feature-based methods. We also provide evidence that different classes of neural models take advantage of different learning cues, and pre-trained transformer models may be more robust to spurious, dataset-specific learning cues, better reflecting scoring rubrics

Crossref

eScholarship - University of California