358 research outputs found
Neural Natural Language Inference Models Enhanced with External Knowledge
Modeling natural language inference is a very challenging task. With the
availability of large annotated data, it has recently become feasible to train
complex models such as neural-network-based inference models, which have shown
to achieve the state-of-the-art performance. Although there exist relatively
large annotated data, can machines learn all knowledge needed to perform
natural language inference (NLI) from these data? If not, how can
neural-network-based NLI models benefit from external knowledge and how to
build NLI models to leverage it? In this paper, we enrich the state-of-the-art
neural natural language inference models with external knowledge. We
demonstrate that the proposed models improve neural NLI models to achieve the
state-of-the-art performance on the SNLI and MultiNLI datasets.Comment: Accepted by ACL 201
Transfer Learning for Sequence Labeling Using Source Model and Target Data
In this paper, we propose an approach for transferring the knowledge of a
neural model for sequence labeling, learned from the source domain, to a new
model trained on a target domain, where new label categories appear. Our
transfer learning (TL) techniques enable to adapt the source model using the
target data and new categories, without accessing to the source data. Our
solution consists in adding new neurons in the output layer of the target model
and transferring parameters from the source model, which are then fine-tuned
with the target data. Additionally, we propose a neural adapter to learn the
difference between the source and the target label distribution, which provides
additional important information to the target model. Our experiments on Named
Entity Recognition show that (i) the learned knowledge in the source model can
be effectively transferred when the target data contains new categories and
(ii) our neural adapter further improves such transfer.Comment: 9 pages, 4 figures, 3 tables, accepted paper in the Thirty-Third AAAI
Conference on Artificial Intelligence (AAAI-19
- …