
    e-SNLI: Natural Language Inference with Natural Language Explanations

    In order for machine learning to garner widespread public adoption, models must be able to provide interpretable and robust explanations for their decisions, as well as learn from human-provided explanations at train time. In this work, we extend the Stanford Natural Language Inference dataset with an additional layer of human-annotated natural language explanations of the entailment relations. We further implement models that incorporate these explanations into their training process and output them at test time. We show how our corpus of explanations, which we call e-SNLI, can be used for various goals, such as obtaining full sentence justifications of a model's decisions, improving universal sentence representations and transferring to out-of-domain NLI datasets. Our dataset thus opens up a range of research directions for using natural language explanations, both for improving models and for asserting their trust.
    Comment: NeurIPS 2018
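    The released corpus is also straightforward to inspect programmatically. Below is a minimal sketch that loads e-SNLI and prints a few premise/hypothesis pairs with their explanations; the Hugging Face dataset id "esnli" and its field names are assumptions about the public release, not details from the abstract.

```python
# Minimal sketch: inspecting e-SNLI pairs and their human explanations.
# Assumes the dataset is published on the Hugging Face Hub as "esnli"
# with premise/hypothesis/label/explanation_1 fields; adjust if the
# hosted schema differs.
from datasets import load_dataset

esnli = load_dataset("esnli", split="train")
labels = ["entailment", "neutral", "contradiction"]

for example in esnli.select(range(3)):
    print("Premise:    ", example["premise"])
    print("Hypothesis: ", example["hypothesis"])
    print("Label:      ", labels[example["label"]])
    print("Explanation:", example["explanation_1"])
    print()
```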

    A Recurrent Neural Model with Attention for the Recognition of Chinese Implicit Discourse Relations

    We introduce an attention-based Bi-LSTM for Chinese implicit discourse relations and demonstrate that modeling argument pairs as a joint sequence can outperform word order-agnostic approaches. Our model benefits from a partial sampling scheme and is conceptually simple, yet achieves state-of-the-art performance on the Chinese Discourse Treebank. We also visualize its attention activity to illustrate the model's ability to selectively focus on the relevant parts of an input sequence.
    Comment: To appear at ACL 2017; code available at https://github.com/sronnqvist/discourse-ablst
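    The architecture described above is compact enough to sketch directly. The following is a hedged PyTorch illustration of the core idea, encoding the two discourse arguments as one joint sequence with a Bi-LSTM and pooling with attention; layer sizes and names are illustrative, not the authors' exact configuration.

```python
# Hedged sketch of the paper's idea: treat the two discourse arguments
# as one joint sequence, encode with a Bi-LSTM, and pool with additive
# attention before classifying the implicit relation.
import torch
import torch.nn as nn

class AttentionBiLSTM(nn.Module):
    def __init__(self, vocab_size, embed_dim=300, hidden_dim=256, num_relations=4):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim, padding_idx=0)
        self.bilstm = nn.LSTM(embed_dim, hidden_dim, batch_first=True,
                              bidirectional=True)
        self.attn = nn.Linear(2 * hidden_dim, 1)   # attention scorer
        self.classifier = nn.Linear(2 * hidden_dim, num_relations)

    def forward(self, token_ids):
        # token_ids: (batch, seq_len) -- arg1 and arg2 concatenated
        states, _ = self.bilstm(self.embed(token_ids))      # (B, T, 2H)
        weights = torch.softmax(self.attn(states), dim=1)   # (B, T, 1)
        pooled = (weights * states).sum(dim=1)              # (B, 2H)
        return self.classifier(pooled), weights

model = AttentionBiLSTM(vocab_size=20000)
logits, attn = model(torch.randint(1, 20000, (8, 60)))
print(logits.shape)  # torch.Size([8, 4])
```

    Returning the attention weights alongside the logits makes the visualization the abstract mentions easy: each weight indicates how strongly the classifier attended to that token.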

    A Hybrid Siamese Neural Network for Natural Language Inference in Cyber-Physical Systems

    Cyber-Physical Systems (CPS), as multi-dimensional complex systems connecting the physical and cyber worlds, have a strong demand for processing large amounts of heterogeneous data. Among these tasks are Natural Language Inference (NLI) tasks over text from different sources, yet current research on natural language processing in CPS has not explored this direction. This study therefore proposes a Siamese network that combines stacked residual bidirectional Long Short-Term Memory (BiLSTM) with an attention mechanism and a capsule network, serving as the NLI module in CPS for inferring the relationship between texts from different sources. As the basic semantic-understanding module in CPS, the model is evaluated in detail on three major NLI benchmarks. Comparative experiments show that the proposed method achieves competitive performance, generalizes reasonably well, and balances accuracy against the number of trained parameters.
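    As a rough illustration of the siamese skeleton described above, the following PyTorch sketch shares a stacked residual BiLSTM encoder with attention pooling across both sentences; the capsule-network head from the paper is simplified here to a dense classifier, and all dimensions are assumptions.

```python
# Hedged sketch of the siamese skeleton: a shared stacked residual
# Bi-LSTM encoder with attention pooling applied to both sentences.
# The capsule-network head from the paper is simplified to a dense
# classifier here; all dimensions are illustrative.
import torch
import torch.nn as nn

class ResidualBiLSTMEncoder(nn.Module):
    def __init__(self, vocab_size, dim=256, layers=2):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, dim, padding_idx=0)
        # Each Bi-LSTM maps dim -> dim (dim//2 per direction) so a
        # residual connection can be added between stacked layers.
        self.lstms = nn.ModuleList(
            nn.LSTM(dim, dim // 2, batch_first=True, bidirectional=True)
            for _ in range(layers))
        self.attn = nn.Linear(dim, 1)

    def forward(self, ids):
        x = self.embed(ids)
        for lstm in self.lstms:
            out, _ = lstm(x)
            x = x + out                       # residual connection
        w = torch.softmax(self.attn(x), dim=1)
        return (w * x).sum(dim=1)             # attention-pooled vector

class SiameseNLI(nn.Module):
    def __init__(self, vocab_size, dim=256, num_labels=3):
        super().__init__()
        self.encoder = ResidualBiLSTMEncoder(vocab_size, dim)  # shared weights
        self.classifier = nn.Linear(4 * dim, num_labels)

    def forward(self, premise_ids, hypothesis_ids):
        p, h = self.encoder(premise_ids), self.encoder(hypothesis_ids)
        # Standard siamese feature combination: concat, difference, product
        features = torch.cat([p, h, torch.abs(p - h), p * h], dim=-1)
        return self.classifier(features)

model = SiameseNLI(vocab_size=20000)
print(model(torch.randint(1, 20000, (4, 30)),
            torch.randint(1, 20000, (4, 30))).shape)  # torch.Size([4, 3])
```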

    Bilateral Multi-Perspective Matching for Natural Language Sentences

    Natural language sentence matching is a fundamental technology for a variety of tasks. Previous approaches either match sentences from a single direction or only apply single granular (word-by-word or sentence-by-sentence) matching. In this work, we propose a bilateral multi-perspective matching (BiMPM) model under the "matching-aggregation" framework. Given two sentences P and Q, our model first encodes them with a BiLSTM encoder. Next, we match the two encoded sentences in two directions, P → Q and P ← Q. In each matching direction, each time step of one sentence is matched against all time steps of the other sentence from multiple perspectives. Then, another BiLSTM layer is utilized to aggregate the matching results into a fixed-length matching vector. Finally, based on the matching vector, the decision is made through a fully connected layer. We evaluate our model on three tasks: paraphrase identification, natural language inference and answer sentence selection. Experimental results on standard benchmark datasets show that our model achieves the state-of-the-art performance on all tasks.
    Comment: To appear in Proceedings of IJCAI 2017
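    The per-time-step matching operation is the heart of the model. The sketch below implements a multi-perspective cosine match in PyTorch, in which each perspective element-wise reweights the two hidden vectors with a learned vector before a cosine similarity is taken; shapes and hyperparameters are illustrative.

```python
# Sketch of BiMPM's core multi-perspective cosine match: each of l
# perspectives reweights the two hidden vectors element-wise with a
# learned vector before taking a cosine similarity. The full model
# applies this in both directions under several matching strategies.
import torch
import torch.nn as nn
import torch.nn.functional as F

class MultiPerspectiveCosine(nn.Module):
    def __init__(self, hidden_dim=100, num_perspectives=20):
        super().__init__()
        # One learnable weight vector per perspective: (l, d)
        self.W = nn.Parameter(torch.randn(num_perspectives, hidden_dim))

    def forward(self, v1, v2):
        # v1, v2: (batch, d) time-step vectors from the two sentences
        a = self.W.unsqueeze(0) * v1.unsqueeze(1)   # (B, l, d)
        b = self.W.unsqueeze(0) * v2.unsqueeze(1)   # (B, l, d)
        return F.cosine_similarity(a, b, dim=-1)    # (B, l) match vector

match = MultiPerspectiveCosine()
m = match(torch.randn(8, 100), torch.randn(8, 100))
print(m.shape)  # torch.Size([8, 20])
```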

    Neural networks for text matching

