221 research outputs found

Resolving Lexical Ambiguity in Tensor Regression Models of Meaning

Authors: Kalchbrenner Nal; Kartsaklis Dimitri; Sadrzadeh Mehrnoosh
Publication date: 01/01/2014

This paper provides a method for improving tensor-based compositional distributional models of meaning by adding an explicit disambiguation step prior to composition. In contrast with previous research, where this hypothesis has been successfully tested against relatively simple compositional models, our work uses a robust model trained with linear regression. The results of two experiments show the superiority of the prior disambiguation method and suggest that the effectiveness of this approach is model-independent.

arXiv.org e-Print Archive

CiteSeerX

Oxford University Research Archive

A Convolutional Neural Network for Modelling Sentences

Authors: Blunsom Phil; Grefenstette Edward; Kalchbrenner Nal
Publication date: 01/01/2014

The ability to accurately represent sentences is central to language understanding. We describe a convolutional architecture dubbed the Dynamic Convolutional Neural Network (DCNN) that we adopt for the semantic modelling of sentences. The network uses Dynamic k-Max Pooling, a global pooling operation over linear sequences. The network handles input sentences of varying length and induces a feature graph over the sentence that is capable of explicitly capturing short and long-range relations. The network does not rely on a parse tree and is easily applicable to any language. We test the DCNN in four experiments: small-scale binary and multi-class sentiment prediction, six-way question classification, and Twitter sentiment prediction by distant supervision. The network achieves excellent performance in the first three tasks and a greater than 25% error reduction in the last task with respect to the strongest baseline.
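The k-max pooling operation named in this abstract can be illustrated with a minimal sketch. It assumes that pooling keeps the k largest activations of a sequence in their original order, and that k shrinks with network depth down to a fixed floor. The function names, the `k_top` parameter, and the exact schedule in `dynamic_k` are assumptions made for illustration and should be checked against the paper itself.

```python
def k_max_pooling(values, k):
    """Return the k largest values of a sequence, kept in their original order."""
    if k >= len(values):
        return list(values)
    # Indices of the k largest values; ties go to the earlier position.
    top = sorted(range(len(values)), key=lambda i: (-values[i], i))[:k]
    # Re-sort the selected indices so the output preserves sequence order.
    return [values[i] for i in sorted(top)]


def dynamic_k(layer, total_layers, sentence_length, k_top):
    """Assumed schedule: k decreases linearly with depth, never below k_top."""
    remaining = (total_layers - layer) * sentence_length
    # Integer ceiling division avoids floating-point rounding surprises.
    scaled = (remaining + total_layers - 1) // total_layers
    return max(k_top, scaled)


if __name__ == "__main__":
    print(k_max_pooling([1, 5, 2, 8, 3], 3))
    print([dynamic_k(layer, 3, 18, 3) for layer in (1, 2, 3)])
```

Because the selected values keep their relative order, later layers can still see which feature came first, which a plain max over the sequence would discard.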

arXiv.org e-Print Archive

CiteSeerX

Crossref

Oxford University Research Archive

Modelling, Visualising and Summarising Documents with a Single Convolutional Neural Network

Authors: Blunsom Phil; de Freitas Nando; Demiraj Alban; Denil Misha; Kalchbrenner Nal
Publication date: 01/01/2014

Capturing the compositional process which maps the meaning of words to that of documents is a central challenge for researchers in Natural Language Processing and Information Retrieval. We introduce a model that is able to represent the meaning of documents by embedding them in a low-dimensional vector space, while preserving distinctions of word and sentence order crucial for capturing nuanced semantics. Our model is based on an extended Dynamic Convolution Neural Network, which learns convolution filters at both the sentence and document level, hierarchically learning to capture and compose low-level lexical features into high-level semantic concepts. We demonstrate the effectiveness of this model on a range of document modelling tasks, achieving strong results with no feature engineering and with a more compact model. Inspired by recent advances in visualising deep convolution networks for computer vision, we present a novel visualisation technique for our document networks which not only provides insight into their learning process, but can also be interpreted to produce a compelling automatic summarisation system for texts.

arXiv.org e-Print Archive

CiteSeerX

Oxford University Research Archive

Dialogue Act Recognition via CRF-Attentive Structured Network

Authors: Ang Jeremy; Chen Yun-Nung; Chen Zheqian; Geertzen Jeroen; Kalchbrenner Nal; Lee Ji Young; Pan Boyuan
Publication date: 15/11/2017

Dialogue Act Recognition (DAR) is a challenging problem in dialogue interpretation, which aims to attach semantic labels to utterances and characterize the speaker's intention. Many existing approaches formulate DAR as a problem ranging from multi-class classification to structured prediction, and these suffer from handcrafted feature extensions and attentive contextual structural dependencies. In this paper, we consider the problem of DAR from the viewpoint of extending richer Conditional Random Field (CRF) structural dependencies without abandoning end-to-end training. We incorporate hierarchical semantic inference with a memory mechanism into utterance modeling. We then extend the structured attention network to the linear-chain conditional random field layer, which takes into account both contextual utterances and their corresponding dialogue acts. Extensive experiments on two major benchmark datasets, Switchboard Dialogue Act (SWDA) and Meeting Recorder Dialogue Act (MRDA), show that our method achieves better performance than other state-of-the-art solutions to the problem. Notably, our method comes within 2% of the human annotator's performance on SWDA.

Comment: 10 pages, 4 figures
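The linear-chain CRF layer mentioned in this abstract assigns one label per utterance while scoring transitions between adjacent labels. As a rough illustration of that idea only, the sketch below shows standard Viterbi decoding over per-utterance label scores and a label-to-label transition table. It is not the paper's model: the attention and memory components are omitted, and the function name, argument layout, and toy scores are assumptions.

```python
def viterbi_decode(emissions, transitions):
    """Best label sequence under a linear-chain scoring model.

    emissions[t][y]   : score of label y for utterance t
    transitions[p][y] : score of label p being followed by label y
    """
    num_labels = len(transitions)
    scores = list(emissions[0])   # best score of any path ending in each label
    backpointers = []             # best previous label, per step and label
    for t in range(1, len(emissions)):
        new_scores, pointers = [], []
        for y in range(num_labels):
            best_prev = max(range(num_labels),
                            key=lambda p: scores[p] + transitions[p][y])
            pointers.append(best_prev)
            new_scores.append(scores[best_prev] + transitions[best_prev][y]
                              + emissions[t][y])
        scores = new_scores
        backpointers.append(pointers)
    # Follow the backpointers from the best final label to recover the path.
    path = [max(range(num_labels), key=lambda y: scores[y])]
    for pointers in reversed(backpointers):
        path.append(pointers[path[-1]])
    path.reverse()
    return path


if __name__ == "__main__":
    toy_emissions = [[2, 0], [0, 1], [0, 3]]     # 3 utterances, 2 labels
    mild_penalty = [[0, -1], [-1, 0]]            # small cost for switching labels
    strong_penalty = [[0, -4], [-4, 0]]          # large cost for switching labels
    print(viterbi_decode(toy_emissions, mild_penalty))
    print(viterbi_decode(toy_emissions, strong_penalty))
```

In the toy example, raising the switching penalty changes the decoded label of the first utterance, which is the sense in which a chain-structured layer uses neighbouring dialogue acts rather than classifying each utterance in isolation.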

arXiv.org e-Print Archive

Crossref