Search CORE

95,863 research outputs found

A Neural Network Approach to Context-Sensitive Generation of Conversational Responses

Author: Auli Michael
Brockett Chris
Dolan Bill
Galley Michel
Gao Jianfeng
Ji Yangfeng
Mitchell Margaret
Nie Jian-Yun
Sordoni Alessandro
Publication venue
Publication date: 01/01/2015
Field of study

We present a novel response generation system that can be trained end to end on large quantities of unstructured Twitter conversations. A neural network architecture is used to address sparsity issues that arise when integrating contextual information into classic statistical models, allowing the system to take into account previous dialog utterances. Our dynamic-context generative models show consistent gains over both context-sensitive and non-context-sensitive Machine Translation and Information Retrieval baselines.Comment: A. Sordoni, M. Galley, M. Auli, C. Brockett, Y. Ji, M. Mitchell, J.-Y. Nie, J. Gao, B. Dolan. 2015. A Neural Network Approach to Context-Sensitive Generation of Conversational Responses. In Proc. of NAACL-HLT. Pages 196-20

arXiv.org e-Print Archive

Crossref

Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation

Author: Bahdanau Dzmitry
Bengio Yoshua
Bougares Fethi
Cho Kyunghyun
Gulcehre Caglar
Schwenk Holger
van Merrienboer Bart
Publication venue
Publication date: 01/01/2014
Field of study

In this paper, we propose a novel neural network model called RNN Encoder-Decoder that consists of two recurrent neural networks (RNN). One RNN encodes a sequence of symbols into a fixed-length vector representation, and the other decodes the representation into another sequence of symbols. The encoder and decoder of the proposed model are jointly trained to maximize the conditional probability of a target sequence given a source sequence. The performance of a statistical machine translation system is empirically found to improve by using the conditional probabilities of phrase pairs computed by the RNN Encoder-Decoder as an additional feature in the existing log-linear model. Qualitatively, we show that the proposed model learns a semantically and syntactically meaningful representation of linguistic phrases.Comment: EMNLP 201

arXiv.org e-Print Archive

Crossref