Hierarchical RNN with Static Sentence-Level Attention for Text-Based Speaker Change Detection

Authors: Jin Zhi, Meng Zhao, Mou Lili
Publication date: 28/09/2018

Speaker change detection (SCD) is an important task in dialog modeling. Our paper addresses the problem of text-based SCD, which differs from existing audio-based studies and is useful in various scenarios, for example, processing dialog transcripts where speaker identities are missing (e.g., OpenSubtitle), and enhancing audio SCD with textual information. We formulate text-based SCD as a matching problem of utterances before and after a certain decision point; we propose a hierarchical recurrent neural network (RNN) with static sentence-level attention. Experimental results show that neural networks consistently achieve better performance than feature-based approaches, and that our attention-based model significantly outperforms non-attention neural networks.

Comment: In Proceedings of the ACM Conference on Information and Knowledge Management (CIKM), 201
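The matching formulation in this abstract can be illustrated with a small sketch. It is not the authors' model: it assumes each utterance has already been encoded as a fixed-size vector (the paper uses a hierarchical RNN for this), it treats "static" attention as scoring each utterance against one fixed query vector, and it uses cosine distance as a stand-in for the learned matching layer. The names `attend`, `speaker_change_score`, and `query` are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(utterances, query):
    """Attention-weighted sum of utterance vectors.

    Assumption: weights come from a dot product with one fixed query
    vector, which is only one possible reading of "static" attention.
    """
    scores = [sum(u_i * q_i for u_i, q_i in zip(u, query)) for u in utterances]
    weights = softmax(scores)
    dim = len(query)
    return [sum(w * u[d] for w, u in zip(weights, utterances)) for d in range(dim)]


def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def speaker_change_score(before, after, query):
    """Higher score means the two sides of the decision point match less."""
    return 1.0 - cosine(attend(before, query), attend(after, query))
```

Calling `speaker_change_score(before, after, query)` with lists of utterance vectors on each side of a candidate decision point gives a score that could be thresholded; in a trained model the query and the matching function would be learned rather than fixed.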

arXiv.org e-Print Archive

Crossref

An Affect-Rich Neural Conversational Model with Biased Attention and Weighted Cross-Entropy Loss

Authors: Miao Chunyan, Wang Di, Zhong Peixiang
Publication date: 01/01/2018

Affect conveys important implicit information in human communication. Having the capability to correctly express affect during human-machine conversations is one of the major milestones in artificial intelligence. In recent years, extensive research on open-domain neural conversational models has been conducted. However, embedding affect into such models is still underexplored. In this paper, we propose an end-to-end affect-rich open-domain neural conversational model that produces responses that are not only appropriate in syntax and semantics, but also rich in affect. Our model extends the Seq2Seq model and adopts VAD (Valence, Arousal and Dominance) affective notations to embed each word with affects. In addition, our model considers the effect of negators and intensifiers via a novel affective attention mechanism, which biases attention towards affect-rich words in input sentences. Lastly, we train our model with an affect-incorporated objective function to encourage the generation of affect-rich words in the output responses. Evaluations based on both perplexity and human judgments show that our model outperforms the state-of-the-art baseline model of comparable size in producing natural and affect-rich responses.

Comment: AAAI-1
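The idea of an affect-incorporated objective can be sketched as a per-token weighted cross-entropy. This is an illustration under assumptions, not the paper's loss: the neutral VAD point, the 0 to 1 scale, the weighting formula `1 + lam * strength`, and the names `affect_strength`, `weighted_cross_entropy`, and `lam` are all hypothetical.

```python
import math

# Hypothetical neutral point, assuming VAD values scaled to the range 0..1.
NEUTRAL_VAD = (0.5, 0.5, 0.5)


def affect_strength(vad):
    """Euclidean distance of a word's VAD vector from the neutral point."""
    return math.sqrt(sum((v - n) ** 2 for v, n in zip(vad, NEUTRAL_VAD)))


def weighted_cross_entropy(probs, target, vad_lexicon, lam=1.0):
    """Cross-entropy for one target word, scaled up for affect-rich words.

    probs:       dict mapping each vocabulary word to its predicted probability
    target:      the reference word at this position
    vad_lexicon: dict mapping words to (valence, arousal, dominance) tuples;
                 words missing from the lexicon are treated as neutral
    lam:         assumed hyperparameter controlling the extra weight
    """
    strength = affect_strength(vad_lexicon.get(target, NEUTRAL_VAD))
    weight = 1.0 + lam * strength
    return -weight * math.log(probs[target])
```

With this weighting, a neutral target word contributes the ordinary cross-entropy term, while a target word far from the neutral point contributes a larger term, so errors on affect-rich words are penalized more heavily during training.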

arXiv.org e-Print Archive

DR-NTU (Digital Repository of NTU)

Association for the Advancement of Artificial Intelligence: AAAI Publications