Search CORE

32 research outputs found

Towards Zero-Shot Frame Semantic Parsing for Domain Scaling

Author: Bapna Ankur
Hakkani-Tur Dilek
Heck Larry
Tur Gokhan
Publication venue
Publication date: 07/07/2017
Field of study

State-of-the-art slot filling models for goal-oriented human/machine conversational language understanding systems rely on deep learning methods. While multi-task training of such models alleviates the need for large in-domain annotated datasets, bootstrapping a semantic parsing model for a new domain using only the semantic frame, such as the back-end API or knowledge graph schema, is still one of the holy grail tasks of language understanding for dialogue systems. This paper proposes a deep learning based approach that can utilize only the slot description in context without the need for any labeled or unlabeled in-domain examples, to quickly bootstrap a new domain. The main idea of this paper is to leverage the encoding of the slot names and descriptions within a multi-task deep learned slot filling model, to implicitly align slots across domains. The proposed approach is promising for solving the domain scaling problem and eliminating the need for any manually annotated data or explicit schema alignment. Furthermore, our experiments on multiple domains show that this approach results in significantly better slot-filling performance when compared to using only in-domain data, especially in the low data regime.Comment: 4 pages + 1 reference

arXiv.org e-Print Archive

Crossref

Sequential Dialogue Context Modeling for Spoken Language Understanding

Author: Bapna Ankur
Hakkani-Tur Dilek
Heck Larry
Tur Gokhan
Publication venue
Publication date: 01/01/2017
Field of study

Spoken Language Understanding (SLU) is a key component of goal oriented dialogue systems that would parse user utterances into semantic frame representations. Traditionally SLU does not utilize the dialogue history beyond the previous system turn and contextual ambiguities are resolved by the downstream components. In this paper, we explore novel approaches for modeling dialogue context in a recurrent neural network (RNN) based language understanding system. We propose the Sequential Dialogue Encoder Network, that allows encoding context from the dialogue history in chronological order. We compare the performance of our proposed architecture with two context models, one that uses just the previous turn context and another that encodes dialogue context in a memory network, but loses the order of utterances in the dialogue history. Experiments with a multi-domain dialogue dataset demonstrate that the proposed architecture results in reduced semantic frame error rates.Comment: 8 + 2 pages, Updated 10/17: Updated typos in abstract, Updated 07/07: Updated Title, abstract and few minor change

arXiv.org e-Print Archive

Crossref

A Neural Network Approach to Context-Sensitive Generation of Conversational Responses

Author: Auli Michael
Brockett Chris
Dolan Bill
Galley Michel
Gao Jianfeng
Ji Yangfeng
Mitchell Margaret
Nie Jian-Yun
Sordoni Alessandro
Publication venue
Publication date: 01/01/2015
Field of study

We present a novel response generation system that can be trained end to end on large quantities of unstructured Twitter conversations. A neural network architecture is used to address sparsity issues that arise when integrating contextual information into classic statistical models, allowing the system to take into account previous dialog utterances. Our dynamic-context generative models show consistent gains over both context-sensitive and non-context-sensitive Machine Translation and Information Retrieval baselines.Comment: A. Sordoni, M. Galley, M. Auli, C. Brockett, Y. Ji, M. Mitchell, J.-Y. Nie, J. Gao, B. Dolan. 2015. A Neural Network Approach to Context-Sensitive Generation of Conversational Responses. In Proc. of NAACL-HLT. Pages 196-20

arXiv.org e-Print Archive

Crossref

Recommended from our members

Towards end-to-end multi-domain dialogue modelling

Author: Budzianowski PF
Casanueva Inigo
Gasic M
Tseng B-H
Publication venue
Publication date: 01/10/2018
Field of study

This work was funded by a Google Faculty Re- search Award (RG91111), an EPSRC studentship (RG80792), an EPSRC grant (EP/M018946/1) and by Toshiba Research Europe Ltd, Cam- bridge Research Laboratory (RG85875

Apollo (Cambridge)

Recommended from our members

A WOz Variant with Contrastive Conditions

Author: Levin Esther
Passonneau Rebecca
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2006
Field of study

We present a variant of the WOz paradigm we refer to as incremental ablation. The new feature involves incrementally restricting the human wizard’s capacities in the direction of a dialog system. We lay out a data collection design with six conditions of user-system and user-wizard interactions that allows us to more precisely identify how to close the communication gap between humans and systems. We describe the application of the method to analysis of contexts in which ASR errors occur, giving us a means to investigate the problem solving strategies humans would resort to if their communication channel were restricted to be more like the machine’s. We describe how we can use the methodology to collect data that is more relevant to a particular learning paradigm involving Markov Decision Processes (MDP)

Columbia University Academic Commons