Search CORE

27,800 research outputs found

Towards Scalable Multi-domain Conversational Agents: The Schema-Guided Dialogue Dataset

Author: Gupta Raghav
Khaitan Pranav
Rastogi Abhinav
Sunkara Srinivas
Zang Xiaoxue
Publication venue
Publication date: 29/01/2020
Field of study

Virtual assistants such as Google Assistant, Alexa and Siri provide a conversational interface to a large number of services and APIs spanning multiple domains. Such systems need to support an ever-increasing number of services with possibly overlapping functionality. Furthermore, some of these services have little to no training data available. Existing public datasets for task-oriented dialogue do not sufficiently capture these challenges since they cover few domains and assume a single static ontology per domain. In this work, we introduce the the Schema-Guided Dialogue (SGD) dataset, containing over 16k multi-domain conversations spanning 16 domains. Our dataset exceeds the existing task-oriented dialogue corpora in scale, while also highlighting the challenges associated with building large-scale virtual assistants. It provides a challenging testbed for a number of tasks including language understanding, slot filling, dialogue state tracking and response generation. Along the same lines, we present a schema-guided paradigm for task-oriented dialogue, in which predictions are made over a dynamic set of intents and slots, provided as input, using their natural language descriptions. This allows a single dialogue system to easily support a large number of services and facilitates simple integration of new services without requiring additional training data. Building upon the proposed paradigm, we release a model for dialogue state tracking capable of zero-shot generalization to new APIs, while remaining competitive in the regular setting.Comment: To appear at AAAI 202

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Maps, agents and dialogue for exploring a virtual world

Author: Dijk E.M.A.G. van
Nijholt A.
Zwiers J.
Publication venue: International Institute of Informatics and Systemics (IIIS)
Publication date: 01/01/2001
Field of study

In previous years we have been involved in several projects in which users (or visitors) had to find their way in information-rich virtual environments. 'Information-rich' means that the users do not know beforehand what is available in the environment, where to go in the environment to find the information and, moreover, users or visitors do not necessarily know exactly what they are looking for. Information-rich means also that the information may change during time. A second visit to the same environment will require different behavior of the visitor in order for him or her to obtain similar information than was available during a previous visit. In this paper we report about two projects and discuss our attempts to generalize from the different approaches and application domains to obtain a library of methods and tools to design and implement intelligent agents that inhabit virtual environments and where the agents support the navigation of the user/visitor

CiteSeerX

University of Twente Research Information

Reference Resolution in Multi-modal Interaction: Position paper

Author: Nijholt A.
Publication venue: EU IST RTD Roadmap
Publication date: 01/01/2002
Field of study

In this position paper we present our research on multimodal interaction in and with virtual environments. The aim of this presentation is to emphasize the necessity to spend more research on reference resolution in multimodal contexts. In multi-modal interaction the human conversational partner can apply more than one modality in conveying his or her message to the environment in which a computer detects and interprets signals from different modalities. We show some naturally arising problems and how they are treated for different contexts. No generally applicable solutions are given

University of Twente Research Information

Reference resolution in multi-modal interaction: Preliminary observations

Author: Nijholt A.
Publication venue: Universidad de Pinar del Rio "Hermanos Saiz Montes de Oca"
Publication date: 01/01/2002
Field of study

In this paper we present our research on multimodal interaction in and with virtual environments. The aim of this presentation is to emphasize the necessity to spend more research on reference resolution in multimodal contexts. In multi-modal interaction the human conversational partner can apply more than one modality in conveying his or her message to the environment in which a computer detects and interprets signals from different modalities. We show some naturally arising problems but do not give general solutions. Rather we decide to perform more detailed research on reference resolution in uni-modal contexts to obtain methods generalizable to multi-modal contexts. Since we try to build applications for a Dutch audience and since hardly any research has been done on reference resolution for Dutch, we give results on the resolution of anaphoric and deictic references in Dutch texts. We hope to be able to extend these results to our multimodal contexts later

University of Twente Research Information

Evorus: A Crowd-powered Conversational Assistant Built to Automate Itself Over Time

Author: Banchs Rafael E
Carpenter Rollo
Constine Josh
Gasic Milica
Han Bo
Hempel Jessi
Huang Ting-Hao K.
Kamar Ece
Kamar Ece
Kenneth Huang Ting-Hao
Kenneth Huang Ting-Hao
Li Jiwei
Li Xuijun
Lynley Matthew
Newton Casey
Pennington Jeffrey
Sadun Erica
Sarma Akash Das
Ting-Hao
Wen Tsung-Hsien
Wikipedia Cleverbot
Wikipedia Tay
Witten Ian H
Zhao Tiancheng
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 09/01/2018
Field of study

Crowd-powered conversational assistants have been shown to be more robust than automated systems, but do so at the cost of higher response latency and monetary costs. A promising direction is to combine the two approaches for high quality, low latency, and low cost solutions. In this paper, we introduce Evorus, a crowd-powered conversational assistant built to automate itself over time by (i) allowing new chatbots to be easily integrated to automate more scenarios, (ii) reusing prior crowd answers, and (iii) learning to automatically approve response candidates. Our 5-month-long deployment with 80 participants and 281 conversations shows that Evorus can automate itself without compromising conversation quality. Crowd-AI architectures have long been proposed as a way to reduce cost and latency for crowd-powered systems; Evorus demonstrates how automation can be introduced successfully in a deployed system. Its architecture allows future researchers to make further innovation on the underlying automated components in the context of a deployed open domain dialog system.Comment: 10 pages. To appear in the Proceedings of the Conference on Human Factors in Computing Systems 2018 (CHI'18

arXiv.org e-Print Archive

Crossref

An End-to-End Conversational Style Matching Agent

Author: Bartneck Christoph
Bickmore Timothy
DeVault David
Elofson Greg
Gratch Jonathan
Hirschberg Julia
Pecune Florian
S
Tannen Deborah
Thomas Paul
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 13/08/2019
Field of study

We present an end-to-end voice-based conversational agent that is able to engage in naturalistic multi-turn dialogue and align with the interlocutor's conversational style. The system uses a series of deep neural network components for speech recognition, dialogue generation, prosodic analysis and speech synthesis to generate language and prosodic expression with qualities that match those of the user. We conducted a user study (N=30) in which participants talked with the agent for 15 to 20 minutes, resulting in over 8 hours of natural interaction data. Users with high consideration conversational styles reported the agent to be more trustworthy when it matched their conversational style. Whereas, users with high involvement conversational styles were indifferent. Finally, we provide design guidelines for multi-turn dialogue interactions using conversational style adaptation

arXiv.org e-Print Archive

Crossref

Improving Search through A3C Reinforcement Learning based Conversational Agent

Author: EL Deci
G Shani
H Cuayhuitl
H Cuayáhuitl
J Wei
JS Bridle
RS Sutton
S Hochreiter
Publication venue
Publication date: 19/08/2018
Field of study

We develop a reinforcement learning based search assistant which can assist users through a set of actions and sequence of interactions to enable them realize their intent. Our approach caters to subjective search where the user is seeking digital assets such as images which is fundamentally different from the tasks which have objective and limited search modalities. Labeled conversational data is generally not available in such search tasks and training the agent through human interactions can be time consuming. We propose a stochastic virtual user which impersonates a real user and can be used to sample user behavior efficiently to train the agent which accelerates the bootstrapping of the agent. We develop A3C algorithm based context preserving architecture which enables the agent to provide contextual assistance to the user. We compare the A3C agent with Q-learning and evaluate its performance on average rewards and state values it obtains with the virtual user in validation episodes. Our experiments show that the agent learns to achieve higher rewards and better states.Comment: 17 pages, 7 figure

arXiv.org e-Print Archive

Crossref

FANTASIA: a framework for advanced natural tools and applications in social, interactive approaches

Author: Cosi Piero
Cutugno Francesco
Origlia Antonio
Rodà Antonio
Zmarich Claudio
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

Archivio della ricerca - Università degli studi di Napoli Federico II