
    Evaluation of a hierarchical reinforcement learning spoken dialogue system

    We describe an evaluation of spoken dialogue strategies designed using hierarchical reinforcement learning agents. The dialogue strategies were learnt in a simulated environment and tested in a laboratory setting with 32 users. These dialogues were used to evaluate three types of machine dialogue behaviour: hand-coded, fully-learnt and semi-learnt. The experiments also served to evaluate the realism of the simulated dialogues using two proposed metrics contrasted with ‘Precision-Recall’. The learnt dialogue behaviours used the Semi-Markov Decision Process (SMDP) model, and we report the first evaluation of this model in a realistic conversational environment. Experimental results in the travel planning domain provide evidence to support the following claims: (a) hierarchical semi-learnt dialogue agents are a better alternative, with higher overall performance, than agents with deterministic or fully-learnt behaviour; (b) spoken dialogue strategies learnt with highly coherent user behaviour and conservative recognition error rates (keyword error rate of 20%) can outperform a reasonable hand-coded strategy; and (c) hierarchical reinforcement learning dialogue agents are feasible and promising for the (semi-)automatic design of optimized dialogue behaviours in larger-scale systems.
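    The abstract does not include code; as a rough, hypothetical sketch of how an SMDP-style hierarchical agent learns over temporally extended subdialogues, the following Python fragment applies SMDP Q-learning to an invented travel-planning toy. The subdialogue names, reward values, and simulated environment are illustrative assumptions, not details taken from the study.

        import random
        from collections import defaultdict

        GAMMA, ALPHA, EPSILON = 0.99, 0.1, 0.1
        OPTIONS = ["get_flight", "get_hotel", "confirm_booking"]  # invented subdialogues

        class ToySubdialogueSimulator:
            """Stand-in for a simulated environment: running a subdialogue
            takes several user turns and succeeds with some probability."""
            def run(self, option):
                turns = random.randint(1, 4)
                success = random.random() < 0.8
                reward = 10.0 if success else -1.0
                return reward, turns, success

        def smdp_q_learning(episodes=2000):
            Q = defaultdict(float)              # Q-values over (state, option)
            sim = ToySubdialogueSimulator()
            for _ in range(episodes):
                done_subdialogues = frozenset()
                for _ in range(20):             # safety cap on subdialogue invocations
                    remaining = [o for o in OPTIONS if o not in done_subdialogues]
                    if not remaining:
                        break
                    # epsilon-greedy choice among temporally extended options
                    if random.random() < EPSILON:
                        option = random.choice(remaining)
                    else:
                        option = max(remaining, key=lambda o: Q[(done_subdialogues, o)])
                    reward, duration, success = sim.run(option)
                    nxt = done_subdialogues | {option} if success else done_subdialogues
                    nxt_remaining = [o for o in OPTIONS if o not in nxt]
                    best_next = max((Q[(nxt, o)] for o in nxt_remaining), default=0.0)
                    # SMDP backup: discount by gamma raised to the option's duration
                    target = reward + (GAMMA ** duration) * best_next
                    Q[(done_subdialogues, option)] += ALPHA * (target - Q[(done_subdialogues, option)])
                    done_subdialogues = nxt
            return Q

        if __name__ == "__main__":
            Q = smdp_q_learning()
            for o in OPTIONS:
                print(f"value of starting with {o}: {Q[(frozenset(), o)]:.2f}")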

    Reinforcement Learning With Simulated User For Automatic Dialog Strategy Optimization

    In this paper, we propose a solution to the problem of formulating strategies for a spoken dialog system. Our approach uses reinforcement learning with a simulated user to identify an optimal dialog strategy. The method treats the Markov decision process as a framework for representing spoken dialog, in which states represent the dialog history and discourse context, actions are dialog acts, and transition strategies are decisions on which actions to take between states. We present our reinforcement learning architecture together with a novel objective function that is based on dialog quality rather than dialog duration.
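    As an illustration of the MDP framing described above (states as dialog context, actions as dialog acts), the sketch below runs tabular Q-learning against a toy simulated user and rewards the agent for the amount of correctly confirmed information when the dialog closes rather than for its length. The slot counts, dialog acts, and reward values are hypothetical and not taken from the paper.

        import random
        from collections import defaultdict

        ACTIONS = ["ask_slot", "confirm_slot", "close"]   # illustrative dialog acts
        N_SLOTS = 3
        GAMMA, ALPHA, EPSILON = 0.95, 0.1, 0.2

        def simulate_user(state, action):
            """Toy simulated user: returns the next state and whether the
            dialog ended. state = (slots_filled, slots_confirmed)."""
            filled, confirmed = state
            if action == "ask_slot" and filled < N_SLOTS:
                if random.random() < 0.8:        # user answers correctly most of the time
                    filled += 1
            elif action == "confirm_slot" and confirmed < filled:
                if random.random() < 0.9:
                    confirmed += 1
            done = action == "close"
            return (filled, confirmed), done

        def quality_reward(state, done):
            """Reward driven by dialog quality (confirmed information),
            not by dialog duration."""
            filled, confirmed = state
            if not done:
                return 0.0
            return 10.0 * confirmed - 5.0 * (filled - confirmed)

        def train(episodes=5000):
            Q = defaultdict(float)
            for _ in range(episodes):
                state, done = (0, 0), False
                for _ in range(20):               # safety cap on turns
                    if done:
                        break
                    a = (random.choice(ACTIONS) if random.random() < EPSILON
                         else max(ACTIONS, key=lambda x: Q[(state, x)]))
                    nxt, done = simulate_user(state, a)
                    r = quality_reward(nxt, done)
                    best_next = 0.0 if done else max(Q[(nxt, x)] for x in ACTIONS)
                    Q[(state, a)] += ALPHA * (r + GAMMA * best_next - Q[(state, a)])
                    state = nxt
            return Q

        if __name__ == "__main__":
            Q = train()
            print("Preferred act in the initial state:",
                  max(ACTIONS, key=lambda a: Q[((0, 0), a)]))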

    FLoReS: A Forward Looking, Reward Seeking, Dialogue Manager


    Mining Mixed-Initiative Dialogs

    Human-computer dialogs are an important vehicle for producing a rich and compelling form of human-computer interaction. We view the specification of a human-computer dialog as a set of sequences of progressive interactions between a user and a computer system. From these sets of sequences we mine embedded partially ordered sets, which correspond to mixing dialog initiative, a process we refer to as dialog mining; partially ordered sets can be exploited to reduce the control complexity of a dialog implementation. Our mining losslessly compresses the specification of a dialog. We describe our mining algorithm and report the results of a simulation-oriented evaluation. The algorithm is sound, and our results indicate that it can compress nearly all dialog specifications, some to a high degree. This work is part of broader research on the specification and implementation of mixed-initiative dialogs.
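    The authors' mining algorithm is not reproduced here; the following minimal sketch only illustrates the underlying idea of compressing a set of dialog sequences into a partially ordered set. A pair of steps is kept in the partial order only if it is ordered the same way in every observed sequence; pairs left unordered mark points where initiative can be mixed. The dialog steps and sequences are invented for illustration.

        def mine_partial_order(sequences):
            """Keep the pair (a, b) only if step a precedes step b in every
            sequence; remaining unordered pairs indicate mixed initiative."""
            items = set(sequences[0])
            order = {(a, b) for a in items for b in items if a != b}
            for seq in sequences:
                pos = {step: i for i, step in enumerate(seq)}
                order = {(a, b) for (a, b) in order if pos[a] < pos[b]}
            return order

        # Hypothetical flight-booking dialog: 'login' always comes first, but the
        # departure and destination cities can be supplied in either order.
        sequences = [
            ["login", "departure", "destination", "confirm"],
            ["login", "destination", "departure", "confirm"],
        ]

        poset = mine_partial_order(sequences)
        print("ordered pairs:", sorted(poset))
        unordered = {frozenset((a, b)) for a in sequences[0] for b in sequences[0]
                     if a != b and (a, b) not in poset and (b, a) not in poset}
        print("mixed-initiative (unordered) pairs:", [tuple(p) for p in unordered])

    In this toy example the two input sequences are exactly the linear extensions of the mined poset, so the poset is a more compact, lossless specification of the same dialog behaviour.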