Search CORE

2,227 research outputs found

Improving Search through A3C Reinforcement Learning based Conversational Agent

Author: EL Deci
G Shani
H Cuayhuitl
H Cuayáhuitl
J Wei
JS Bridle
RS Sutton
S Hochreiter
Publication venue
Publication date: 19/08/2018
Field of study

We develop a reinforcement learning based search assistant which can assist users through a set of actions and sequence of interactions to enable them realize their intent. Our approach caters to subjective search where the user is seeking digital assets such as images which is fundamentally different from the tasks which have objective and limited search modalities. Labeled conversational data is generally not available in such search tasks and training the agent through human interactions can be time consuming. We propose a stochastic virtual user which impersonates a real user and can be used to sample user behavior efficiently to train the agent which accelerates the bootstrapping of the agent. We develop A3C algorithm based context preserving architecture which enables the agent to provide contextual assistance to the user. We compare the A3C agent with Q-learning and evaluate its performance on average rewards and state values it obtains with the virtual user in validation episodes. Our experiments show that the agent learns to achieve higher rewards and better states.Comment: 17 pages, 7 figure

arXiv.org e-Print Archive

Crossref

A Frame Tracking Model for Memory-Enhanced Dialogue Systems

Author: Asri Layla El
Schulz Hannes
Sharma Shikhar
Zumer Jeremie
Publication venue
Publication date: 01/01/2017
Field of study

Recently, resources and tasks were proposed to go beyond state tracking in dialogue systems. An example is the frame tracking task, which requires recording multiple frames, one for each user goal set during the dialogue. This allows a user, for instance, to compare items corresponding to different goals. This paper proposes a model which takes as input the list of frames created so far during the dialogue, the current user utterance as well as the dialogue acts, slot types, and slot values associated with this utterance. The model then outputs the frame being referenced by each triple of dialogue act, slot type, and slot value. We show that on the recently published Frames dataset, this model significantly outperforms a previously proposed rule-based baseline. In addition, we propose an extensive analysis of the frame tracking task by dividing it into sub-tasks and assessing their difficulty with respect to our model

arXiv.org e-Print Archive

Crossref

Machine Learning Methods for Spoken Dialogue Simulation and Optimization

Author: Olivier Pietquin
Publication venue: 'IntechOpen'
Publication date: 01/01/2009
Field of study

Computers and electronic devices are becoming more and more present in our day-to-day life. This can of course be partly explained by their ability to ease the achievement of complex and boring tasks, the important decrease of prices or the new entertainment styles they offer. Yet, this real incursion in everybody's life would not have been possible without an important improvement of Human-Computer Interfaces (HCI). This is why HCI are now widely studied and become a major trend of research among the scientific community. Designing “user-friendly” interfaces usually requires multidisciplinary skills in fields such as computer science, ergonomics, psychology, signal processing etc. In this chapter, we argue that machine learning methods can help in designing efficient speech-based humancomputer interfaces

IntechOpen

HAL-CentraleSupelec

HAL-Rennes 1