Skip to main content
Article thumbnail
Location of Repository

By Reinforcement Learning


ABSTRACT This paper describes a method for automatic design of human-computer dialogue strategies by means of reinforcement learning, using a dialogue simulation tool to model the user behaviour andsystem recognition performance. To the authors ' knowledge this is the first application of a detailed simulation tool to this problem.The simulation tool is trained on a corpus of real user data. Compared to direct state transition modelling, it has the major advantagethat different state space representations can be studied without collecting more training data.We applied Q-learning with eligibility traces to obtain policies for a telephone-based cinema information system, comparing theeffect of different state space representations and evaluation functions. The policies outperformed handcrafted policies that operatedin the same restricted state space, and gave performance similar to the original design that had been through several iterations of man-ual refinement

Year: 2009
OAI identifier: oai:CiteSeerX.psu:
Provided by: CiteSeerX
Download PDF:
Sorry, we are unable to provide the full text but you may find it at the following location(s):
  • (external link)
  • (external link)
  • Suggested articles

    To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.