Hello, It’s GPT-2 - How can I help you? Towards the use of pretrained language models for task-oriented dialogue systems

Budzianowski, P; Vulić, I

Hello, It’s GPT-2 - How can I help you? Towards the use of pretrained language models for task-oriented dialogue systems

Authors: P Budzianowski
I Vulić
Publication date: 1 January 2019
Publisher: EMNLP-IJCNLP 2019 - Proceedings of the 3rd Workshop on Neural Generation and Translation
Doi

Abstract

Data scarcity is a long-standing and crucial challenge that hinders quick development of task-oriented dialogue systems across multiple domains: task-oriented dialogue models are expected to learn grammar, syntax, dialogue reasoning, decision making, and language generation from absurdly small amounts of task-specific data. In this paper, we demonstrate that recent progress in language modeling pre-training and transfer learning shows promise to overcome this problem. We propose a task-oriented dialogue model that operates solely on text input: it effectively bypasses explicit policy and language generation modules. Building on top of the TransferTransfo framework and generative model pre-training, we validate the approach on complex multi-domain task-oriented dialogues from the MultiWOZ dataset. Our automatic and human evaluations show that the proposed model is on par with a strong task-specific neural baseline. In the long run, our approach holds promise to mitigate the data scarcity problem, and to support the construction of more engaging and more eloquent task-oriented conversational agents

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Sustaining member

Apollo (Cambridge)

oai:www.repository.cam.ac.uk:1...

Last time updated on 24/12/2019

Crossref

Last time updated on 10/08/2021