User-Adaptive A Posteriori Restoration for Incorrectly Segmented Utterances in Spoken Dialogue Systems

Hotta, Naoki; Komatani, Kazunori; Nakano, Mikio; Sato, Satoshi

Authors: Naoki Hotta, Kazunori Komatani, Mikio Nakano, Satoshi Sato
Publication date: 15 December 2017
Publisher: University of Illinois at Chicago Library

Abstract

Ideally, users of spoken dialogue systems should be able to speak at their own tempo. Such systems therefore need to interpret utterances from various users correctly, even when the utterances contain pauses. To address this issue, we propose an approach based on a posteriori restoration of incorrectly segmented utterances. A crucial part of this approach is determining whether restoration is required. We use a classification-based approach adapted to each user. We focus on each user’s dialogue tempo, which can be obtained during the dialogue, and determine the correlation between each user’s tempo and the appropriate thresholds for classification. We also derive a linear regression function that converts tempos into thresholds. Experimental results show that the proposed user adaptation approach, applied to two restoration classification methods (thresholding and decision trees), improves classification accuracy by 3.0% and 7.4%, respectively, in cross-validation.
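The threshold adaptation described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how tempo is measured, which feature is thresholded, or which direction the comparison runs. The sketch assumes that tempo is a single number per user, that the thresholded feature is the silent gap (in seconds) between two recognized segments, and that a gap shorter than the user's threshold means the two segments should be restored into one utterance. The function names `fit_line`, `user_threshold`, and `needs_restoration`, and all numbers, are hypothetical.

```python
from typing import List, Tuple


def fit_line(tempos: List[float], thresholds: List[float]) -> Tuple[float, float]:
    """Ordinary least-squares fit of threshold = slope * tempo + intercept."""
    n = len(tempos)
    if n < 2 or n != len(thresholds):
        raise ValueError("need at least two paired observations")
    mean_x = sum(tempos) / n
    mean_y = sum(thresholds) / n
    sxx = sum((x - mean_x) ** 2 for x in tempos)
    if sxx == 0:
        raise ValueError("tempos must not all be identical")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(tempos, thresholds))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def user_threshold(tempo: float, slope: float, intercept: float) -> float:
    """Convert one user's tempo into that user's classification threshold."""
    return slope * tempo + intercept


def needs_restoration(gap_seconds: float, threshold: float) -> bool:
    """Assumed rule (not from the source): a gap shorter than the threshold
    is treated as a pause inside one utterance, so restoration is required."""
    return gap_seconds < threshold


# Hypothetical training pairs: (tempo of a user, best threshold for that user).
slope, intercept = fit_line([1.0, 2.0, 3.0], [0.5, 1.0, 1.5])

# Adapt the threshold to a new user whose tempo was observed during the dialogue.
threshold = user_threshold(2.0, slope, intercept)
decision = needs_restoration(0.6, threshold)
```

In this sketch a user with a slower tempo gets a larger threshold, so longer pauses are still treated as belonging to one utterance. Whether the real relationship has this sign is an assumption here, not something the abstract states.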

Available Versions

University of Illinois at Chicago: Journals@UIC

oai:journals.uic.edu:article/1...

Last time updated on 29/03/2023

Dialogue & Discourse (E-Journal - Universität Bielefeld)

oai:dad.uni-bielefeld.de:artic...

Last time updated on 17/10/2019