Automatic assessment of dysarthric speech is essential for sustained
treatments and rehabilitation. However, obtaining atypical speech is
challenging, often leading to data scarcity issues. To tackle the problem, we
propose a novel automatic severity assessment method for dysarthric speech,
using a self-supervised model in conjunction with multi-task learning.
Wav2vec 2.0 XLS-R is jointly trained for two different tasks: severity level
classification and auxiliary automatic speech recognition (ASR). For the
baseline experiments, we employ hand-crafted features such as eGeMAPS and
linguistic features, together with SVM, MLP, and XGBoost classifiers. Evaluated on the
Korean dysarthric speech QoLT database, our model outperforms the traditional
baseline methods, with a 4.79% relative improvement in classification
accuracy. In addition, the proposed model surpasses the model trained without
the ASR head, achieving a 10.09% relative improvement.
Furthermore, we show how multi-task learning affects severity classification
performance by analyzing the latent representations and the regularization
effect.
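
As a rough illustration of the joint training setup described above, the following is a minimal PyTorch sketch, not the authors' implementation: a shared XLS-R encoder feeds a mean-pooled severity classification head and a frame-level CTC head for the auxiliary ASR task. The checkpoint name, the number of severity levels, the vocabulary size, and the loss weight `lam` are all assumptions for illustration.

```python
import torch
import torch.nn as nn
from transformers import Wav2Vec2Model

class MultiTaskSeverityModel(nn.Module):
    """Shared XLS-R encoder with a severity head and an auxiliary ASR (CTC) head."""

    def __init__(self, num_severity_levels=5, vocab_size=70):  # hypothetical sizes
        super().__init__()
        # Pretrained wav2vec 2.0 XLS-R backbone (checkpoint name is an assumption)
        self.encoder = Wav2Vec2Model.from_pretrained("facebook/wav2vec2-xls-r-300m")
        hidden = self.encoder.config.hidden_size
        self.severity_head = nn.Linear(hidden, num_severity_levels)  # utterance-level
        self.asr_head = nn.Linear(hidden, vocab_size)                # frame-level CTC logits

    def forward(self, input_values):
        # (batch, frames, hidden) contextual representations from the shared encoder
        hidden_states = self.encoder(input_values).last_hidden_state
        severity_logits = self.severity_head(hidden_states.mean(dim=1))  # mean pooling
        ctc_logits = self.asr_head(hidden_states)
        return severity_logits, ctc_logits

# Joint objective: cross-entropy for severity plus a weighted CTC term for ASR.
# `input_lengths` are encoder output frame counts; `lam` is a hypothetical weight.
def joint_loss(severity_logits, ctc_logits, labels, transcripts,
               input_lengths, target_lengths, lam=0.1):
    ce = nn.functional.cross_entropy(severity_logits, labels)
    log_probs = ctc_logits.log_softmax(dim=-1).transpose(0, 1)  # (frames, batch, vocab)
    ctc = nn.functional.ctc_loss(log_probs, transcripts, input_lengths, target_lengths)
    return ce + lam * ctc
```

Because both heads share the same encoder, the ASR objective constrains the learned representations, which is consistent with the regularization effect analyzed above.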