Automatic classification possibilities of the voices of children with dysphonia

Tulics, Miklós Gábriel; Vicsi, Klára

Automatic classification possibilities of the voices of children with dysphonia

Authors: Miklós Gábriel Tulics
Klára Vicsi
Publication date: 1 January 2018
Publisher: 'Infocommunications Journal'
Doi

Abstract

Dysphonia is a common complaint, almost every fourth child produces a pathological voice. A mobile based filtering system, that can be used by pre-school workers in order to recognize dysphonic voiced children in order to get professional help as soon as possible, would be desired. The goal of this research is to identify acoustic parameters that are able to distinguish healthy voices of children from those with dysphonia voices of children. In addition, the possibility of automatic classification is children. In addition, the possibility of automatic classification is examined. Two sample T-tests were used for statistical significance testing for the mean values of the acoustic parameters between healthy voices and those with dysphonia. A two-class classification was performed between the two groups using leave-one-out cross validation, with support vector machine (SVM) classifier. Formant frequencies, mel-frequency cepstral coefficients (MFCCs), Harmonics-to-Noise Ratio (HNR), Soft Phonation Index (SPI) and frequency band energy ratios, based on intrinsic mode functions measured on different variations of phonemes showed statistical difference between the groups. A high classification accuracy of 93% was achieved by SVM with linear and rbf kernel using only 8 acoustic parameters. Additional data is needed to build a more general model, but this research can be a reference point in the classification of voices using continuous speech between healthy children and children with dysphonia

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Repository of the Academy's Library

oai:real.mtak.hu:119611

Last time updated on 08/04/2021