10 research outputs found

    Neuromechanical Modelling of Articulatory Movements from Surface Electromyography and Speech Formants

    Get PDF
    Speech articulation is produced by the movements of muscles in the larynx, pharynx, mouth and face. Therefore speech shows acoustic features as formants which are directly related with neuromotor actions of these muscles. The first two formants are strongly related with jaw and tongue muscular activity. Speech can be used as a simple and ubiquitous signal, easy to record and process, either locally or on e-Health platforms. This fact may open a wide set of applications in the study of functional grading and monitoring neurodegenerative diseases. A relevant question, in this sense, is how far speech correlates and neuromotor actions are related. This preliminary study is intended to find answers to this question by using surface electromyographic recordings on the masseter and the acoustic kinematics related with the first formant. It is shown in the study that relevant correlations can be found among the surface electromyographic activity (dynamic muscle behavior) and the positions and first derivatives of the first formant (kinematic variables related to vertical velocity and acceleration of the joint jaw and tongue biomechanical system). As an application example, it is shown that the probability density function associated to these kinematic variables is more sensitive than classical features as Vowel Space Area (VSA) or Formant Centralization Ratio (FCR) in characterizing neuromotor degeneration in Parkinson's Disease.This work is being funded by Grants TEC2016-77791-C4-4-R from the Ministry of Economic Affairs and Competitiveness of Spain, Teka-Park 55 02 CENIE-0348_CIE_6_E POCTEP (InterReg Programme) and 16-30805A, SIX Research Center (CZ.1.05/2.1.00/03.0072), and LO1401 from the Czech Republic Government

    Addenbrooke’s Cognitive Examination and Individual Domain Cut-Off Scores for Discriminating between Different Cognitive Subtypes of Parkinson’s Disease

    No full text
    Objective. The main aim of this study was to verify the sensitivity and specificity of Addenbrooke’s Cognitive Examination-Revised (ACE-R) in discriminating between Parkinson’s disease (PD) with normal cognition (PD-NC) and PD with mild cognitive impairment (PD-MCI) and between PD-MCI and PD with dementia (PD-D). We also evaluated how ACE-R correlates with neuropsychological cognitive tests in PD. Methods. We examined three age-matched groups of PD patients diagnosed according to the Movement Disorder Society Task Force criteria: PD-NC, PD-MCI, and PD-D. ROC analysis was used to establish specific cut-off scores of ACE-R and its domains. Correlation analyses were performed between ACE-R and its subtests with relevant neuropsychological tests. Results. Statistically significant differences between groups were demonstrated in global ACE-R scores and subscores, except in the language domain. ACE-R cut-off score of 88.5 points discriminated best between PD-MCI and PD-NC (sensitivity 0.68, specificity 0.91); ACE-R of 82.5 points distinguished best between PD-MCI and PD-D (sensitivity 0.70, specificity 0.73). The verbal fluency domain of ACE-R demonstrated the best discrimination between PD-NC and PD-MCI (cut-off score 11.5; sensitivity 0.70, specificity 0.73) while the orientation/attention subscore was best between PD-MCI and PD-D (cut-off score 15.5; sensitivity 0.90, specificity 0.97). ACE-R scores except for ACE-R language correlated with specific cognitive tests of interest

    Parkinson Disease Detection from Speech Articulation Neuromechanics

    Get PDF
    Aim: The research described is intended to give a description of articulation dynamics as a correlate of the kinematic behavior of the jaw-tongue biomechanical system, encoded as a probability distribution of an absolute joint velocity. This distribution may be used in detecting and grading speech from patients affected by neurodegenerative illnesses, as Parkinson Disease. Hypothesis: The work hypothesis is that the probability density function of the absolute joint velocity includes information on the stability of phonation when applied to sustained vowels, as well as on fluency if applied to connected speech. Methods: A dataset of sustained vowels recorded from Parkinson Disease patients is contrasted with similar recordings from normative subjects. The probability distribution of the absolute kinematic velocity of the jaw-tongue system is extracted from each utterance. A Random Least Squares Feed-Forward Network (RLSFN) has been used as a binary classifier working on the pathological and normative datasets in a leave-one-out strategy. Monte Carlo simulations have been conducted to estimate the influence of the stochastic nature of the classifier. Two datasets for each gender were tested (males and females) including 26 normative and 53 pathological subjects in the male set, and 25 normative and 38 pathological in the female set. Results: Male and female data subsets were tested in single runs, yielding equal error rates under 0.6% (Accuracy over 99.4%). Due to the stochastic nature of each experiment, Monte Carlo runs were conducted to test the reliability of the methodology. The average detection results after 200 Montecarlo runs of a 200 hyperplane hidden layer RLSFN are given in terms of Sensitivity (males: 0.9946, females: 0.9942), Specificity (males: 0.9944, females: 0.9941) and Accuracy (males: 0.9945, females: 0.9942). The area under the ROC curve is 0.9947 (males) and 0.9945 (females). The equal error rate is 0.0054 (males) and 0.0057 (females). Conclusions: The proposed methodology avails that the use of highly normalized descriptors as the probability distribution of kinematic variables of vowel articulation stability, which has some interesting properties in terms of information theory, boosts the potential of simple yet powerful classifiers in producing quite acceptable detection results in Parkinson Disease

    Vowel articulation dynamic stability related to Parkinson's disease rating features: male dataset

    Full text link
    Neurodegenerative pathologies as Parkinson's Disease (PD) show important distortions in speech, affecting fluency, prosody, articulation and phonation. Classically, measurements based on articulation gestures altering formant positions, as the Vocal Space Area (VSA) or the Formant Centralization Ratio (FCR) have been proposed to measure speech distortion, but these markers are based mainly on static positions of sustained vowels. The present study introduces a measurement based on the mutual information distance among probability density functions of kinematic correlates derived from formant dynamics. An absolute kinematic velocity associated to the position of the jaw and tongue articulation gestures is estimated and modeled statistically. The distribution of this feature may differentiate PD patients from normative speakers during sustained vowel emission. The study is based on a limited database of 53 male PD patients, contrasted to a very selected and stable set of eight normative speakers. In this sense, distances based on Kullback-Leibler divergence seem to be sensitive to PD articulation instability. Correlation studies show statistically relevant relationship between information contents based on articulation instability to certain motor and nonmotor clinical scores, such as freezing of gait, or sleep disorders. Remarkably, one of the statistically relevant correlations point out to the time interval passed since the first diagnostic. These results stress the need of defining scoring scales specifically designed for speech disability estimation and monitoring methodologies in degenerative diseases of neuromotor origin

    Robust and Complex Approach of Pathological Speech Signal Analysis

    No full text
    This article presents a~study of the approaches in the state-of-the-art in the field of pathological speech signal analysis with a~special focus on parametrization techniques. It provides a~description of 92 speech features where some of them are already widely used in this field of science and some of them have not been tried yet (they come from different areas of speech signal processing like speech recognition or coding). As an original contribution, this work introduces 36 completely new pathological voice measures based on modulation spectra, inferior colliculus coefficients, bicepstrum, sample and approximate entropy and empirical mode decomposition. The significance of these features was tested on 3 (English, Spanish and Czech) pathological voice databases with respect to classification accuracy, sensitivity and specificity. To our best knowledge the introduced approach based on complex feature extraction and robust testing outperformed all works that have been published already in this field. The results (accuracy, sensitivity and specificity equal to 100.0±0.0%100.0\pm0.0\,\%) are discussable in the case of Massachusetts Eye and Ear Infirmary (MEEI) database because of its limitation related to a~length of sustained vowels, however in the case of Pr{\'i}ncipe de Asturias (PdA) Hospital in Alcal{\'a} de Henares of Madrid database we made improvements in classification accuracy (82.1±3.3%82.1\pm3.3\,\%) and specificity (83.8±5.1%83.8\pm5.1\,\%) when considering a~single-classifier approach. Hopefully, large improvements may be achieved in the case of Czech Parkinsonian Speech Database (PARCZ), which are discussed in this work as well. All the features introduced in this work were identified by Mann-Whitney~U test as significant (p < 0.05) when processing at least one of the mentioned databases. The largest discriminative power from these proposed features has a~cepstral peak prominence extracted from the first intrinsic mode function (p=6.94431032p = 6.9443\cdot10^{-32}) which means, that among all newly designed features those that quantify especially hoarseness or breathiness are good candidates for pathological speech identification. The article also mentions some ideas for the future work in the field of pathological speech signal analysis that can be valuable especially under the clinical point of view

    Neuromechanical modelling of articulatory movements from surface electromyography and speech formants

    Full text link
    Speech articulation is produced by the movements of muscles in the larynx, pharynx, mouth and face. Therefore speech shows acoustic features as formants which are directly related with neuromotor actions of these muscles. The first two formants are strongly related with jaw and tongue muscular activity. Speech can be used as a simple and ubiquitous signal, easy to record and process, either locally or on e-Health platforms. This fact may open a wide set of applications in the study of functional grading and monitoring neurodegenerative diseases. A relevant question, in this sense, is how far speech correlates and neuromotor actions are related. This preliminary study is intended to find answers to this question by using surface electromyographic recordings on the masseter and the acoustic kinematics related with the first formant. It is shown in the study that relevant correlations can be found among the surface electromyographic activity (dynamic muscle behavior) and the positions and first derivatives of the first formant (kinematic variables related to vertical velocity and acceleration of the joint jaw and tongue biomechanical system). As an application example, it is shown that the probability density function associated to these kinematic variables is more sensitive than classical features as Vowel Space Area (VSA) or Formant Centralization Ratio (FCR) in characterizing neuromotor degeneration in Parkinson’s Disease

    Assessing Progress of Parkinson's Disease Using Acoustic Analysis of Phonation

    No full text
    This paper deals with a~complex acoustic analysis of phonation in patients with Parkinson's disease (PD) with a~special focus on estimation of disease progress that is described by 7 different clinical scales (e.\,g. Unified Parkinson's disease rating scale or Beck depression inventory). The analysis is based on parametrization of 5 Czech vowels pronounced by 84 PD patients. Using classification and regression trees we estimated all clinical scores with maximal error lower or equal to 13\,\%. Best estimation was observed in the case of Mini-mental state examination (MAE = 0.77, estimation error 5.50\,\%). Finally, we proposed a~binary classification based on random forests that is able to identify Parkinson's disease with sensitivity SEN = 92.86\,\% (SPE = 85.71\,\%). The parametrization process was based on extraction of 107 speech features quantifying different clinical signs of hypokinetic dysarthria present in PD

    Changes in phonation and their relations with Progress of Parkinson's disease

    Full text link
    Hypokinetic dysarthria, which is associated with Parkinson’s disease (PD), affects several speech dimensions, including phonation. Although the scientific community has dealt with a quantitative analysis of phonation in PD patients, a complex research revealing probable relations between phonatory features and progress of PD is missing. Therefore, the aim of this study is to explore these relations and model them mathematically to be able to estimate progress of PD during a two-year follow-up. We enrolled 51 PD patients who were assessed by three commonly used clinical scales. In addition, we quantified eight possible phonatory disorders in five vowels. To identify the relationship between baseline phonatory features and changes in clinical scores, we performed a partial correlation analysis. Finally, we trained XGBoost models to predict the changes in clinical scores during a two-year follow-up. For two years, the patients’ voices became more aperiodic with increased microperturbations of frequency and amplitude. Next, the XGBoost models were able to predict changes in clinical scores with an error in range 11–26%. Although we identified some significant correlations between changes in phonatory features and clinical scores, they are less interpretable. This study suggests that it is possible to predict the progress of PD based on the acoustic analysis of phonation. Moreover, it recommends utilizing the sustained vowel /i/ instead of /a/
    corecore