151 research outputs found

    Voice monitoring: technical and clinical aspects

    Get PDF

    Reliability of perceptions of voice quality: evidence from a problem asthma clinic population

    Get PDF
    <p>Introduction: Methods of perceptual voice evaluation have yet to achieve satisfactory consistency; complete acceptance of a recognised clinical protocol is still some way off.</p> <p>Materials and methods: Three speech and language therapists rated the voices of 43 patients attending the problem asthma clinic of a teaching hospital, according to the grade-roughness-breathiness-asthenicity-strain (GRBAS) scale and other perceptual categories.</p> <p>Results and analysis: Use of the GRBAS scale achieved only a 64.7 per cent inter-rater reliability and a 69.6 per cent intra-rater reliability for the grade component. One rater achieved a higher degree of consistency. Improved concordance on the GRBAS scale was observed for subjects with laryngeal abnormalities. Raters failed to reach any useful level of agreement in the other categories employed, except for perceived gender.</p> <p>Discussion: These results should sound a note of caution regarding routine adoption of the GRBAS scale for characterising voice quality for clinical purposes. The importance of training and the use of perceptual anchors for reliable perceptual rating need to be further investigated.</p&gt

    Spasmodic dysphonia, perceptual and acoustic analysis: presenting new diagnostic tools

    Get PDF
    In this article, we investigate whether (1) the IINFVo (Impression, Intelligibility, Noise, Fluency and Voicing) perceptual rating scale and (2) the AMPEX (Auditory Model Based Pitch Extractor) acoustical analysis are suitable for evaluating adductor spasmodic dysphonia (AdSD). Voice recordings of 12 patients were analysed. The inter-rater and intra-rater consistency showed highly significant correlations for the IINFVo rating scale, with the exception of the parameter Noise. AMPEX reliably analyses vowels (correlation between PUVF (percentage of frames with unreliable F0/voicing 0.748), running speech (correlation between PVF (percentage of voiced frames)/voicing 0.699) and syllables. Correlations between IINFVo and AMPEX range from 0.608 to 0.818, except for noise. This study indicates that IINFVo and AMPEX could be robust and complementary assessment tools for the evaluation of AdSD. Both the tools provide us with the valuable information about voice quality, stability of F0 (fundamental frequency) and specific dimensions controlling the transitions between voiced and unvoiced segments

    Tridimensional assessment of adductor spasmodic dysphonia pre- and post-treatment with Botulinum toxin

    Get PDF
    Spasmodic dysphonia voices form, in the same way as substitution voices, a particular category of dysphonia that seems not suited for a standardized basic multidimensional assessment protocol, like the one proposed by the European Laryngological Society. Thirty-three exhaustive analyses were performed on voices of 19 patients diagnosed with adductor spasmodic dysphonia (SD), before and after treatment with Botulinum toxin. The speech material consisted of 40 short sentences phonetically selected for constant voicing. Seven perceptual parameters (traditional and dedicated) were blindly rated by a panel of experienced clinicians. Nine acoustic measures (mainly based on voicing evidence and periodicity) were achieved by a special analysis program suited for strongly irregular signals and validated with synthesized deviant voices. Patients also filled in a VHI-questionnaire. Significant improvement is shown by all three approaches. The traditional GRB perceptual parameters appear to be adequate for these patients. Conversely, the special acoustic analysis program is successful in objectivating the improved regularity of vocal fold vibration: the basic jitter remains the most valuable parameter, when reliably quantified. The VHI is well suited for the voice-related quality of life. Nevertheless, when considering pre-therapy and post-therapy changes, the current study illustrates a complete lack of correlation between the perceptual, acoustic, and self-assessment dimensions. Assessment of SD-voices needs to be tridimensional

    Voicing quantification is more relevant than period perturbation in substitution voices: an advanced acoustical study

    Get PDF
    Quality of substitution voicing—i.e., phonation with a voice that is not generated by the vibration of two vocal folds—cannot be adequately evaluated with routinely used software for acoustic voice analysis that is aimed at ‘common’ dysphonias and nearly periodic voice signals. The AMPEX analysis program (Van Immerseel and Martens) has been shown previously to be able to detect periodicity in irregular signals with background noise, and to be suited for running speech. The validity of this analysis program is first tested using realistic synthesized voice signals with known levels of cycle-to-cycle perturbations and additive noise. Second, exhaustive acoustic analysis is performed of the voices of 116 patients surgically treated for advanced laryngeal cancer and recorded in seven European academic centers. All of them read out a short phonetically balanced passage. Patients were divided into six groups according to the oscillating structures they used to phonate. Results show that features related to quantification of voicing enable a distinction between the different groups, while the features reporting F0-instability fail to do so. Acoustic evaluation of voice quality in substitution voices thus best relies upon voicing quantification

    Assessment of vocal cord nodules: A case study in speech processing by using Hilbert-Huang Transform

    Get PDF
    Vocal cord nodules represent a pathological condition for which the growth of unnatural masses on vocal folds affects the patients. Among other effects, changes in the vocal cords' overall mass and stiffness alter their vibratory behaviour, thus changing the vocal emission generated by them. This causes dysphonia, i.e. abnormalities in the patients' voice, which can be analysed and inspected via audio signals. However, the evaluation of voice condition through speech processing is not a trivial task, as standard methods based on the Fourier Transform, fail to fit the non-stationary nature of vocal signals. In this study, four audio tracks, provided by a volunteer patient, whose vocal fold nodules have been surgically removed, were analysed using a relatively new technique: the Hilbert-Huang Transform (HHT) via Empirical Mode Decomposition (EMD); specifically, by using the CEEMDAN (Complete Ensemble EMD with Adaptive Noise) algorithm. This method has been applied here to speech signals, which were recorded before removal surgery and during convalescence, to investigate specific trends. Possibilities offered by the HHT are exposed, but also some limitations of decomposing the signals into so-called intrinsic mode functions (IMFs) are highlighted. The results of these preliminary studies are intended to be a basis for the development of new viable alternatives to the softwares currently used for the analysis and evaluation of pathological voice
    corecore