2 research outputs found

    Emulating the perceptual capabilities of a human evaluator to map the GRB scale for the assessment of voice disorders

    No full text
    This paper presents the design of an automatic voice quality analysis system for the assessment of voice pathologies, which emulates the perceptual capabilities of a human evaluator according the GRB scale. For this purpose, a novel methodology based on multiple sets of characteristics, ordinal classification and Gaussian regression is proposed. In particular, a reduced subset of characteristics is identified, and the regressor is used to convert the discrete perceptual scale to a continuum, more in agreement to the nature of the problem under study. The robustness of the system is evaluated in several cross-dataset experiments. Similarly, a clinical evaluation of the predictions provided by the system is carried out. Results indicate that the proposed methodology is proficient in modelling the perceptual capabilities of the human evaluator. They also show that it is possible to extend the GRB scale to a continuum through regression techniques while maintaining the consistency of the results. On average, the deviation between the labels assessed by the expert and the ones provided by the system is of about 0.5 units (in a scale from 0 to 3) for G and B, and of 0.7 units for R. Similarly, the deviation of the labels predicted by the system in the clinical assessment trials is about 0.3 units for G, 0.4 units for B, and 0.5 units for

    Emulating the perceptual capabilities of a human evaluator to map the GRB scale for the assessment of voice disorders

    Full text link
    This paper presents the design of an automatic voice quality analysis system for the assessment of voice pathologies, which emulates the perceptual capabilities of a human evaluator according the GRB scale. For this purpose, a novel methodology based on multiple sets of characteristics, ordinal classification and Gaussian regression is proposed. In particular, a reduced subset of characteristics is identified, and the regressor is used to convert the discrete perceptual scale to a continuum, more in agreement to the nature of the problem under study. The robustness of the system is evaluated in several cross-dataset experiments. Similarly, a clinical evaluation of the predictions provided by the system is carried out. Results indicate that the proposed methodology is proficient in modelling the perceptual capabilities of the human evaluator. They also show that it is possible to extend the GRB scale to a continuum through regression techniques while maintaining the consistency of the results. On average, the deviation between the labels assessed by the expert and the ones provided by the system is of about 0.5 units (in a scale from 0 to 3) for G and B, and of 0.7 units for R. Similarly, the deviation of the labels predicted by the system in the clinical assessment trials is about 0.3 units for G, 0.4 units for B, and 0.5 units for
    corecore