30 research outputs found

    Frequency-Based Analysis for the Characterization of the Dysphonic Voices

    No full text
    International audienceSince several years, many studies have focused on the objective measurement-analysis for dysphonic voice assessment, proposed as an alternative to the perceptual evaluation that is extensively used by clinicians. In most cases, these studies describe classification systems, based on acoustic, physiological and/or aerodynamic parameters, in order to improve the performance and to help clinicians to make their decision. A few studies have been dedicated to the analysis of dysphonia effects on the speech signal

    SPEAKER DIARIZATION IN THE ELISA CONSORTIUM OVER THE LAST 4 YEARS

    No full text
    International audienceThis paper summarizes the collaboration of the LIA and CLIPS laboratories, members of the ELISA consortium, along the last 4 year NIST speaker diarization system evaluation campaigns. In this context, two individual approaches, quite different, have been developed individually by each lab, to respond to the specific task of speaker segmentation. The first one relies on a classical two-step speaker segmentation strategy, based on the detection of speaker turns followed by a clustering process, while the second one corresponds to an integrated strategy where both segment boundaries and speaker tying of the segments are extracted simultaneously and challenged during the whole process. From these two main methods, various strategies were investigated for the fusion of segmentation results. Through the performance achieved along the different evaluation campaigns as well as the experience gained by the LIA and CLIPS labs in the speaker diarization task, a discussion about the overall work done in this evaluation context is drawn in this paper, proposing further investigation and progression

    Analyse Fréquentielle pour la Caractérisation des Voix Dysphoniques

    No full text
    International audienceDans le cadre de l'évaluation des voix dysphoniques, de nombreuses études se sont concentrées sur l'analyse objective, proposée comme une alternative à l'évaluation perceptive. Dans la plupart des cas, ces études décrivent des systèmes de classification collectant des mesures acoustiques, physiologiques et/ou aérodynamiques afin d'améliorer les performances de classification de la voix et d'aider les cliniciens dans leur décision. Quelques études ont été consacrées à l'analyse des effets de la dysphonie sur le signal de parole

    Dysphonic Voices and the 0-3000Hz Frequency Band

    No full text
    International audienceConcerned with pathological voice assessment, this paper aims at characterizing dysphonia in the frequency domain for a better understanding of related phenomena while most of the studies have focused only on improving classification systems for diagnosis help purposes. Based on a first study which demonstrates that the low frequencies ([0-3000]Hz) are more relevant for dys-phonia discrimination compared with higher frequencies, the authors propose in this paper to pursue by analyzing the impact of the restricted frequency band ([0-3000]Hz) on the dysphonic voice discrimination from a phonetical and perceptual point of views. A discussion around the frequency band limitation of telephone channel is also proposed

    Are the unvoiced consonants relevant for dysphonia phenomenon observation?

    No full text
    International audienceConcerned with pathological voice assessment, this paper aims at characterizing dysphonia in the speech signal for a better understanding of related phenomena while most of the studies have focused only on improving classification systems for diagnosis help purposes. This work is focused on an automatic and manual phonetic analysis, which highlights the potential and rather unexpected relevance of unvoiced consonants in the automatic classification task of dysphonia severity grades (based on the GRBAS scale)

    Pertinence des consonnes sourdes pour l'observation des phénomènes liés à la dysphonie

    No full text
    International audienceDans le cadre de l'évaluation objective de la qualité de la voix pathologique, ce travail s'intéresse à la recherche d'informations pertinentes pour la caractérisation de la dysphonie dans le signal de parole. Il s'inscrit dans un projet plus large dont l'objectif principal est d'apporter une meilleure compréhension des phénomènes acoustiques liés au trouble vocal. En ce sens et partant de l'hypothèse que la dysphonie peut être appréhendée comme n'importe quelle information extra-linguistique, le Laboratoire d'Informatique d'Avignon (LIA) utilise depuis quelques années un système de Reconnaissance Automatique du Locuteur (RAL) [2] basé sur la modélisation par Modèle de Mélange de Gaussiennes (GMM)
    corecore