Search CORE

3,653 research outputs found

Improving automatic detection of obstructive sleep apnea through nonlinear analysis of sustained speech

Author: Blanco Murillo José Luis
Fernández Pozo Rubén
Hernández Gómez Luis Alfonso
Ramos Daniel
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

We present a novel approach for the detection of severe obstructive sleep apnea (OSA) based on patients' voices introducing nonlinear measures to describe sustained speech dynamics. Nonlinear features were combined with state-of-the-art speech recognition systems using statistical modeling techniques (Gaussian mixture models, GMMs) over cepstral parameterization (MFCC) for both continuous and sustained speech. Tests were performed on a database including speech records from both severe OSA and control speakers. A 10 % relative reduction in classification error was obtained for sustained speech when combining MFCC-GMM and nonlinear features, and 33 % when fusing nonlinear features with both sustained and continuous MFCC-GMM. Accuracy reached 88.5 % allowing the system to be used in OSA early detection. Tests showed that nonlinear features and MFCCs are lightly correlated on sustained speech, but uncorrelated on continuous speech. Results also suggest the existence of nonlinear effects in OSA patients' voices, which should be found in continuous speech

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Archivo Digital UPM

Biblos-e Archivo

Introducing non-linear analysis into sustained speech characterization to improve sleep apnea detection

Author: A. Tsanas
A.W. Fox
G. Coccagna
J.A. Fiz
J.I. Godino-Llorente
M.P. Robb
P. Lloberes
R. Fernandez
T. Penzel
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011
Field of study

The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-642-25020-0_28Proceedings of 5th International Conference on Nonlinear Speech Processing, NOLISP 2011, Las Palmas de Gran Canaria (Spain)We present a novel approach for detecting severe obstructive sleep apnea (OSA) cases by introducing non-linear analysis into sustained speech characterization. The proposed scheme was designed for providing additional information into our baseline system, built on top of state-of-the-art cepstral domain modeling techniques, aiming to improve accuracy rates. This new information is lightly correlated with our previous MFCC modeling of sustained speech and uncorrelated with the information in our continuous speech modeling scheme. Tests have been performed to evaluate the improvement for our detection task, based on sustained speech as well as combined with a continuous speech classifier, resulting in a 10% relative reduction in classification for the first and a 33% relative reduction for the fused scheme. Results encourage us to consider the existence of non-linear effects on OSA patients’ voices, and to think about tools which could be used to improve short-time analysis.The activities described in this paper were funded by the Spanish Ministry of Science and Innovation as part of the TEC2009-14719-C02-02 (PriorSpeech) project

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Biblos-e Archivo

Improving Quality of Life: Home Care for Chronically Ill and Elderly People

Author: Celani Natalia López
Ponce Sergio
Quintero Olga Lucía
Vargas-Bonilla Francisco
Publication venue: 'IntechOpen'
Publication date: 20/12/2017
Field of study

In this chapter, we propose a system especially created for elderly or chronically ill people that are with special needs and poor familiarity with technology. The system combines home monitoring of physiological and emotional states through a set of wearable sensors, user-controlled (automated) home devices, and a central control for integration of the data, in order to provide a safe and friendly environment according to the limited capabilities of the users. The main objective is to create the easy, low-cost automation of a room or house to provide a friendly environment that enhances the psychological condition of immobilized users. In addition, the complete interaction of the components provides an overview of the physical and emotional state of the user, building a behavior pattern that can be supervised by the care giving staff. This approach allows the integration of physiological signals with the patient’s environmental and social context to obtain a complete framework of the emotional states

IntechOpen

Detección automática de voz hipernasal de niños con labio y paladar hendido a partir de vocales y palabras del español usando medidas clásicas y análisis no lineal

Author: Abarbanel
Golding
Hurst
Jolliffe
Kantz
Kummer
Maier
Michaelis
Orozco
Oseledec
Scholköpf
Publication venue: 'Universidad de Antioquia'
Publication date: 01/01/2016
Field of study

RESUMEN: Este artículo presenta un sistema para la detección automática de señales de voz hipernasales basado en la combinación de dos diferentes esquemas de caracterización aplicados en las cinco vocales del español y dos palabras seleccionadas. El primer esquema está basado en características clásicas como perturbaciones del periodo fundamental, medidas de ruido y coeficientes cepstrales en la frecuencia de Mel. El segundo enfoque está basado en medidas de dinámica no lineal. Las características más relevantes son seleccionadas usando dos técnicas: análisis de componentes principales y selección flotante hacia adelante secuencial. La decisión acerca de si un registro de voz es hipernasal o sano es tomada usando una máquina de soporte vectorial de margen suave. Los experimentos consideran grabaciones de las cinco vocales del idioma español y las palabras y se consideran, asimismo, tres conjuntos de características: (1) el enfoque clásico, (2) el análisis de dinámica no lineal y (3) la combinación de ambos esquemas. En general, los aciertos son mayores y más estables cuando las características clásicas y no lineales son combinadas, indicando que el análisis de dinámica no lineal se complementa con el esquema clásico.ABSTRACT: This paper presents a system for the automatic detection of hypernasal speech signals based on the combination of two different characterization approaches applied to the five spanish vowels and two selected words. The first approach is based on classical features such as pitch period perturbations, noise measures, and Mel-Frequency Cepstral Coefficients (MFCC). The second approach is based on the Non-Linear Dynamics (NLD) analysis. The most relevant features are selected and sorted using two techniques: Principal Components Analysis (PCA) and Sequential Forward Floating Selection (SFFS). The decision about whether a voice record is hypernasal or healthy is taken using a Soft Margin - Support Vector Machine (SM-SVM). Experiments upon recordings of the five Spanish vowels and the words are performed considering three different set of features: (1) the classical approach, (2) the NLD analysis, and (3) the combination of the classical and NLD measures. In general, the accuracies are higher and more stable when the classical and NLD features are combined, indicating that the NLD analysis is complementary to the classical approach

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Crossref

Directory of Open Access Journals

Biblioteca Digital del Sistema de Bibliotecas de la Universidad de Antioquia

Characterization of Healthy and Pathological Voice Through Measures Based on Nonlinear Dynamics

Author: Alonso Jesús B.
Díaz de María Fernando
Ferrer Miguel A.
Godino-Llorente Juan I.
Henríquez Patricia
Travieso Carlos M.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2009
Field of study

In this paper, we propose to quantify the quality of the recorded voice through objective nonlinear measures. Quantification of speech signal quality has been traditionally carried out with linear techniques since the classical model of voice production is a linear approximation. Nevertheless, nonlinear behaviors in the voice production process have been shown. This paper studies the usefulness of six nonlinear chaotic measures based on nonlinear dynamics theory in the discrimination between two levels of voice quality: healthy and pathological. The studied measures are first- and second-order Renyi entropies, the correlation entropy and the correlation dimension. These measures were obtained from the speech signal in the phase-space domain. The values of the first minimum of mutual information function and Shannon entropy were also studied. Two databases were used to assess the usefulness of the measures: a multiquality database composed of four levels of voice quality (healthy voice and three levels of pathological voice); and a commercial database (MEEI Voice Disorders) composed of two levels of voice quality (healthy and pathological voices). A classifier based on standard neural networks was implemented in order to evaluate the measures proposed. Global success rates of 82.47% (multiquality database) and 99.69% (commercial database) were obtained.Publicad

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Universidad Carlos III de Madrid e-Archivo

Analysis and Detection of Pathological Voice using Glottal Source Features

Author: Alku Paavo
Kadiri Sudarsana Reddy
Publication venue
Publication date: 25/09/2023
Field of study

Automatic detection of voice pathology enables objective assessment and earlier intervention for the diagnosis. This study provides a systematic analysis of glottal source features and investigates their effectiveness in voice pathology detection. Glottal source features are extracted using glottal flows estimated with the quasi-closed phase (QCP) glottal inverse filtering method, using approximate glottal source signals computed with the zero frequency filtering (ZFF) method, and using acoustic voice signals directly. In addition, we propose to derive mel-frequency cepstral coefficients (MFCCs) from the glottal source waveforms computed by QCP and ZFF to effectively capture the variations in glottal source spectra of pathological voice. Experiments were carried out using two databases, the Hospital Universitario Principe de Asturias (HUPA) database and the Saarbrucken Voice Disorders (SVD) database. Analysis of features revealed that the glottal source contains information that discriminates normal and pathological voice. Pathology detection experiments were carried out using support vector machine (SVM). From the detection experiments it was observed that the performance achieved with the studied glottal source features is comparable or better than that of conventional MFCCs and perceptual linear prediction (PLP) features. The best detection performance was achieved when the glottal source features were combined with the conventional MFCCs and PLP features, which indicates the complementary nature of the features

arXiv.org e-Print Archive

Models and Analysis of Vocal Emissions for Biomedical Applications

Author
Publication venue: 'Firenze University Press'
Publication date: 31/05/2022
Field of study

The MAVEBA Workshop proceedings, held on a biannual basis, collect the scientific papers presented both as oral and poster contributions, during the conference. The main subjects are: development of theoretical and mechanical models as an aid to the study of main phonatory dysfunctions, as well as the biomedical engineering methods for the analysis of voice signals and images, as a support to clinical diagnosis and classification of vocal pathologies

Directory of Open Access Books (DOAB)

Feature selection for spontaneous speech analysis to aid in Alzheimer’s disease diagnosis: A fractal dimension approach

Author: AA
Abásolo
Aitzol Ezeiza
American Psychiatric Association (APA)
Arias-Londoño
Blanca Beitia
C.M. Travieso
Castiglioni
Chen
Chouard
Costa
Croota
Esteller
Ezeiza
Faundez-Zanuy
Godino
Godino
Gómez
Harkaitz Eguiraun
Higuchi
J.B. Alonso
Jang
Jordi Solé-Casals
Karmele López-de-Ipiña
Katz
Langi
Li
López-de-Ipiña
López-de-Ipiña
López-de-Ipiña
Maragos
Martinez
McKhann
McKhann
Miriam Ecay-Torres
Morris
Nelwamondo
Nora Barroso
Ouayoun
Pablo Martinez-Lage
Picard
Pickover
Pitsikalis
Péan
Péan
Solé
Tsonis
Van de Pole
Publication venue: 'Elsevier BV'
Publication date: 01/01/2015
Field of study

Alzheimer’s disease (AD) is the most prevalent form of degenerative dementia; it has a high socio-economic impact in Westerncountries. The purpose of our project is to contribute to earlier diagnosis of AD and allow better estimates of its severity by usingautomatic analysis performed through new biomarkers extracted through non-invasive intelligent methods. The method selectedis based on speech biomarkers derived from the analysis of spontaneous speech (SS). Thus the main goal of the present work isfeature search in SS, aiming at pre-clinical evaluation whose results can be used to select appropriate tests for AD diagnosis. Thefeature set employed in our earlier work offered some hopeful conclusions but failed to capture the nonlinear dynamics of speechthat are present in the speech waveforms. The extra information provided by the nonlinear features could be especially useful whentraining data is limited. In this work, the fractal dimension (FD) of the observed time series is combined with linear parameters inthe feature vector in order to enhance the performance of the original system while controlling the computational cost.© 2014 Elsevier Ltd. All rights reserved

Crossref

RIUVic