2 research outputs found

    On-the-Fly Audio Source Separation-A Novel User-Friendly Framework

    No full text
    This paper addresses the challenging problem of single-channel audio source separation. We introduce a novel userguided framework where source models that govern the separation process are learned on-the-fly from audio examples retrieved online. The user only provides the search keywords that describe the sources in the mixture. In this framework, the generic spectral characteristics of each source are modeled by a universal sound class model learned from the retrieved examples via nonnegative matrix factorization. We propose several group sparsity-inducing constraints in order to efficiently exploit a relevant subset of the universal model adapted to the mixture to be separated. We then derive the corresponding multiplicative update rules for parameter estimation. Separation results obtained from automated and user tests on mixtures containing various types of sounds confirm the effectiveness of the proposed framework

    Algoritmos de procesado de se帽al basados en Non-negative Matrix Factorization aplicados a la separaci贸n, detecci贸n y clasificaci贸n de sibilancias en se帽ales de audio respiratorias monocanal

    Get PDF
    La auscultaci贸n es el primer examen cl铆nico que un m茅dico lleva a cabo para evaluar el estado del sistema respiratorio, debido a que es un m茅todo no invasivo, de bajo coste, f谩cil de realizar y seguro para el paciente. Sin embargo, el diagn贸stico que se deriva de la auscultaci贸n sigue siendo un diagn贸stico subjetivo que se encuentra condicionado a la habilidad, experiencia y entrenamiento de cada m茅dico en la escucha e interpretaci贸n de las se帽ales de audio respiratorias. En consecuencia, se producen un alto porcentaje de diagn贸sticos err贸neos que ponen en riesgo la salud de los pacientes e incrementan el coste asociado a los centros de salud. Esta Tesis propone nuevos m茅todos basados en Non-negative Matrix Factorization aplicados a la separaci贸n, detecci贸n y clasificaci贸n de sonidos sibilantes para proporcionar una v铆a de informaci贸n complementaria al m茅dico que ayude a mejorar la fiabilidad del diagn贸stico emitido por el especialista. Auscultation is the first clinical examination that a physician performs to evaluate the condition of the respiratory system, because it is a non-invasive, low-cost, easy-to-perform and safe method for the patient. However, the diagnosis derived from auscultation remains a subjective diagnosis that is conditioned by the ability, experience and training of each physician in the listening and interpretation of respiratory audio signals. As a result, a high percentage of misdiagnoses are produced that endanger the health of patients and increase the cost associated with health centres. This Thesis proposes new methods based on Non-negative Matrix Factorization applied to separation, detection and classification of wheezing sounds in order to provide a complementary information pathway to the physician that helps to improve the reliability of the diagnosis made by the doctor.Tesis Univ. Ja茅n. Departamento INGENIER脥A DE TELECOMUNICACI脫
    corecore