44 research outputs found

    Respiratory Sound Analysis for the Evidence of Lung Health

    Get PDF
    Significant changes have been made on audio-based technologies over years in several different fields along with healthcare industry. Analysis of Lung sounds is a potential source of noninvasive, quantitative information along with additional objective on the status of the pulmonary system. To do that medical professionals listen to sounds heard over the chest wall at different positions with a stethoscope which is known as auscultation and is important in diagnosing respiratory diseases. At times, possibility of inaccurate interpretation of respiratory sounds happens because of clinician’s lack of considerable expertise or sometimes trainees such as interns and residents misidentify respiratory sounds. We have built a tool to distinguish healthy respiratory sound from non-healthy ones that come from respiratory infection carrying patients. The audio clips were characterized using Linear Predictive Cepstral Coefficient (LPCC)-based features and the highest possible accuracy of 99.22% was obtained with a Multi-Layer Perceptron (MLP)- based classifier on the publicly available ICBHI17 respiratory sounds dataset [1] of size 6800+ clips. The system also outperformed established works in literature and other machine learning techniques. In future we will try to use larger dataset with other acoustic techniques along with deep learning-based approaches and try to identify the nature and severity of infection using respiratory sounds

    Multi-Time-Scale Features for Accurate Respiratory Sound Classification

    Get PDF
    The COVID-19 pandemic has amplified the urgency of the developments in computer-assisted medicine and, in particular, the need for automated tools supporting the clinical diagnosis and assessment of respiratory symptoms. This need was already clear to the scientific community, which launched an international challenge in 2017 at the International Conference on Biomedical Health Informatics (ICBHI) for the implementation of accurate algorithms for the classification of respiratory sound. In this work, we present a framework for respiratory sound classification based on two different kinds of features: (i) short-term features which summarize sound properties on a time scale of tenths of a second and (ii) long-term features which assess sounds properties on a time scale of seconds. Using the publicly available dataset provided by ICBHI, we cross-validated the classification performance of a neural network model over 6895 respiratory cycles and 126 subjects. The proposed model reached an accuracy of 85%±3% and an precision of 80%±8%, which compare well with the body of literature. The robustness of the predictions was assessed by comparison with state-of-the-art machine learning tools, such as the support vector machine, Random Forest and deep neural networks. The model presented here is therefore suitable for large-scale applications and for adoption in clinical practice. Finally, an interesting observation is that both short-term and long-term features are necessary for accurate classification, which could be the subject of future studies related to its clinical interpretation

    Multi-time-scale features for accurate respiratory sound classification

    Get PDF
    The COVID-19 pandemic has amplified the urgency of the developments in computer-assisted medicine and, in particular, the need for automated tools supporting the clinical diagnosis and assessment of respiratory symptoms. This need was already clear to the scientific community, which launched an international challenge in 2017 at the International Conference on Biomedical Health Informatics (ICBHI) for the implementation of accurate algorithms for the classification of respiratory sound. In this work, we present a framework for respiratory sound classification based on two different kinds of features: (i) short-term features which summarize sound properties on a time scale of tenths of a second and (ii) long-term features which assess sounds properties on a time scale of seconds. Using the publicly available dataset provided by ICBHI, we cross-validated the classification performance of a neural network model over 6895 respiratory cycles and 126 subjects. The proposed model reached an accuracy of 85% ± 3% and an precision of 80% ± 8%, which compare well with the body of literature. The robustness of the predictions was assessed by comparison with state-of-the-art machine learning tools, such as the support vector machine, Random Forest and deep neural networks. The model presented here is therefore suitable for large-scale applications and for adoption in clinical practice. Finally, an interesting observation is that both short-term and long-term features are necessary for accurate classification, which could be the subject of future studies related to its clinical interpretation

    Automatic lung health screening using respiratory sounds

    Get PDF
    Significant changes have been made on audio-based technologies over years in several different fields. Healthcare is no exception. One of such avenues is health screening based on respiratory sounds. In this paper, we developed a tool to detect respiratory sounds that come from respiratory infection carrying patients. Linear Predictive Cepstral Coefficient (LPCC)-based features were used to characterize such audio clips. With Multilayer Perceptron (MLP)-based classifier, in our experiment, we achieved the highest possible accuracy of 99.22% that was tested on a publicly available respiratory sounds dataset (ICBHI17) (Rocha et al. Physiol. Meas. 40(3):035,001 20) of size 6800+ clips. In addition to other popular machine learning classifiers, our results outperformed common works that exist in the literature

    Estimasi Arah Sumber Suara Berbasis Gaussian Mixture Model

    Full text link
    Estimasi arah sumber suara menjadi topik penting yang berhubungan dengan aplikasi robot, sistem sensor dan keamanan. Variasi kondisi ekperimen dalam melakukan estimasi tersebut akan menentukan nilai akurasi. Dalam penelitian ini, variasi terhadap temperatur dan waktu pantul diambil untuk dianalisa terhadap nilai akurasi estimasi arah sumber suara. Sinyal yang digunakan adalah sinyal binaural dengan menggunakan sinyal pengganggu white noise dan human speech like (HSL) noise untuk sudut azimuth bervariasi. Estimasi dilakukan dengan menggunakan metode Gaussian Mixture Model (GMM) untuk tipe horizontal plane dan horizontal – vertical planes. Hasil eksperimen menunjukkan sudut azimuth yang dekat dengan pendengar akan menyampaikan sinyal suara lebih cepat daripada sudut yang jauh, sinyal dengan durasi waktu yang panjang yaitu 2000 milidetik akan memberikan akurasi estimasi yang lebih tinggi daripada durasi sinyal yang lebih pendek: 100, 500, dan 1000 milidetik. Selain itu, akurasi estimasi lebih tinggi untuk suara dengan white noise daripada suara dengan HSL noise. Hasil lainnya adalah estimasi memiliki performansi lebih tinggi untuk horizontal – vertical planes daripada hanya kondisi horizontal plane. Estimasi mencapai 98,6% akurasi untuk horizontal plane dan 100% akurasi untuk horizontal-vertical planes

    Multichannel analysis of normal and continuous adventitious respiratory sounds for the assessment of pulmonary function in respiratory diseases

    Get PDF
    Premi extraordinari doctorat UPC curs 2015-2016, àmbit d’Enginyeria IndustrialRespiratory sounds (RS) are produced by turbulent airflows through the airways and are inhomogeneously transmitted through different media to the chest surface, where they can be recorded in a non-invasive way. Due to their mechanical nature and airflow dependence, RS are affected by respiratory diseases that alter the mechanical properties of the respiratory system. Therefore, RS provide useful clinical information about the respiratory system structure and functioning. Recent advances in sensors and signal processing techniques have made RS analysis a more objective and sensitive tool for measuring pulmonary function. However, RS analysis is still rarely used in clinical practice. Lack of a standard methodology for recording and processing RS has led to several different approaches to RS analysis, with some methodological issues that could limit the potential of RS analysis in clinical practice (i.e., measurements with a low number of sensors, no controlled airflows, constant airflows, or forced expiratory manoeuvres, the lack of a co-analysis of different types of RS, or the use of inaccurate techniques for processing RS signals). In this thesis, we propose a novel integrated approach to RS analysis that includes a multichannel recording of RS using a maximum of five microphones placed over the trachea and the chest surface, which allows RS to be analysed at the most commonly reported lung regions, without requiring a large number of sensors. Our approach also includes a progressive respiratory manoeuvres with variable airflow, which allows RS to be analysed depending on airflow. Dual RS analyses of both normal RS and continuous adventitious sounds (CAS) are also proposed. Normal RS are analysed through the RS intensity–airflow curves, whereas CAS are analysed through a customised Hilbert spectrum (HS), adapted to RS signal characteristics. The proposed HS represents a step forward in the analysis of CAS. Using HS allows CAS to be fully characterised with regard to duration, mean frequency, and intensity. Further, the high temporal and frequency resolutions, and the high concentrations of energy of this improved version of HS, allow CAS to be more accurately characterised with our HS than by using spectrogram, which has been the most widely used technique for CAS analysis. Our approach to RS analysis was put into clinical practice by launching two studies in the Pulmonary Function Testing Laboratory of the Germans Trias i Pujol University Hospital for assessing pulmonary function in patients with unilateral phrenic paralysis (UPP), and bronchodilator response (BDR) in patients with asthma. RS and airflow signals were recorded in 10 patients with UPP, 50 patients with asthma, and 20 healthy participants. The analysis of RS intensity–airflow curves proved to be a successful method to detect UPP, since we found significant differences between these curves at the posterior base of the lungs in all patients whereas no differences were found in the healthy participants. To the best of our knowledge, this is the first study that uses a quantitative analysis of RS for assessing UPP. Regarding asthma, we found appreciable changes in the RS intensity–airflow curves and CAS features after bronchodilation in patients with negative BDR in spirometry. Therefore, we suggest that the combined analysis of RS intensity–airflow curves and CAS features—including number, duration, mean frequency, and intensity—seems to be a promising technique for assessing BDR and improving the stratification of BDR levels, particularly among patients with negative BDR in spirometry. The novel approach to RS analysis developed in this thesis provides a sensitive tool to obtain objective and complementary information about pulmonary function in a simple and non-invasive way. Together with spirometry, this approach to RS analysis could have a direct clinical application for improving the assessment of pulmonary function in patients with respiratory diseases.Los sonidos respiratorios (SR) se generan con el paso del flujo de aire a través de las vías respiratorias y se transmiten de forma no homogénea hasta la superficie torácica. Dada su naturaleza mecánica, los SR se ven afectados en gran medida por enfermedades que alteran las propiedades mecánicas del sistema respiratorio. Por lo tanto, los SR proporcionan información clínica relevante sobre la estructura y el funcionamiento del sistema respiratorio. La falta de una metodología estándar para el registro y procesado de los SR ha dado lugar a la aparición de diferentes estrategias de análisis de SR con ciertas limitaciones metodológicas que podrían haber restringido el potencial y el uso de esta técnica en la práctica clínica (medidas con pocos sensores, flujos no controlados o constantes y/o maniobras forzadas, análisis no combinado de distintos tipos de SR o uso de técnicas poco precisas para el procesado de los SR). En esta tesis proponemos un método innovador e integrado de análisis de SR que incluye el registro multicanal de SR mediante un máximo de cinco micrófonos colocados sobre la tráquea yla superficie torácica, los cuales permiten analizar los SR en las principales regiones pulmonares sin utilizar un número elevado de sensores . Nuestro método también incluye una maniobra respiratoria progresiva con flujo variable que permite analizar los SR en función del flujo respiratorio. También proponemos el análisis combinado de los SR normales y los sonidos adventicios continuos (SAC), mediante las curvas intensidad-flujo y un espectro de Hilbert (EH) adaptado a las características de los SR, respectivamente. El EH propuesto representa un avance importante en el análisis de los SAC, pues permite su completa caracterización en términos de duración, frecuencia media e intensidad. Además, la alta resolución temporal y frecuencial y la alta concentración de energía de esta versión mejorada del EH permiten caracterizar los SAC de forma más precisa que utilizando el espectrograma, el cual ha sido la técnica más utilizada para el análisis de SAC en estudios previos. Nuestro método de análisis de SR se trasladó a la práctica clínica a través de dos estudios que se iniciaron en el laboratorio de pruebas funcionales del hospital Germans Trias i Pujol, para la evaluación de la función pulmonar en pacientes con parálisis frénica unilateral (PFU) y la respuesta broncodilatadora (RBD) en pacientes con asma. Las señales de SR y flujo respiratorio se registraron en 10 pacientes con PFU, 50 pacientes con asma y 20 controles sanos. El análisis de las curvas intensidad-flujo resultó ser un método apropiado para detectar la PFU , pues encontramos diferencias significativas entre las curvas intensidad-flujo de las bases posteriores de los pulmones en todos los pacientes , mientras que en los controles sanos no encontramos diferencias significativas. Hasta donde sabemos, este es el primer estudio que utiliza el análisis cuantitativo de los SR para evaluar la PFU. En cuanto al asma, encontramos cambios relevantes en las curvas intensidad-flujo yen las características de los SAC tras la broncodilatación en pacientes con RBD negativa en la espirometría. Por lo tanto, sugerimos que el análisis combinado de las curvas intensidad-flujo y las características de los SAC, incluyendo número, duración, frecuencia media e intensidad, es una técnica prometedora para la evaluación de la RBD y la mejora en la estratificación de los distintos niveles de RBD, especialmente en pacientes con RBD negativa en la espirometría. El método innovador de análisis de SR que se propone en esta tesis proporciona una nueva herramienta con una alta sensibilidad para obtener información objetiva y complementaria sobre la función pulmonar de una forma sencilla y no invasiva. Junto con la espirometría, este método puede tener una aplicación clínica directa en la mejora de la evaluación de la función pulmonar en pacientes con enfermedades respiratoriasAward-winningPostprint (published version

    Deep sleep: deep learning methods for the acoustic analysis of sleep-disordered breathing

    Get PDF
    Sleep-disordered breathing (SDB) is a serious and prevalent condition that results from the collapse of the upper airway during sleep, which leads to oxygen desaturations, unphysiological variations in intrathoracic pressure, and sleep fragmentation. Its most common form is obstructive sleep apnoea (OSA). This has a big impact on quality of life, and is associated with cardiovascular morbidity. Polysomnography, the gold standard for diagnosing SDB, is obtrusive, time-consuming and expensive. Alternative diagnostic approaches have been proposed to overcome its limitations. In particular, acoustic analysis of sleep breathing sounds offers an unobtrusive and inexpensive means to screen for SDB, since it displays symptoms with unique acoustic characteristics. These include snoring, loud gasps, chokes, and absence of breathing. This thesis investigates deep learning methods, which have revolutionised speech and audio technology, to robustly screen for SDB in typical sleep conditions using acoustics. To begin with, the desirable characteristics for an acoustic corpus of SDB, and the acoustic definition of snoring are considered to create corpora for this study. Then three approaches are developed to tackle increasingly complex scenarios. Firstly, with the aim of leveraging a large amount of unlabelled SDB data, unsupervised learning is applied to learn novel feature representations with deep neural networks for the classification of SDB events such as snoring. The incorporation of contextual information to assist the classifier in producing realistic event durations is investigated. Secondly, the temporal pattern of sleep breathing sounds is exploited using convolutional neural networks to screen participants sleeping by themselves for OSA. The integration of acoustic features with physiological data for screening is examined. Thirdly, for the purpose of achieving robustness to bed partner breathing sounds, recurrent neural networks are used to screen a subject and their bed partner for SDB in the same session. Experiments conducted on the constructed corpora show that the developed systems accurately classify SDB events, screen for OSA with high sensitivity and specificity, and screen a subject and their bed partner for SDB with encouraging performance. In conclusion, this thesis makes promising progress in improving access to SDB diagnosis through low-cost and non-invasive methods

    Biometrics

    Get PDF
    Biometrics uses methods for unique recognition of humans based upon one or more intrinsic physical or behavioral traits. In computer science, particularly, biometrics is used as a form of identity access management and access control. It is also used to identify individuals in groups that are under surveillance. The book consists of 13 chapters, each focusing on a certain aspect of the problem. The book chapters are divided into three sections: physical biometrics, behavioral biometrics and medical biometrics. The key objective of the book is to provide comprehensive reference and text on human authentication and people identity verification from both physiological, behavioural and other points of view. It aims to publish new insights into current innovations in computer systems and technology for biometrics development and its applications. The book was reviewed by the editor Dr. Jucheng Yang, and many of the guest editors, such as Dr. Girija Chetty, Dr. Norman Poh, Dr. Loris Nanni, Dr. Jianjiang Feng, Dr. Dongsun Park, Dr. Sook Yoon and so on, who also made a significant contribution to the book
    corecore