2,215 research outputs found

    Convolutional neural network for breathing phase detection in lung sounds

    Get PDF
    We applied deep learning to create an algorithm for breathing phase detection in lung sound recordings, and we compared the breathing phases detected by the algorithm and manually annotated by two experienced lung sound researchers. Our algorithm uses a convolutional neural network with spectrograms as the features, removing the need to specify features explicitly. We trained and evaluated the algorithm using three subsets that are larger than previously seen in the literature. We evaluated the performance of the method using two methods. First, discrete count of agreed breathing phases (using 50% overlap between a pair of boxes), shows a mean agreement with lung sound experts of 97% for inspiration and 87% for expiration. Second, the fraction of time of agreement (in seconds) gives higher pseudo-kappa values for inspiration (0.73-0.88) than expiration (0.63-0.84), showing an average sensitivity of 97% and an average specificity of 84%. With both evaluation methods, the agreement between the annotators and the algorithm shows human level performance for the algorithm. The developed algorithm is valid for detecting breathing phases in lung sound recordings

    Automatic wheeze detection based on auditory modelling

    Get PDF
    Automatic wheeze detection has several potential benefits compared with reliance on human auscultation: it is experience independent, an automated historical record can easily be kept, and it allows quantification of wheeze severity. Previous attempts to detect wheezes automatically have had partial success but have not been reliable enough to become widely accepted as a useful tool. In this paper an improved algorithm for automatic wheeze detection based on auditory modelling is developed, called the frequency- and duration-dependent threshold algorithm. The mean frequency and duration of each wheeze component are obtained automatically. The detected wheezes are marked on a spectrogram. In the new algorithm, the concept of a frequency- and duration-dependent threshold for wheeze detection is introduced. Another departure from previous work is that the threshold is based not on global power but on power corresponding to a particular frequency range. The algorithm has been tested on 36 subjects, 11 of whom exhibited characteristics of wheeze. The results show a marked improvement in the accuracy of wheeze detection when compared with previous algorithms

    Measurement and analysis of breath sounds

    Get PDF
    Existing breath sound measurement systems and possible new methods have been critically investigated. The frequency response of each part of the measurement system has been studied. Emphasis has been placed on frequency response of acoustic sensors; especially, a method to study a diaphragm type air-coupler in contact use has been proposed. Two new methods of breath sounds measurement have been studied: laser Doppler vibrometer and mobile phones. It has been shown that these two methods can find applications in breath sounds measurement, however there are some restrictions. A reliable automatic wheeze detection algorithm based on auditory modelling has been developed. That is the human’s auditory system is modelled as a bank of band pass filters, in which the bandwidths are frequency dependent. Wheezes are treated as signals additive to normal breath sounds (masker). Thus wheeze is detectable when it is above the masking threshold. This new algorithm has been validated using simulated and real data. It is superior to previous algorithms, being more reliable to detect wheezes and less prone to mistakes. Simulation of cardiorespiratory sounds and wheeze audibility tests have been developed. Simulated breath sounds can be used as a training tool, as well as an evaluation method. These simulations have shown that, under certain circumstance, there are wheezes but they are inaudible. It is postulated that this could also happen in real measurements. It has been shown that simulated sounds with predefined characteristics can be used as an objective method to evaluate automatic algorithms. Finally, the efficiency and necessity of heart sounds reduction procedures has been investigated. Based on wavelet decomposition and selective synthesis, heart sounds can be reduced with a cost of unnatural breath sounds. Heart sound reduction is shown not to be necessary if a time-frequency representation is used, as heart sounds have a fixed pattern in the time-frequency plane

    Doctor of Philosophy

    Get PDF
    dissertationPatients sometimes suffer apnea during sedation procedures or after general anesthesia. Apnea presents itself in two forms: respiratory depression (RD) and respiratory obstruction (RO). During RD the patients' airway is open but they lose the drive to breathe. During RO the patients' airway is occluded while they try to breathe. Patients' respiration is rarely monitored directly, but in a few cases is monitored with a capnometer. This dissertation explores the feasibility of monitoring respiration indirectly using an acoustic sensor. In addition to detecting apnea in general, this technique has the possibility of differentiating between RD and RO. Data were recorded on 24 subjects as they underwent sedation. During the sedation, subjects experienced RD or RO. The first part of this dissertation involved detecting periods of apnea from the recorded acoustic data. A method using a parameter estimation algorithm to determine the variance of the noise of the audio signal was developed, and the envelope of the audio data was used to determine when the subject had stopped breathing. Periods of apnea detected by the acoustic method were compared to the periods of apnea detected by the direct flow measurement. This succeeded with 91.8% sensitivity and 92.8% specificity in the training set and 100% sensitivity and 98% specificity in the testing set. The second part of this dissertation used the periods during which apnea was detected to determine if the subject was experiencing RD or RO. The classifications determined from the acoustic signal were compared to the classifications based on the flow measurement in conjunction with the chest and abdomen movements. This did not succeed with a 86.9% sensitivity and 52.6% specificity in the training set, and 100% sensitivity and 0% specificity in the testing set. The third part of this project developed a method to reduce the background sounds that were commonly recorded on the microphone. Additive noise was created to simulate noise generated in typical settings and the noise was removed via an adaptive filter. This succeeded in improving or maintaining apnea detection given the different types of sounds added to the breathing data

    Analysis of Respiratory Sounds: State of the Art

    Get PDF
    Objective This paper describes state of the art, scientific publications and ongoing research related to the methods of analysis of respiratory sounds. Methods and material Review of the current medical and technological literature using Pubmed and personal experience. Results The study includes a description of the various techniques that are being used to collect auscultation sounds, a physical description of known pathologic sounds for which automatic detection tools were developed. Modern tools are based on artificial intelligence and on technics such as artificial neural networks, fuzzy systems, and genetic algorithms… Conclusion The next step will consist in finding new markers so as to increase the efficiency of decision aid algorithms and tools

    Improved Breath Phase and Continuous Adventitious Sound Detection in Lung and Tracheal Sound Using Mixed Set Training and Domain Adaptation

    Full text link
    Previously, we established a lung sound database, HF_Lung_V2 and proposed convolutional bidirectional gated recurrent unit (CNN-BiGRU) models with adequate ability for inhalation, exhalation, continuous adventitious sound (CAS), and discontinuous adventitious sound detection in the lung sound. In this study, we proceeded to build a tracheal sound database, HF_Tracheal_V1, containing 11107 of 15-second tracheal sound recordings, 23087 inhalation labels, 16728 exhalation labels, and 6874 CAS labels. The tracheal sound in HF_Tracheal_V1 and the lung sound in HF_Lung_V2 were either combined or used alone to train the CNN-BiGRU models for respective lung and tracheal sound analysis. Different training strategies were investigated and compared: (1) using full training (training from scratch) to train the lung sound models using lung sound alone and train the tracheal sound models using tracheal sound alone, (2) using a mixed set that contains both the lung and tracheal sound to train the models, and (3) using domain adaptation that finetuned the pre-trained lung sound models with the tracheal sound data and vice versa. Results showed that the models trained only by lung sound performed poorly in the tracheal sound analysis and vice versa. However, the mixed set training and domain adaptation can improve the performance of exhalation and CAS detection in the lung sound, and inhalation, exhalation, and CAS detection in the tracheal sound compared to positive controls (lung models trained only by lung sound and vice versa). Especially, a model derived from the mixed set training prevails in the situation of killing two birds with one stone.Comment: To be submitted, 31 pages, 6 figures, 5 table

    Respiratory sound analysis as a diagnosis tool for breathing disorders

    Get PDF
    This paper provides an overview of respiratory sound analysis (RSA) and its functionality as a diagnostic tool for breathing disorders. A number of respiratory conditions and the techniques used to diagnose them, including sleep apnoea, lung sound analysis (LSA), wheeze detection and phase estimation are discussed. The technologies used, from multi-channel bespoke recording systems to using a smart phone application are explained. A new study that focusses on developing a non-invasive tool for the detection and characterisation of inducible laryngeal obstruction (ILO) is presented. ILO is a debilitating condition, caused by malfunctioning structures of the upper airway, commonly triggered by exertion, leaving children feeling out of breath and unable to exercise normally. In rare cases it can lead to critical laryngeal obstruction and admission to intensive care for endotracheal intubation. The current definitive method of diagnosis is by inserting a camera through the nose while the person is exercising. This approach is invasive, uncomfortable (in particular for young children) subjective and relies on the consultant's expertise. There are only a handful of consultants with the appropriate level of expertise in the UK to diagnose this condition

    Benchmarking of eight recurrent neural network variants for breath phase and adventitious sound detection on a self-developed open-access lung sound database-HF_Lung_V1

    Full text link
    A reliable, remote, and continuous real-time respiratory sound monitor with automated respiratory sound analysis ability is urgently required in many clinical scenarios-such as in monitoring disease progression of coronavirus disease 2019-to replace conventional auscultation with a handheld stethoscope. However, a robust computerized respiratory sound analysis algorithm has not yet been validated in practical applications. In this study, we developed a lung sound database (HF_Lung_V1) comprising 9,765 audio files of lung sounds (duration of 15 s each), 34,095 inhalation labels, 18,349 exhalation labels, 13,883 continuous adventitious sound (CAS) labels (comprising 8,457 wheeze labels, 686 stridor labels, and 4,740 rhonchi labels), and 15,606 discontinuous adventitious sound labels (all crackles). We conducted benchmark tests for long short-term memory (LSTM), gated recurrent unit (GRU), bidirectional LSTM (BiLSTM), bidirectional GRU (BiGRU), convolutional neural network (CNN)-LSTM, CNN-GRU, CNN-BiLSTM, and CNN-BiGRU models for breath phase detection and adventitious sound detection. We also conducted a performance comparison between the LSTM-based and GRU-based models, between unidirectional and bidirectional models, and between models with and without a CNN. The results revealed that these models exhibited adequate performance in lung sound analysis. The GRU-based models outperformed, in terms of F1 scores and areas under the receiver operating characteristic curves, the LSTM-based models in most of the defined tasks. Furthermore, all bidirectional models outperformed their unidirectional counterparts. Finally, the addition of a CNN improved the accuracy of lung sound analysis, especially in the CAS detection tasks.Comment: 48 pages, 8 figures. To be submitte

    Multichannel analysis of normal and continuous adventitious respiratory sounds for the assessment of pulmonary function in respiratory diseases

    Get PDF
    Premi extraordinari doctorat UPC curs 2015-2016, àmbit d’Enginyeria IndustrialRespiratory sounds (RS) are produced by turbulent airflows through the airways and are inhomogeneously transmitted through different media to the chest surface, where they can be recorded in a non-invasive way. Due to their mechanical nature and airflow dependence, RS are affected by respiratory diseases that alter the mechanical properties of the respiratory system. Therefore, RS provide useful clinical information about the respiratory system structure and functioning. Recent advances in sensors and signal processing techniques have made RS analysis a more objective and sensitive tool for measuring pulmonary function. However, RS analysis is still rarely used in clinical practice. Lack of a standard methodology for recording and processing RS has led to several different approaches to RS analysis, with some methodological issues that could limit the potential of RS analysis in clinical practice (i.e., measurements with a low number of sensors, no controlled airflows, constant airflows, or forced expiratory manoeuvres, the lack of a co-analysis of different types of RS, or the use of inaccurate techniques for processing RS signals). In this thesis, we propose a novel integrated approach to RS analysis that includes a multichannel recording of RS using a maximum of five microphones placed over the trachea and the chest surface, which allows RS to be analysed at the most commonly reported lung regions, without requiring a large number of sensors. Our approach also includes a progressive respiratory manoeuvres with variable airflow, which allows RS to be analysed depending on airflow. Dual RS analyses of both normal RS and continuous adventitious sounds (CAS) are also proposed. Normal RS are analysed through the RS intensity–airflow curves, whereas CAS are analysed through a customised Hilbert spectrum (HS), adapted to RS signal characteristics. The proposed HS represents a step forward in the analysis of CAS. Using HS allows CAS to be fully characterised with regard to duration, mean frequency, and intensity. Further, the high temporal and frequency resolutions, and the high concentrations of energy of this improved version of HS, allow CAS to be more accurately characterised with our HS than by using spectrogram, which has been the most widely used technique for CAS analysis. Our approach to RS analysis was put into clinical practice by launching two studies in the Pulmonary Function Testing Laboratory of the Germans Trias i Pujol University Hospital for assessing pulmonary function in patients with unilateral phrenic paralysis (UPP), and bronchodilator response (BDR) in patients with asthma. RS and airflow signals were recorded in 10 patients with UPP, 50 patients with asthma, and 20 healthy participants. The analysis of RS intensity–airflow curves proved to be a successful method to detect UPP, since we found significant differences between these curves at the posterior base of the lungs in all patients whereas no differences were found in the healthy participants. To the best of our knowledge, this is the first study that uses a quantitative analysis of RS for assessing UPP. Regarding asthma, we found appreciable changes in the RS intensity–airflow curves and CAS features after bronchodilation in patients with negative BDR in spirometry. Therefore, we suggest that the combined analysis of RS intensity–airflow curves and CAS features—including number, duration, mean frequency, and intensity—seems to be a promising technique for assessing BDR and improving the stratification of BDR levels, particularly among patients with negative BDR in spirometry. The novel approach to RS analysis developed in this thesis provides a sensitive tool to obtain objective and complementary information about pulmonary function in a simple and non-invasive way. Together with spirometry, this approach to RS analysis could have a direct clinical application for improving the assessment of pulmonary function in patients with respiratory diseases.Los sonidos respiratorios (SR) se generan con el paso del flujo de aire a través de las vías respiratorias y se transmiten de forma no homogénea hasta la superficie torácica. Dada su naturaleza mecánica, los SR se ven afectados en gran medida por enfermedades que alteran las propiedades mecánicas del sistema respiratorio. Por lo tanto, los SR proporcionan información clínica relevante sobre la estructura y el funcionamiento del sistema respiratorio. La falta de una metodología estándar para el registro y procesado de los SR ha dado lugar a la aparición de diferentes estrategias de análisis de SR con ciertas limitaciones metodológicas que podrían haber restringido el potencial y el uso de esta técnica en la práctica clínica (medidas con pocos sensores, flujos no controlados o constantes y/o maniobras forzadas, análisis no combinado de distintos tipos de SR o uso de técnicas poco precisas para el procesado de los SR). En esta tesis proponemos un método innovador e integrado de análisis de SR que incluye el registro multicanal de SR mediante un máximo de cinco micrófonos colocados sobre la tráquea yla superficie torácica, los cuales permiten analizar los SR en las principales regiones pulmonares sin utilizar un número elevado de sensores . Nuestro método también incluye una maniobra respiratoria progresiva con flujo variable que permite analizar los SR en función del flujo respiratorio. También proponemos el análisis combinado de los SR normales y los sonidos adventicios continuos (SAC), mediante las curvas intensidad-flujo y un espectro de Hilbert (EH) adaptado a las características de los SR, respectivamente. El EH propuesto representa un avance importante en el análisis de los SAC, pues permite su completa caracterización en términos de duración, frecuencia media e intensidad. Además, la alta resolución temporal y frecuencial y la alta concentración de energía de esta versión mejorada del EH permiten caracterizar los SAC de forma más precisa que utilizando el espectrograma, el cual ha sido la técnica más utilizada para el análisis de SAC en estudios previos. Nuestro método de análisis de SR se trasladó a la práctica clínica a través de dos estudios que se iniciaron en el laboratorio de pruebas funcionales del hospital Germans Trias i Pujol, para la evaluación de la función pulmonar en pacientes con parálisis frénica unilateral (PFU) y la respuesta broncodilatadora (RBD) en pacientes con asma. Las señales de SR y flujo respiratorio se registraron en 10 pacientes con PFU, 50 pacientes con asma y 20 controles sanos. El análisis de las curvas intensidad-flujo resultó ser un método apropiado para detectar la PFU , pues encontramos diferencias significativas entre las curvas intensidad-flujo de las bases posteriores de los pulmones en todos los pacientes , mientras que en los controles sanos no encontramos diferencias significativas. Hasta donde sabemos, este es el primer estudio que utiliza el análisis cuantitativo de los SR para evaluar la PFU. En cuanto al asma, encontramos cambios relevantes en las curvas intensidad-flujo yen las características de los SAC tras la broncodilatación en pacientes con RBD negativa en la espirometría. Por lo tanto, sugerimos que el análisis combinado de las curvas intensidad-flujo y las características de los SAC, incluyendo número, duración, frecuencia media e intensidad, es una técnica prometedora para la evaluación de la RBD y la mejora en la estratificación de los distintos niveles de RBD, especialmente en pacientes con RBD negativa en la espirometría. El método innovador de análisis de SR que se propone en esta tesis proporciona una nueva herramienta con una alta sensibilidad para obtener información objetiva y complementaria sobre la función pulmonar de una forma sencilla y no invasiva. Junto con la espirometría, este método puede tener una aplicación clínica directa en la mejora de la evaluación de la función pulmonar en pacientes con enfermedades respiratoriasAward-winningPostprint (published version

    A software toolkit for acoustic respiratory analysis

    Get PDF
    Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2004.Includes bibliographical references (p. 143-147).Millions of Americans suffer from pulmonary diseases. According to recent statistics, approximately 17 million people suffer from asthma, 16.4 million from chronic obstructive pulmonary disease, 12 million from sleep apnea, and 1.3 million from pneumonia - not to mention the prevalence of many other diseases associated with the lungs. Annually, the mortality attributed to pulmonary diseases exceeds 150,000. Clinical signs of most pulmonary diseases include irregular breathing patterns, the presence of abnormal breath sounds such as wheezes and crackles, and the absence of breathing entirely. Throughout the history of medicine, physicians have always listened for such sounds at the chest wall (or over the trachea) during patient examinations to diagnose pulmonary diseases - a procedure also known as auscultation. Recent advancements in computer technology have made it possible to record, store, and digitally process breath sounds for further analysis. Although automated techniques for lung sound analysis have not been widely employed in the medical field, there has been a growing interest among researchers to use technology to understand the subtler characteristics of lung sounds and their potential correlations with physiological conditions. Based on such correlations, algorithms and tools can be developed to serve as diagnostic aids in both the clinical and non-clinical settings.(cont.) We developed a software toolkit, using MATLAB, to objectively characterize lung sounds. The toolkit includes a respiration detector, respiratory rate detector, respiratory phase onset detector, respiratory phase classifier, crackle and wheeze detectors and characterizers, and a time-scale signal expander. This document provides background on lung sounds, describes and evaluates our analysis techniques, and compares our work to approaches in other diagnostic tools.by Gina Ann Yi.M.Eng
    corecore