1,207 research outputs found

    Deep learning techniques for biological signal processing: Automatic detection of dolphin sounds

    Given the heterogeneous underwater acoustic transmission context, detecting and distinguishing cetacean vocalizations remains a challenging area of recent interest. Machine learning algorithms are a promising avenue for improving current detection systems. In particular, Convolutional Neural Networks (CNNs) are considered one of the most promising deep learning techniques, since they have already excelled in problems involving the automatic processing of biological sounds. Human-annotated spectrograms can be used to teach CNNs to discriminate patterns in the time-frequency domain, thus enabling the detection and classification of marine mammal sounds. However, despite these promising capabilities, machine learning suffers from a lack of labeled data, which calls for transfer learning to build accurate models even when few human annotators are available. In this thesis, we developed a dolphin whistle detection framework based on deep learning models. In particular, we investigated the performance of a large-scale pre-trained model (VGG16) and compared it with that of a vanilla Convolutional Neural Network and several baselines (logistic regression and Support Vector Machines). The pre-trained VGG16 model achieved the best detection performance, with an accuracy of 98.9% on a held-out test dataset.

    Modeling and rendering for development of a virtual bone surgery system

    A virtual bone surgery system is developed to provide a realistic, safe, and controllable environment for surgical education. It can be used for training in orthopedic surgery, as well as for planning and rehearsal of bone surgery procedures...Using the developed system, the user can perform virtual bone surgery by simultaneously seeing bone material removal through a graphic display device, feeling the force via a haptic device, and hearing the sound of tool-bone interaction. --Abstract, page iii

    Deep Learning for Enhanced Fault Diagnosis of Monoblock Centrifugal Pumps: Spectrogram-Based Analysis

    The reliable operation of monoblock centrifugal pumps (MCP) is crucial in various industrial applications. Achieving optimal performance and minimizing costly downtime requires effective detection and diagnosis of faults in critical pump components. This study proposes an approach based on deep transfer learning. An accelerometer captures the vibration signals emitted by the pump. These signals are then converted into spectrogram images, which serve as the input to a deep learning classifier that identifies and diagnoses pump faults. To evaluate the proposed methodology, 15 pre-trained networks were employed: ResNet-50, InceptionV3, GoogLeNet, DenseNet-201, ShuffleNet, VGG-19, MobileNet-v2, InceptionResNetV2, VGG-16, NasNetmobile, EfficientNetb0, AlexNet, ResNet-18, Xception and ResNet101. The execution time of the classification process was also evaluated. AlexNet exhibited the highest accuracy among the pre-trained networks, achieving 100.00% accuracy with an execution (training) time of 17 s. This research provides insight into applying deep transfer learning to fault detection and diagnosis in MCP, for which pre-trained networks offer an efficient and precise solution. The findings have the potential to enhance the reliability and maintenance practices of MCP in various industrial settings.
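
    The signal-to-spectrogram step described in the abstract above can be illustrated with a minimal sketch. It is not the authors' pipeline: the sampling rate, frame length, hop size and test tone are all hypothetical, and a naive DFT in plain Python is used only to keep the example self-contained (a library routine such as `scipy.signal.spectrogram` would normally be used instead).

```python
import cmath
import math


def stft_magnitude(signal, frame_len=64, hop=32):
    """Return one magnitude spectrum (bins 0..frame_len//2) per frame."""
    # Hann window to reduce spectral leakage at the frame edges.
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / (frame_len - 1))
              for n in range(frame_len)]
    frames = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        chunk = [signal[start + n] * window[n] for n in range(frame_len)]
        spectrum = []
        for k in range(frame_len // 2 + 1):
            # Naive DFT of the windowed frame at frequency bin k.
            acc = sum(chunk[n] * cmath.exp(-2j * math.pi * k * n / frame_len)
                      for n in range(frame_len))
            spectrum.append(abs(acc))
        frames.append(spectrum)
    return frames


# Hypothetical stand-in for an accelerometer recording: a pure 128 Hz tone.
fs = 1024        # assumed sampling rate in Hz
tone_hz = 128    # assumed vibration frequency in Hz
signal = [math.sin(2 * math.pi * tone_hz * t / fs) for t in range(512)]

spec = stft_magnitude(signal)
# Strongest bin of the first frame, converted back to Hz.
peak_bin = max(range(len(spec[0])), key=lambda k: spec[0][k])
peak_hz = peak_bin * fs / 64
```

    Each row of `spec` is one time slice; stacking the rows and mapping magnitudes to pixel intensities gives the kind of image a pre-trained network could take as input.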

    Automatic Emotion Recognition from Mandarin Speech


    Guidage non-intrusif d'un bras robotique à l'aide d'un bracelet myoélectrique à électrode sèche (Non-intrusive guidance of a robotic arm using a dry-electrode myoelectric armband)

    For several years, robotics has been seen as a key solution for improving the quality of life of people living with upper-limb disabilities. To be easily integrated into everyday life, new smart prostheses must be non-intrusive, reliable and inexpensive. Surface electromyography (sEMG) provides an intuitive interface, based on a user's muscle activity, for interacting with robots. However, despite extensive research on sEMG signal classification, current classifiers still lack reliability because they are not robust to short-term noise (e.g. small electrode displacement, muscle fatigue) or long-term noise (e.g. changes in muscle mass and adipose tissue). In practice, this means that, to remain useful, a classifier must be periodically re-calibrated, a time-consuming process. The goal of my research project is to propose a human-robot myoelectric interface based on transfer learning and domain adaptation algorithms to increase the long-term reliability of the system while reducing the intrusiveness (in terms of hardware and preparation time) of this kind of system. The non-intrusive aspect is achieved with a dry-electrode armband featuring ten channels. This armband, named the 3DC Armband, was designed by us (Dr. Gabriel Gagnon-Turcotte, my co-directors and myself) and was built during my doctorate. At the time of writing, the 3DC Armband offers the best performance among currently available dry-electrode sEMG armbands. Unlike gel-based electrodes, which require intrusive skin preparation (i.e. shaving, cleaning the skin and applying conductive gel), the 3DC Armband can simply be placed on the forearm without any preparation. However, this ease of use reduces the quality of the recorded information, because the signal recorded by dry electrodes is inherently noisier than that recorded by gel-based ones. In addition, other systems use invasive methods (intramuscular electromyography) to capture a cleaner signal and reduce sources of noise (e.g. electrode shift). To remedy the degradation of information that results from the non-intrusiveness of the armband, this research project relies on deep learning, and more specifically on convolutional networks. The research project was divided into three phases. The first is the design of a classifier that recognizes hand gestures in real time. The second is the implementation of a transfer learning algorithm that takes advantage of data recorded across multiple users, thereby improving the system's accuracy for a new user while decreasing the preparation time required to use the system. The third is the development and implementation of domain adaptation and self-supervised learning algorithms to enhance the classifier's robustness to long-term changes.

    Multichannel analysis of normal and continuous adventitious respiratory sounds for the assessment of pulmonary function in respiratory diseases

    UPC Extraordinary Doctoral Award, academic year 2015-2016, Industrial Engineering field. Postprint (published version).
    Respiratory sounds (RS) are produced by turbulent airflows through the airways and are inhomogeneously transmitted through different media to the chest surface, where they can be recorded in a non-invasive way. Due to their mechanical nature and airflow dependence, RS are affected by respiratory diseases that alter the mechanical properties of the respiratory system. Therefore, RS provide useful clinical information about the structure and functioning of the respiratory system. Recent advances in sensors and signal processing techniques have made RS analysis a more objective and sensitive tool for measuring pulmonary function. However, RS analysis is still rarely used in clinical practice. The lack of a standard methodology for recording and processing RS has led to several different approaches to RS analysis, with methodological issues that could limit its potential in clinical practice (i.e., measurements with a low number of sensors; uncontrolled airflows, constant airflows, or forced expiratory manoeuvres; no co-analysis of different types of RS; or the use of inaccurate techniques for processing RS signals). In this thesis, we propose a novel integrated approach to RS analysis that includes multichannel recording of RS using a maximum of five microphones placed over the trachea and the chest surface, which allows RS to be analysed at the most commonly reported lung regions without requiring a large number of sensors. Our approach also includes a progressive respiratory manoeuvre with variable airflow, which allows RS to be analysed as a function of airflow. A dual analysis of both normal RS and continuous adventitious sounds (CAS) is also proposed. Normal RS are analysed through the RS intensity–airflow curves, whereas CAS are analysed through a customised Hilbert spectrum (HS), adapted to RS signal characteristics. The proposed HS represents a step forward in the analysis of CAS, since it allows CAS to be fully characterised with regard to duration, mean frequency, and intensity. Further, the high temporal and frequency resolutions and the high concentration of energy of this improved version of the HS allow CAS to be characterised more accurately than with the spectrogram, which has been the most widely used technique for CAS analysis. Our approach to RS analysis was put into clinical practice by launching two studies in the Pulmonary Function Testing Laboratory of the Germans Trias i Pujol University Hospital, for assessing pulmonary function in patients with unilateral phrenic paralysis (UPP) and bronchodilator response (BDR) in patients with asthma. RS and airflow signals were recorded in 10 patients with UPP, 50 patients with asthma, and 20 healthy participants. The analysis of RS intensity–airflow curves proved to be a successful method for detecting UPP, since we found significant differences between these curves at the posterior base of the lungs in all patients, whereas no differences were found in the healthy participants. To the best of our knowledge, this is the first study that uses a quantitative analysis of RS for assessing UPP. Regarding asthma, we found appreciable changes in the RS intensity–airflow curves and CAS features after bronchodilation in patients with negative BDR in spirometry. Therefore, we suggest that the combined analysis of RS intensity–airflow curves and CAS features (number, duration, mean frequency, and intensity) is a promising technique for assessing BDR and improving the stratification of BDR levels, particularly among patients with negative BDR in spirometry. The novel approach to RS analysis developed in this thesis provides a sensitive tool for obtaining objective and complementary information about pulmonary function in a simple and non-invasive way. Together with spirometry, this approach to RS analysis could have a direct clinical application for improving the assessment of pulmonary function in patients with respiratory diseases.
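
    The abstract above does not say how its customised Hilbert spectrum is computed, so the following is only a sketch of the basic building block usually associated with Hilbert-based analysis: forming the analytic signal and reading off an instantaneous frequency. The sampling rate and test tone are hypothetical, the helper names are illustrative, and a naive DFT in plain Python is used for self-containment (a routine such as `scipy.signal.hilbert` would normally be used).

```python
import cmath
import math


def dft(x, inverse=False):
    """Naive discrete Fourier transform (or its inverse) of a sequence."""
    n = len(x)
    sign = 1 if inverse else -1
    out = [sum(x[m] * cmath.exp(sign * 2j * math.pi * k * m / n)
               for m in range(n))
           for k in range(n)]
    return [v / n for v in out] if inverse else out


def analytic_signal(x):
    """Analytic signal of a real sequence of even length."""
    n = len(x)
    spectrum = dft(x)
    # Keep DC and Nyquist, double positive frequencies, zero negative ones.
    gain = [0.0] * n
    gain[0] = 1.0
    gain[n // 2] = 1.0
    for k in range(1, n // 2):
        gain[k] = 2.0
    return dft([s * g for s, g in zip(spectrum, gain)], inverse=True)


def instantaneous_frequency(x, fs):
    """Instantaneous frequency in Hz from sample-to-sample phase steps."""
    z = analytic_signal(x)
    freqs = []
    for a, b in zip(z, z[1:]):
        dphi = cmath.phase(b * a.conjugate())  # phase increment in (-pi, pi]
        freqs.append(dphi * fs / (2 * math.pi))
    return freqs


# Hypothetical stand-in for a continuous sound: a pure 100 Hz tone.
fs = 1000       # assumed sampling rate in Hz
tone_hz = 100   # assumed tone frequency in Hz
x = [math.cos(2 * math.pi * tone_hz * t / fs) for t in range(200)]

inst = instantaneous_frequency(x, fs)
mean_hz = sum(inst) / len(inst)
```

    Unlike a spectrogram, whose frequency resolution is fixed by the frame length, this estimate is available at every sample; `abs(z)` from the same analytic signal gives the matching instantaneous amplitude.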

    Signals and Images in Sea Technologies

    Life below water is the 14th Sustainable Development Goal (SDG) envisaged by the United Nations and is aimed at conserving and sustainably using the oceans, seas, and marine resources for sustainable development. It is not difficult to argue that signals and image technologies may play an essential role in achieving the foreseen targets linked to SDG 14. Besides increasing the general knowledge of ocean health by means of data analysis, methodologies based on signal and image processing can be helpful in environmental monitoring, in protecting and restoring ecosystems, in finding new sensor technologies for green routing and eco-friendly ships, in providing tools for implementing best practices for sustainable fishing, as well as in defining frameworks and intelligent systems for enforcing sea law and making the sea a safer and more secure place. Imaging is also a key element for the exploration of the underwater world for various scopes, ranging from the predictive maintenance of sub-sea pipelines and other infrastructure projects, to the discovery, documentation, and protection of sunken cultural heritage. The scope of this Special Issue encompasses investigations into techniques and ICT approaches and, in particular, the study and application of signal- and image-based methods and, in turn, exploration of the advantages of their application in the previously mentioned areas