Search CORE

1,118 research outputs found

Plausibility of a Neural Network Classifier-Based Neuroprosthesis for Depression Detection via Laughter Records

Author: Castellanos A.
del Moral R.
Fernandez Rosell M.
Lahoz-Beltra R.
Marijuan P.C.
Navarro J.
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2019
Field of study

The present work explores the diagnostic performance for depression of neural network classifiers analyzing the sound structures of laughter as registered from clinical patients and healthy controls. The main methodological novelty of this work is that simple sound variables of laughter are used as inputs, instead of electrophysiological signals or local field potentials (LFPs) or spoken language utterances, which are the usual protocols up-to-date. In the present study, involving 934 laughs from 30 patients and 20 controls, four different neural networks models were tested for sensitivity analysis, and were additionally trained for depression detection. Some elementary sound variables were extracted from the records: timing, fundamental frequency mean, first three formants, average power, and the Shannon-Wiener entropy. In the results obtained, two of the neural networks show a diagnostic discrimination capability of 93.02 and 91.15% respectively, while the third and fourth ones have an 87.96 and 82.40% percentage of success. Remarkably, entropy turns out to be a fundamental variable to distinguish between patients and controls, and this is a significant factor which becomes essential to understand the deep neurocognitive relationships between laughter and depression. In biomedical terms, our neural network classifier-based neuroprosthesis opens up the possibility of applying the same methodology to other mental-health and neuropsychiatric pathologies. Indeed, exploring the application of laughter in the early detection and prognosis of Alzheimer and Parkinson would represent an enticing possibility, both from the biomedical and the computational points of view

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Repositorio Universidad de Zaragoza

A Comprehensive and Versatile Multimodal Deep Learning Approach for Predicting Diverse Properties of Advanced Materials

Author: Hata Kenji
Miki Yasuaki
Muroga Shun
Publication venue
Publication date: 28/03/2023
Field of study

We present a multimodal deep learning (MDL) framework for predicting physical properties of a 10-dimensional acrylic polymer composite material by merging physical attributes and chemical data. Our MDL model comprises four modules, including three generative deep learning models for material structure characterization and a fourth model for property prediction. Our approach handles an 18-dimensional complexity, with 10 compositional inputs and 8 property outputs, successfully predicting 913,680 property data points across 114,210 composition conditions. This level of complexity is unprecedented in computational materials science, particularly for materials with undefined structures. We propose a framework to analyze the high-dimensional information space for inverse material design, demonstrating flexibility and adaptability to various materials and scales, provided sufficient data is available. This study advances future research on different materials and the development of more sophisticated models, drawing us closer to the ultimate goal of predicting all properties of all materials.Comment: 38 pages, 17 figures, 1 tabl

arXiv.org e-Print Archive

Beyond the noise : high fidelity MR signal processing

Author: Masuda Reika
Publication venue: 'American Psychological Association (APA)'
Publication date: 01/01/2018
Field of study

This thesis describes a variety of methods developed to increase the sensitivity and resolution of liquid state nuclear magnetic resonance (NMR) experiments. NMR is known as one of the most versatile non-invasive analytical techniques yet often suffers from low sensitivity. The main contribution to this low sensitivity issue is a presence of noise and level of noise in the spectrum is expressed numerically as “signal-to-noise ratio”. NMR signal processing involves sensitivity and resolution enhancement achieved by noise reduction using mathematical algorithms. A singular value decomposition based reduced rank matrix method, composite property mapping, in particular is studied extensively in this thesis to present its advantages, limitations, and applications. In theory, when the sum of k noiseless sinusoidal decays is formatted into a specific matrix form (i.e., Toeplitz), the matrix is known to possess k linearly independent columns. This information becomes apparent only after a singular value decomposition of the matrix. Singular value decomposition factorises the large matrix into three smaller submatrices: right and left singular vector matrices, and one diagonal matrix containing singular values. Were k noiseless sinusoidal decays involved, there would be only k nonzero singular values appearing in the diagonal matrix in descending order providing the information of the amplitude of each sinusoidal decay. The number of non-zero singular values or the number of linearly independent columns is known as the rank of the matrix. With real NMR data none of the singular values equals zero and the matrix has full rank. The reduction of the rank of the matrix and thus the noise in the reconstructed NMR data can be achieved by replacing all the singular values except the first k values with zeroes. This noise reduction process becomes difficult when biomolecular NMR data is to be processed due to the number of resonances being unknown and the presence of a large solvent peak

Western Sydney ResearchDirect

Learning and mining from personal digital archives

Author: Li Na
Publication venue: Dublin City University. Scientific Computing and Complex Systems Modelling (Sci-Sym)
Publication date: 01/03/2020
Field of study

Given the explosion of new sensing technologies, data storage has become significantly cheaper and consequently, people increasingly rely on wearable devices to create personal digital archives. Lifelogging is the act of recording aspects of life in digital format for a variety of purposes such as aiding human memory, analysing human lifestyle and diet monitoring. In this dissertation we are concerned with Visual Lifelogging, a form of lifelogging based on the passive capture of photographs by a wearable camera. Cameras, such as Microsoft's SenseCam can record up to 4,000 images per day as well as logging data from several incorporated sensors. Considering the volume, complexity and heterogeneous nature of such data collections, it is a signifcant challenge to interpret and extract knowledge for the practical use of lifeloggers and others. In this dissertation, time series analysis methods have been used to identify and extract useful information from temporal lifelogging images data, without benefit of prior knowledge. We focus, in particular, on three fundamental topics: noise reduction, structure and characterization of the raw data; the detection of multi-scale patterns; and the mining of important, previously unknown repeated patterns in the time series of lifelog image data. Firstly, we show that Detrended Fluctuation Analysis (DFA) highlights the feature of very high correlation in lifelogging image collections. Secondly, we show that study of equal-time Cross-Correlation Matrix demonstrates atypical or non-stationary characteristics in these images. Next, noise reduction in the Cross-Correlation Matrix is addressed by Random Matrix Theory (RMT) before Wavelet multiscaling is used to characterize the `most important' or `unusual' events through analysis of the associated dynamics of the eigenspectrum. A motif discovery technique is explored for detection of recurring and recognizable episodes of an individual's image data. Finally, we apply these motif discovery techniques to two known lifelog data collections, All I Have Seen (AIHS) and NTCIR-12 Lifelog, in order to examine multivariate recurrent patterns of multiple-lifelogging users

Irish Universities

DCU Online Research Access Service

Automatic Emotion Recognition: Quantifying Dynamics and Structure in Human Behavior.

Author: Kim Yelin
Publication venue
Publication date: 01/01/2016
Field of study

Emotion is a central part of human interaction, one that has a huge influence on its overall tone and outcome. Today's human-centered interactive technology can greatly benefit from automatic emotion recognition, as the extracted affective information can be used to measure, transmit, and respond to user needs. However, developing such systems is challenging due to the complexity of emotional expressions and their dynamics in terms of the inherent multimodality between audio and visual expressions, as well as the mixed factors of modulation that arise when a person speaks. To overcome these challenges, this thesis presents data-driven approaches that can quantify the underlying dynamics in audio-visual affective behavior. The first set of studies lay the foundation and central motivation of this thesis. We discover that it is crucial to model complex non-linear interactions between audio and visual emotion expressions, and that dynamic emotion patterns can be used in emotion recognition. Next, the understanding of the complex characteristics of emotion from the first set of studies leads us to examine multiple sources of modulation in audio-visual affective behavior. Specifically, we focus on how speech modulates facial displays of emotion. We develop a framework that uses speech signals which alter the temporal dynamics of individual facial regions to temporally segment and classify facial displays of emotion. Finally, we present methods to discover regions of emotionally salient events in a given audio-visual data. We demonstrate that different modalities, such as the upper face, lower face, and speech, express emotion with different timings and time scales, varying for each emotion type. We further extend this idea into another aspect of human behavior: human action events in videos. We show how transition patterns between events can be used for automatically segmenting and classifying action events. Our experimental results on audio-visual datasets show that the proposed systems not only improve performance, but also provide descriptions of how affective behaviors change over time. We conclude this dissertation with the future directions that will innovate three main research topics: machine adaptation for personalized technology, human-human interaction assistant systems, and human-centered multimedia content analysis.PhDElectrical Engineering: SystemsUniversity of Michigan, Horace H. Rackham School of Graduate Studieshttp://deepblue.lib.umich.edu/bitstream/2027.42/133459/1/yelinkim_1.pd

Deep Blue Documents at the University of Michigan