1,186 research outputs found
Speech-based automatic depression detection via biomarkers identification and artificial intelligence approaches
Depression has become one of the most prevalent mental health issues, affecting more than 300 million people all over the world. However, due to factors such as limited medical resources and accessibility to health care, there are still a large number of patients undiagnosed. In addition, the traditional approaches to depression diagnosis have limitations because they are usually time-consuming, and depend on clinical experience that varies across different clinicians. From this perspective, the use of automatic depression detection can make the diagnosis process much faster and more accessible. In this thesis, we present the possibility of using speech for automatic depression detection. This is based on the findings in neuroscience that depressed patients have abnormal cognition mechanisms thus leading to the speech differs from that of healthy people.
Therefore, in this thesis, we show two ways of benefiting from automatic depression detection, i.e., identifying speech markers of depression and constructing novel deep learning models to improve detection accuracy.
The identification of speech markers tries to capture measurable depression traces left in speech. From this perspective, speech markers such as speech duration, pauses and correlation matrices are proposed. Speech duration and pauses take speech fluency into account, while correlation matrices represent the relationship between acoustic features and aim at capturing psychomotor retardation in depressed patients. Experimental results demonstrate that these proposed markers are effective at improving the performance in recognizing depressed speakers. In addition, such markers show statistically significant differences between depressed patients and non-depressed individuals, which explains the possibility of using these markers for depression detection and further confirms that depression leaves detectable traces in speech.
In addition to the above, we propose an attention mechanism, Multi-local Attention (MLA), to emphasize depression-relevant information locally. Then we analyse the effectiveness of MLA on performance and efficiency. According to the experimental results, such a model can significantly improve performance and confidence in the detection while reducing the time required for recognition. Furthermore, we propose Cross-Data Multilevel Attention (CDMA) to emphasize different types of depression-relevant information, i.e., specific to each type of speech and common to both, by using multiple attention mechanisms. Experimental results demonstrate that the proposed model is effective to integrate different types of depression-relevant information in speech, improving the performance significantly for depression detection
The legibility of the imaged human brain
Our knowledge of the organisation of the human brain at the population-level
is yet to translate into power to predict functional differences at the
individual-level, limiting clinical applications, and casting doubt on the
generalisability of inferred mechanisms. It remains unknown whether the
difficulty arises from the absence of individuating biological patterns within
the brain, or from limited power to access them with the models and compute at
our disposal. Here we comprehensively investigate the resolvability of such
patterns with data and compute at unprecedented scale. Across 23810 unique
participants from UK Biobank, we systematically evaluate the predictability of
25 individual biological characteristics, from all available combinations of
structural and functional neuroimaging data. Over 4526 GPU*hours of
computation, we train, optimize, and evaluate out-of-sample 700 individual
predictive models, including multilayer perceptrons of demographic,
psychological, serological, chronic morbidity, and functional connectivity
characteristics, and both uni- and multi-modal 3D convolutional neural network
models of macro- and micro-structural brain imaging. We find a marked
discrepancy between the high predictability of sex (balanced accuracy 99.7%),
age (mean absolute error 2.048 years, R2 0.859), and weight (mean absolute
error 2.609Kg, R2 0.625), for which we set new state-of-the-art performance,
and the surprisingly low predictability of other characteristics. Neither
structural nor functional imaging predicted individual psychology better than
the coincidence of common chronic morbidity (p<0.05). Serology predicted common
morbidity (p<0.05) and was best predicted by it (p<0.001), followed by
structural neuroimaging (p<0.05). Our findings suggest either more informative
imaging or more powerful models will be needed to decipher individual level
characteristics from the brain.Comment: 36 pages, 6 figures, 1 table, 2 supplementary figure
Modular lifelong machine learning
Deep learning has drastically improved the state-of-the-art in many important fields, including computer vision and natural language processing (LeCun et al., 2015). However, it is expensive to train a deep neural network on a machine learning problem. The overall training cost further increases when one wants to solve additional problems. Lifelong machine learning (LML) develops algorithms that aim to efficiently learn to solve a sequence of problems, which become available one at a time. New problems are solved with less resources by transferring previously learned knowledge. At the same time, an LML algorithm needs to retain good performance on all encountered problems, thus avoiding catastrophic forgetting. Current approaches do not possess all the desired properties of an LML algorithm. First, they primarily focus on preventing catastrophic forgetting (Diaz-Rodriguez et al., 2018; Delange et al., 2021). As a result, they neglect some knowledge transfer properties. Furthermore, they assume that all problems in a sequence share the same input space. Finally, scaling these methods to a large sequence of problems remains a challenge.
Modular approaches to deep learning decompose a deep neural network into sub-networks, referred to as modules. Each module can then be trained to perform an atomic transformation, specialised in processing a distinct subset of inputs. This modular approach to storing knowledge makes it easy to only reuse the subset of modules which are useful for the task at hand.
This thesis introduces a line of research which demonstrates the merits of a modular approach to lifelong machine learning, and its ability to address the aforementioned shortcomings of other methods. Compared to previous work, we show that a modular approach can be used to achieve more LML properties than previously demonstrated. Furthermore, we develop tools which allow modular LML algorithms to scale in order to retain said properties on longer sequences of problems.
First, we introduce HOUDINI, a neurosymbolic framework for modular LML. HOUDINI represents modular deep neural networks as functional programs and accumulates a library of pre-trained modules over a sequence of problems. Given a new problem, we use program synthesis to select a suitable neural architecture, as well as a high-performing combination of pre-trained and new modules. We show that our approach has most of the properties desired from an LML algorithm. Notably, it can perform forward transfer, avoid negative transfer and prevent catastrophic forgetting, even across problems with disparate input domains and problems which require different neural architectures.
Second, we produce a modular LML algorithm which retains the properties of HOUDINI but can also scale to longer sequences of problems. To this end, we fix the choice of a neural architecture and introduce a probabilistic search framework, PICLE, for searching through different module combinations. To apply PICLE, we introduce two probabilistic models over neural modules which allows us to efficiently identify promising module combinations.
Third, we phrase the search over module combinations in modular LML as black-box optimisation, which allows one to make use of methods from the setting of hyperparameter optimisation (HPO). We then develop a new HPO method which marries a multi-fidelity approach with model-based optimisation. We demonstrate that this leads to improvement in anytime performance in the HPO setting and discuss how this can in turn be used to augment modular LML methods.
Overall, this thesis identifies a number of important LML properties, which have not all been attained in past methods, and presents an LML algorithm which can achieve all of them, apart from backward transfer
Deep learning for unsupervised domain adaptation in medical imaging: Recent advancements and future perspectives
Deep learning has demonstrated remarkable performance across various tasks in
medical imaging. However, these approaches primarily focus on supervised
learning, assuming that the training and testing data are drawn from the same
distribution. Unfortunately, this assumption may not always hold true in
practice. To address these issues, unsupervised domain adaptation (UDA)
techniques have been developed to transfer knowledge from a labeled domain to a
related but unlabeled domain. In recent years, significant advancements have
been made in UDA, resulting in a wide range of methodologies, including feature
alignment, image translation, self-supervision, and disentangled representation
methods, among others. In this paper, we provide a comprehensive literature
review of recent deep UDA approaches in medical imaging from a technical
perspective. Specifically, we categorize current UDA research in medical
imaging into six groups and further divide them into finer subcategories based
on the different tasks they perform. We also discuss the respective datasets
used in the studies to assess the divergence between the different domains.
Finally, we discuss emerging areas and provide insights and discussions on
future research directions to conclude this survey.Comment: Under Revie
Merging Deep Learning with Expert Knowledge for Seizure Onset Zone localization from rs-fMRI in Pediatric Pharmaco Resistant Epilepsy
Surgical disconnection of Seizure Onset Zones (SOZs) at an early age is an
effective treatment for Pharmaco-Resistant Epilepsy (PRE). Pre-surgical
localization of SOZs with intra-cranial EEG (iEEG) requires safe and effective
depth electrode placement. Resting-state functional Magnetic Resonance Imaging
(rs-fMRI) combined with signal decoupling using independent component (IC)
analysis has shown promising SOZ localization capability that guides iEEG lead
placement. However, SOZ ICs identification requires manual expert sorting of
100s of ICs per patient by the surgical team which limits the reproducibility
and availability of this pre-surgical screening. Automated approaches for SOZ
IC identification using rs-fMRI may use deep learning (DL) that encodes
intricacies of brain networks from scarcely available pediatric data but has
low precision, or shallow learning (SL) expert rule-based inference approaches
that are incapable of encoding the full spectrum of spatial features. This
paper proposes DeepXSOZ that exploits the synergy between DL based spatial
feature and SL based expert knowledge encoding to overcome performance
drawbacks of these strategies applied in isolation. DeepXSOZ is an
expert-in-the-loop IC sorting technique that a) can be configured to either
significantly reduce expert sorting workload or operate with high sensitivity
based on expertise of the surgical team and b) can potentially enable the usage
of rs-fMRI as a low cost outpatient pre-surgical screening tool. Comparison
with state-of-art on 52 children with PRE shows that DeepXSOZ achieves
sensitivity of 89.79%, precision of 93.6% and accuracy of 84.6%, and reduces
sorting effort by 6.7-fold. Knowledge level ablation studies show a pathway
towards maximizing patient outcomes while optimizing the machine-expert
collaboration for various scenarios.Comment: This paper is currently under review in IEEE Journa
Cerebrovascular dysfunction in cerebral small vessel disease
INTRODUCTION:
Cerebral small vessel disease (SVD) is the cause of a quarter of all ischaemic strokes and is postulated to have a role in up to half of all dementias. SVD pathophysiology remains unclear but cerebrovascular dysfunction may be important. If confirmed many licensed medications have mechanisms of action targeting vascular function, potentially enabling new treatments via drug repurposing. Knowledge is limited however, as most studies assessing cerebrovascular dysfunction are small, single centre, single imaging modality studies due to the complexities in measuring cerebrovascular dysfunctions in humans. This thesis describes the development and application of imaging techniques measuring several cerebrovascular dysfunctions to investigate SVD pathophysiology and trial medications that may improve small blood vessel function in SVD.
METHODS:
Participants with minor ischaemic strokes were recruited to a series of studies utilising advanced MRI techniques to measure cerebrovascular dysfunction. Specifically MRI scans measured the ability of different tissues in the brain to change blood flow in response to breathing carbon dioxide (cerebrovascular reactivity; CVR) and the flow and pulsatility through the cerebral arteries, venous sinuses and CSF spaces. A single centre observational study optimised and established feasibility of the techniques and tested associations of cerebrovascular dysfunctions with clinical and imaging phenotypes. Then a randomised pilot clinical trial tested two medications’ (cilostazol and isosorbide mononitrate) ability to improve CVR and pulsatility over a period of eight weeks. The techniques were then expanded to include imaging of blood brain barrier permeability and utilised in multi-centre studies investigating cerebrovascular dysfunction in both sporadic and monogenetic SVDs.
RESULTS:
Imaging protocols were feasible, consistently being completed with usable data in over 85% of participants. After correcting for the effects of age, sex and systolic blood pressure, lower CVR was associated with higher white matter hyperintensity volume, Fazekas score and perivascular space counts. Lower CVR was associated with higher pulsatility of blood flow in the superior sagittal sinus and lower CSF flow stroke volume at the foramen magnum. Cilostazol and isosorbide mononitrate increased CVR in white matter. The CVR, intra-cranial flow and pulsatility techniques, alongside blood brain barrier permeability and microstructural integrity imaging were successfully employed in a multi-centre observational study. A clinical trial assessing the effects of drugs targeting blood pressure variability is nearing completion.
DISCUSSION:
Cerebrovascular dysfunction in SVD has been confirmed and may play a more direct role in disease pathogenesis than previously established risk factors. Advanced imaging measures assessing cerebrovascular dysfunction are feasible in multi-centre studies and trials. Identifying drugs that improve cerebrovascular dysfunction using these techniques may be useful in selecting candidates for definitive clinical trials which require large sample sizes and long follow up periods to show improvement against outcomes of stroke and dementia incidence and cognitive function
Tensor-variate machine learning on graphs
Traditional machine learning algorithms are facing significant challenges as the world enters the era of big data, with a dramatic expansion in volume and range of applications and an increase in the variety of data sources. The large- and multi-dimensional nature of data often increases the computational costs associated with their processing and raises the risks of model over-fitting - a phenomenon known as the curse of dimensionality. To this end, tensors have become a subject of great interest in the data analytics community, owing to their remarkable ability to super-compress high-dimensional data into a low-rank format, while retaining the original data structure and interpretability. This leads to a significant reduction in computational costs, from an exponential complexity to a linear one in the data dimensions.
An additional challenge when processing modern big data is that they often reside on irregular domains and exhibit relational structures, which violates the regular grid assumptions of traditional machine learning models. To this end, there has been an increasing amount of research in generalizing traditional learning algorithms to graph data. This allows for the processing of graph signals while accounting for the underlying relational structure, such as user interactions in social networks, vehicle flows in traffic networks, transactions in supply chains, chemical bonds in proteins, and trading data in financial networks, to name a few.
Although promising results have been achieved in these fields, there is a void in literature when it comes to the conjoint treatment of tensors and graphs for data analytics. Solutions in this area are increasingly urgent, as modern big data is both large-dimensional and irregular in structure. To this end, the goal of this thesis is to explore machine learning methods that can fully exploit the advantages of both tensors and graphs. In particular, the following approaches are introduced: (i) Graph-regularized tensor regression framework for modelling high-dimensional data while accounting for the underlying graph structure; (ii) Tensor-algebraic approach for computing efficient convolution on graphs; (iii) Graph tensor network framework for designing neural learning systems which is both general enough to describe most existing neural network architectures and flexible enough to model large-dimensional data on any and many irregular domains. The considered frameworks were employed in several real-world applications, including air quality forecasting, protein classification, and financial modelling. Experimental results validate the advantages of the proposed methods, which achieved better or comparable performance against state-of-the-art models. Additionally, these methods benefit from increased interpretability and reduced computational costs, which are crucial for tackling the challenges posed by the era of big data.Open Acces
Self-administered transcranial direct current stimulation treatment of knee osteoarthritis alters pain-related fNIRS connectivity networks
Epub 2023 Mar 31Significance: Knee osteoarthritis (OA) is a disease that causes chronic pain in the elderly population. Currently, OA is mainly treated pharmacologically with analgesics, although research has shown that neuromodulation via transcranial direct current stimulation (tDCS) may be beneficial in reducing pain in clinical settings. However, no studies have reported the effects of home-based self-administered tDCS on functional brain networks in older adults with knee OA.
Aim: We used functional near-infrared spectroscopy (fNIRS) to investigate the functional connectivity effects of tDCS on underlying pain processing mechanisms at the central nervous level in older adults with knee OA.
Approach: Pain-related brain connectivity networks were extracted using fNIRS at baseline and for three consecutive weeks of treatment from 120 subjects randomly assigned to two groups undergoing active tDCS and sham tDCS.
Results: Our results showed that the tDCS intervention significantly modulated pain-related connectivity correlation only in the group receiving active treatment. We also found that only the active treatment group showed a significantly reduced number and strength of functional connections evoked during nociception in the prefrontal cortex, primary motor (M1), and primary somatosensory (S1) cortices. To our knowledge, this is the first study in which the effect of tDCS on pain-related connectivity networks is investigated using fNIRS.
Conclusions: fNIRS-based functional connectivity can be effectively used to investigate neural circuits of pain at the cortical level in association with nonpharmacological, self-administered tDCS treatment.S.M.H. and L.P. would like to acknowledge the support of the National Science Foundation
(Grant Nos. CNS 1650536 and 2137255) and I/UCRC for Building Reliable Advances and
Innovation in Neurotechnology. LP also acknowledges the U.S. Fulbright Scholar Program and
the Fulbright Spain Commission for sponsoring his stay at the Basque Center on Cognition,
Brain and Language. The research reported in this publication was supported by the National
Institute of Nursing Research of the National Institutes of Health (Award No. R15NR018050)
An Analytical Performance Evaluation on Multiview Clustering Approaches
The concept of machine learning encompasses a wide variety of different approaches, one of which is called clustering. The data points are grouped together in this approach to the problem. Using a clustering method, it is feasible, given a collection of data points, to classify each data point as belonging to a specific group. This can be done if the algorithm is given the collection of data points. In theory, data points that constitute the same group ought to have attributes and characteristics that are equivalent to one another, however data points that belong to other groups ought to have properties and characteristics that are very different from one another. The generation of multiview data is made possible by recent developments in information collecting technologies. The data were collected from à variety of sources and were analysed using a variety of perspectives. The data in question are what are known as multiview data. On a single view, the conventional clustering algorithms are applied. In spite of this, real-world data are complicated and can be clustered in a variety of different ways, depending on how the data are interpreted. In practise, the real-world data are messy. In recent years, Multiview Clustering, often known as MVC, has garnered an increasing amount of attention due to its goal of utilising complimentary and consensus information derived from different points of view. On the other hand, the vast majority of the systems that are currently available only enable the single-clustering scenario, whereby only makes utilization of a single cluster to split the data. This is the case since there is only one cluster accessible. In light of this, it is absolutely necessary to carry out investigation on the multiview data format. The study work is centred on multiview clustering and how well it performs compared to these other strategies
Gut-brain interactions affecting metabolic health and central appetite regulation in diabetes, obesity and aging
The central aim of this thesis was to study the effects of gut microbiota on host energy metabolism and central regulation of appetite. We specifically studied the interaction between gut microbiota-derived short-chain fatty acids (SCFAs), postprandial glucose metabolism and central regulation of appetite. In addition, we studied probable determinants that affect this interaction, specifically: host genetics, bariatric surgery, dietary intake and hypoglycemic medication.First, we studied the involvement of microbiota-derived short-chain fatty acids in glucose tolerance. In an observational study we found an association of intestinal availability of SCFAs acetate and butyrate with postprandial insulin and glucose responses. Hereafter, we performed a clinical trial, administering acetate intravenously at a constant rate and studied the effects on glucose tolerance and central regulation of appetite. The acetate intervention did not have a significant effect on these outcome measures, suggesting the association between increased gastrointestinal SCFAs and metabolic health, as observed in the observational study, is not paralleled when inducing acute plasma elevations.Second, we looked at other determinants affecting gut-brain interactions in metabolic health and central appetite signaling. Therefore, we studied the relation between the microbiota and central appetite regulation in identical twin pairs discordant for BMI. Second, we studied the relation between microbial composition and post-surgery gastrointestinal symptoms upon bariatric surgery. Third, we report the effects of increased protein intake on host microbiota composition and central regulation of appetite. Finally, we explored the effects of combination therapy with GLP-1 agonist exenatide and SGLT2 inhibitor dapagliflozin on brain responses to food stimuli
- …