3,976 research outputs found

    Undergraduate Catalog of Studies, 2023-2024

    Get PDF

    Speech-based automatic depression detection via biomarkers identification and artificial intelligence approaches

    Get PDF
    Depression has become one of the most prevalent mental health issues, affecting more than 300 million people all over the world. However, due to factors such as limited medical resources and accessibility to health care, there are still a large number of patients undiagnosed. In addition, the traditional approaches to depression diagnosis have limitations because they are usually time-consuming, and depend on clinical experience that varies across different clinicians. From this perspective, the use of automatic depression detection can make the diagnosis process much faster and more accessible. In this thesis, we present the possibility of using speech for automatic depression detection. This is based on the findings in neuroscience that depressed patients have abnormal cognition mechanisms thus leading to the speech differs from that of healthy people. Therefore, in this thesis, we show two ways of benefiting from automatic depression detection, i.e., identifying speech markers of depression and constructing novel deep learning models to improve detection accuracy. The identification of speech markers tries to capture measurable depression traces left in speech. From this perspective, speech markers such as speech duration, pauses and correlation matrices are proposed. Speech duration and pauses take speech fluency into account, while correlation matrices represent the relationship between acoustic features and aim at capturing psychomotor retardation in depressed patients. Experimental results demonstrate that these proposed markers are effective at improving the performance in recognizing depressed speakers. In addition, such markers show statistically significant differences between depressed patients and non-depressed individuals, which explains the possibility of using these markers for depression detection and further confirms that depression leaves detectable traces in speech. In addition to the above, we propose an attention mechanism, Multi-local Attention (MLA), to emphasize depression-relevant information locally. Then we analyse the effectiveness of MLA on performance and efficiency. According to the experimental results, such a model can significantly improve performance and confidence in the detection while reducing the time required for recognition. Furthermore, we propose Cross-Data Multilevel Attention (CDMA) to emphasize different types of depression-relevant information, i.e., specific to each type of speech and common to both, by using multiple attention mechanisms. Experimental results demonstrate that the proposed model is effective to integrate different types of depression-relevant information in speech, improving the performance significantly for depression detection

    Meta-learning algorithms and applications

    Get PDF
    Meta-learning in the broader context concerns how an agent learns about their own learning, allowing them to improve their learning process. Learning how to learn is not only beneficial for humans, but it has also shown vast benefits for improving how machines learn. In the context of machine learning, meta-learning enables models to improve their learning process by selecting suitable meta-parameters that influence the learning. For deep learning specifically, the meta-parameters typically describe details of the training of the model but can also include description of the model itself - the architecture. Meta-learning is usually done with specific goals in mind, for example trying to improve ability to generalize or learn new concepts from only a few examples. Meta-learning can be powerful, but it comes with a key downside: it is often computationally costly. If the costs would be alleviated, meta-learning could be more accessible to developers of new artificial intelligence models, allowing them to achieve greater goals or save resources. As a result, one key focus of our research is on significantly improving the efficiency of meta-learning. We develop two approaches: EvoGrad and PASHA, both of which significantly improve meta-learning efficiency in two common scenarios. EvoGrad allows us to efficiently optimize the value of a large number of differentiable meta-parameters, while PASHA enables us to efficiently optimize any type of meta-parameters but fewer in number. Meta-learning is a tool that can be applied to solve various problems. Most commonly it is applied for learning new concepts from only a small number of examples (few-shot learning), but other applications exist too. To showcase the practical impact that meta-learning can make in the context of neural networks, we use meta-learning as a novel solution for two selected problems: more accurate uncertainty quantification (calibration) and general-purpose few-shot learning. Both are practically important problems and using meta-learning approaches we can obtain better solutions than the ones obtained using existing approaches. Calibration is important for safety-critical applications of neural networks, while general-purpose few-shot learning tests model's ability to generalize few-shot learning abilities across diverse tasks such as recognition, segmentation and keypoint estimation. More efficient algorithms as well as novel applications enable the field of meta-learning to make more significant impact on the broader area of deep learning and potentially solve problems that were too challenging before. Ultimately both of them allow us to better utilize the opportunities that artificial intelligence presents

    Self-supervised learning for transferable representations

    Get PDF
    Machine learning has undeniably achieved remarkable advances thanks to large labelled datasets and supervised learning. However, this progress is constrained by the labour-intensive annotation process. It is not feasible to generate extensive labelled datasets for every problem we aim to address. Consequently, there has been a notable shift in recent times toward approaches that solely leverage raw data. Among these, self-supervised learning has emerged as a particularly powerful approach, offering scalability to massive datasets and showcasing considerable potential for effective knowledge transfer. This thesis investigates self-supervised representation learning with a strong focus on computer vision applications. We provide a comprehensive survey of self-supervised methods across various modalities, introducing a taxonomy that categorises them into four distinct families while also highlighting practical considerations for real-world implementation. Our focus thenceforth is on the computer vision modality, where we perform a comprehensive benchmark evaluation of state-of-the-art self supervised models against many diverse downstream transfer tasks. Our findings reveal that self-supervised models often outperform supervised learning across a spectrum of tasks, albeit with correlations weakening as tasks transition beyond classification, particularly for datasets with distribution shifts. Digging deeper, we investigate the influence of data augmentation on the transferability of contrastive learners, uncovering a trade-off between spatial and appearance-based invariances that generalise to real-world transformations. This begins to explain the differing empirical performances achieved by self-supervised learners on different downstream tasks, and it showcases the advantages of specialised representations produced with tailored augmentation. Finally, we introduce a novel self-supervised pre-training algorithm for object detection, aligning pre-training with downstream architecture and objectives, leading to reduced localisation errors and improved label efficiency. In conclusion, this thesis contributes a comprehensive understanding of self-supervised representation learning and its role in enabling effective transfer across computer vision tasks

    UMSL Bulletin 2023-2024

    Get PDF
    The 2023-2024 Bulletin and Course Catalog for the University of Missouri St. Louis.https://irl.umsl.edu/bulletin/1088/thumbnail.jp

    Deep Learning Techniques for Electroencephalography Analysis

    Get PDF
    In this thesis we design deep learning techniques for training deep neural networks on electroencephalography (EEG) data and in particular on two problems, namely EEG-based motor imagery decoding and EEG-based affect recognition, addressing challenges associated with them. Regarding the problem of motor imagery (MI) decoding, we first consider the various kinds of domain shifts in the EEG signals, caused by inter-individual differences (e.g. brain anatomy, personality and cognitive profile). These domain shifts render multi-subject training a challenging task and impede robust cross-subject generalization. We build a two-stage model ensemble architecture and propose two objectives to train it, combining the strengths of curriculum learning and collaborative training. Our subject-independent experiments on the large datasets of Physionet and OpenBMI, verify the effectiveness of our approach. Next, we explore the utilization of the spatial covariance of EEG signals through alignment techniques, with the goal of learning domain-invariant representations. We introduce a Riemannian framework that concurrently performs covariance-based signal alignment and data augmentation, while training a convolutional neural network (CNN) on EEG time-series. Experiments on the BCI IV-2a dataset show that our method performs superiorly over traditional alignment, by inducing regularization to the weights of the CNN. We also study the problem of EEG-based affect recognition, inspired by works suggesting that emotions can be expressed in relative terms, i.e. through ordinal comparisons between different affective state levels. We propose treating data samples in a pairwise manner to infer the ordinal relation between their corresponding affective state labels, as an auxiliary training objective. We incorporate our objective in a deep network architecture which we jointly train on the tasks of sample-wise classification and pairwise ordinal ranking. We evaluate our method on the affective datasets of DEAP and SEED and obtain performance improvements over deep networks trained without the additional ranking objective

    Face Emotion Recognition Based on Machine Learning: A Review

    Get PDF
    Computers can now detect, understand, and evaluate emotions thanks to recent developments in machine learning and information fusion. Researchers across various sectors are increasingly intrigued by emotion identification, utilizing facial expressions, words, body language, and posture as means of discerning an individual's emotions. Nevertheless, the effectiveness of the first three methods may be limited, as individuals can consciously or unconsciously suppress their true feelings. This article explores various feature extraction techniques, encompassing the development of machine learning classifiers like k-nearest neighbour, naive Bayesian, support vector machine, and random forest, in accordance with the established standard for emotion recognition. The paper has three primary objectives: firstly, to offer a comprehensive overview of effective computing by outlining essential theoretical concepts; secondly, to describe in detail the state-of-the-art in emotion recognition at the moment; and thirdly, to highlight important findings and conclusions from the literature, with an emphasis on important obstacles and possible future paths, especially in the creation of state-of-the-art machine learning algorithms for the identification of emotions

    Specificity Determining Features at the Interface of Biomolecular Complexes as Regulators of Biological Functions

    Get PDF
    Amino acid residues at the biomolecular interface play essential roles in many biological and cellular processes; relevant to this thesis, protein-protein interactions regulate signaling pathways and enzymatic activity, whereas protein-DNA interactions control gene expression, and protein-peptide interactions are central to the immune system. Biomolecular recognition and binding stability are largely determined by residues at the molecular interface. In this thesis, we focused on three biological datasets that are related to humans and human health: 1) dysregulated citrullination in the inflamed joints of rheumatoid arthritis patients, 2) a novel family of PRD-like transcription factors critical to the first few cell divisions in human life, and 3) epitopes that likely activate a cytotoxic T cell-mediated immune response against SARS-CoV-2 infection. For each dataset, in order to study the structural and functional consequences of molecular interactions, we applied a wide range of bioinformatics techniques to analyze sequences, structures and biological data retrieved from various databases, as well as taking into account experimental results from collaborators and from the literature. In rheumatoid arthritis, normally cytoplasmic peptidylarginine deiminase (PAD) enzymes citrullinate arginine residues in extracellular matrix (ECM) proteins. To examine specificity determining features that regulate the citrullination activity, we analyzed the sequence and structure data of the ECM proteins that were found citrullinated in chronically inflamed human joints. For citrullination, we found that an arginine side chain needs to be exposed to solvent but can arise from ÎČ-strands, α-helices, loops and ÎČ-turns. Moreover, there is no sequence motif linked to enzymatic activity. In addition, we studied the effect of citrullination on proteins important for a normal ECM, focusing on integrin binding to fibronectin and transforming growth factor-ÎČ (TGF-ÎČ). Citrullination of these proteins was found to inhibit cell attachment and spreading since PAD-treatment of the isoDGR motif in fibronectin and the RGD motif in TGF-ÎČ significantly reduced their binding with integrin αVÎČ3 and αVÎČ6, respectively. The expression of the human paired (PRD)-like transcription factors (TFs) are limited to the period of embryonic genome activation up to the 8-cell stage. We identified that one of these PRD-like TFs, LEUTX, binds to a TAATCC sequence motif. Sequence comparisons revealed that LEUTX protein is comprised of two domains: the DNA-binding homeodomain and a Leutx domain containing a transactivation domain. We identified specificity determining residues in the LEUTX homeodomain that are important for recognition of the TAATCC-containing 36 bp DNA motif enriched in genes involved in embryonic genome activation. We demonstrated using molecular models why a heterozygotic missense mutation A54V at the DNA-specificity determining position of LEUTX has significantly reduced overall transcriptional activity, as well as why the double mutant – I47T and A54V – form of LEUTX restores binding to the DNA motif similarly to that seen in the I47T mutation alone. At the onset of the COVID-19 pandemic we sought to understand the molecular factors that trigger the cytotoxic T cell-mediated immune response against the SARS-CoV-2 virus, taking advantage of binding data and 3D structures for related viruses and other pathogenic organisms. We first predicted the MHC class I (MHC-I)-specific immunogenic epitopes of length 8- to 11 amino acids from the SARS-CoV-2 proteins. Next, we predicted that the 9-mer epitopes would have the highest potential to elicit a strong immune response. For experimental validation, the predicted 9-mer epitopes were matched with the SARS-CoV-derived epitopes that are known to elicit an effective T cell response in vitro. Furthermore, our observations provide a structural explanation for the binding of SARS-CoV-2 epitopes to MHC-I molecules, identifying conserved immunogenic epitopes essential for understanding the pathogenesis of COVID-19. The three investigated datasets were made in concert with collaborative experimental studies and/or considering publicly available experimental data. The experimental studies generally provided the starting point for the in silico studies, which in turn had the objective of providing a detailed explanation of the experimental results. Furthermore, the in silico results could be used to devise novel and focused experiments, suggesting that bioinformatics predictions and wet-laboratory experimental investigations optimally take place with multiple advantages. Overall, this thesis demonstrates the synergy that is possible by applying this interdisciplinary approach to understanding the consequences of molecular interactions.Aminosyror i kontaktytan mellan olika biomolekyler spelar en viktig roll i mĂ„nga biologiska och cellulĂ€ra processer; relevanta interaktioner för den hĂ€r avhandlingen Ă€r protein-protein interaktioner som reglerar signaleringsrutter och enzymatisk aktivitet, protein-DNA interaktioner som kontrollerar genexpression, samt protein-peptid interaktioner som har en central roll i immunförsvaret. BiomolekylĂ€r igenkĂ€nning och bindningsstabilitet beror till stor del pĂ„ de aminosyror som finns i den molekylĂ€ra kontaktytan. I den hĂ€r avhandlingen fokuserade vi pĂ„ tre biologiska dataset som Ă€r relaterade till mĂ€nniskor och mĂ€nniskors hĂ€lsa: 1) felreglerad citrullinering i inflammerade leder hos patienter med reumatoid artrit, 2) en nyupptĂ€ckt familj av PRD (human paired)-lika transkriptionsfaktorer som Ă€r nödvĂ€ndiga för de första celldelningarna i mĂ€nniskolivet, och 3) epitoper som troligen aktiverar en cytotoxisk T-cell-förmedlad immunrespons mot SARS-CoV-2 infektioner. För att studera de strukturella och funktionella konsekvenserna av de molekylĂ€ra interaktionerna i varje dataset, anvĂ€ndes en mĂ€ngd olika bioinformatiska tekniker för att analysera sekvenser, strukturer och biologiska data frĂ„n olika databaser och dessutom beaktades experimentella resultat frĂ„n samarbetspartners och frĂ„n litteraturen. I reumatoid artrit citrullinerar vanligen PAD (cytoplasmatiska peptidyl arginin deiminas)-enzymer arginin-aminosyror i proteiner i det extracellulĂ€ra matrixet (ECM). För att undersöka egenskaper som avgör specificiteten hos citrullineringsaktiviteten analyserade vi sekvens- och strukturdata för ECM-proteiner som blir citrullinerade i kroniskt inflammerade leder hos mĂ€nniskor. Vi upptĂ€ckte att en argininsidokedja mĂ„ste vara i kontakt med det omgivande lösningsmedlet för att kunna citrullineras, att de kan finnas i beta-strĂ€ngar, alfa-helixar och beta-svĂ€ngar, samt att det inte finns nĂ„gra sekvensmotiv som Ă€r kopplade till enzymatisk aktivitet. Utöver detta studerade vi effekten av citrullinering pĂ„ proteiner som Ă€r viktiga för normal extracellulĂ€r matrix, med fokus pĂ„ integrinbinding till fibronektin och TGF-ÎČ (transforming growth factor-ÎČ). Citrullinering av dessa proteiner upptĂ€cktes inhibera cellvidhĂ€ftning och spridning eftersom PAD-behandling av isoDGR-motivet i fibronektin och RGD-motivet i TGF-ÎČ ordentligt reducerar deras bindning till integrin αVÎČ3 och αVÎČ6, respektive. ExpressionsnivĂ„erna av PRD-lika transkriptionsfaktorer (TF) Ă€r begrĂ€nsade till perioden av zygotens genomaktivering upp till 8-cells stadiet. Vi identifierade att en av dessa PRD-lika transkriptionsfaktorer, LEUTX, binder till ett TAATCC sekvensmotiv. SekvensjĂ€mförelser avslöjade att LEUTX proteinet bestĂ„r av tvĂ„ domĂ€ner, det DNA-bindande homeodomĂ€net och en leutx-domĂ€n som innehĂ„ller en transaktiveringsdomĂ€n. Vi identifierade specificitetsbestĂ€mmande aminosyror i LEUTX homeodomĂ€nen som Ă€r viktiga för igenkĂ€nning av TAATCC-innehĂ„llande 36 baspars DNA-motivet som Ă€r berikad med gener involverade i zygotens genomaktivering. Vi anvĂ€nde molekylĂ€ra modeller för att visa varför en heterozygotisk missense-mutation, A54V, i DNA-specificitetsbestĂ€mmande positionen i LEUTX har ordentligt minskad generell transkriptionsaktivitet, och varför dubbelmutanten I47T och A54V Ă„terstĂ€ller bindning till DNA-motivet pĂ„ samma sĂ€tt som observerats i enbart I47T mutationen. NĂ€r COVID-19 pandemin inleddes försökte vi förstĂ„ de molekylĂ€ra faktorer som startar den cytotoxiska T-cell-förmedlade immunresponsen mot SARS-CoV-2 viruset, genom att utnyttja bindningsdata och 3D strukturer för relaterade virus och andra patogena organismer. Vi förutspĂ„dde först MHC klass I (MHC-I)-specifika immunogena epitoper av lĂ€ngden 8 till 11 aminosyror frĂ„n SARS-CoV-2 proteiner. DĂ€refter förutspĂ„dde vi att epitoper bestĂ„ende av 9 aminosyror hade den högsta potentialen att orsaka en stark immunrespons. För experimentell validering matchades de 9 aminosyror lĂ„nga epitoperna med epitoper frĂ„n SARS-CoV som man vet att orsakar en effektiv T-cell respons in vitro. VĂ„ra observationer bidrar ocksĂ„ med en strukturell förklaring för bindningen av SARS-CoV-2 epitoper till MHC-I molekyler, vilket identifierar konserverade immunogena epitoper som Ă€r nödvĂ€ndiga för att förstĂ„r patogenesen hos COVID-19. De tre undersökta dataseten gjordes i samarbete med experimentella studier och/eller genom att ta allmĂ€nt tillgĂ€ngliga experimentella data i beaktande. De experimentella studierna gav en startpunkt för in silico-studierna, vilka i sin tur hade som mĂ„l att ge en detaljerad förklaring till de experimentella resultaten. In silico-resultaten kan ocksĂ„ anvĂ€ndas för att utveckla nya och fokuserade experiment, vilket indikerar att bioinformatiska förutspĂ„elser och experimentella studier optimalt sker med mĂ„nga fördelar. Över lag visar denna avhandling synergin som Ă€r möjlig genom att anvĂ€nda detta interdisciplinĂ€ra arbetssĂ€tt för att förstĂ„ konsekvenserna av molekylĂ€ra interaktioner

    Spatio-Temporal AU Relational Graph Representation Learning For Facial Action Units Detection

    Full text link
    This paper presents our Facial Action Units (AUs) recognition submission to the fifth Affective Behavior Analysis in-the-wild Competition (ABAW). Our approach consists of three main modules: (i) a pre-trained facial representation encoder which produce a strong facial representation from each input face image in the input sequence; (ii) an AU-specific feature generator that specifically learns a set of AU features from each facial representation; and (iii) a spatio-temporal graph learning module that constructs a spatio-temporal graph representation. This graph representation describes AUs contained in all frames and predicts the occurrence of each AU based on both the modeled spatial information within the corresponding face and the learned temporal dynamics among frames. The experimental results show that our approach outperformed the baseline and the spatio-temporal graph representation learning allows our model to generate the best results among all ablated systems. Our model ranks at the 4th place in the AU recognition track at the 5th ABAW Competition

    UMSL Bulletin 2022-2023

    Get PDF
    The 2022-2023 Bulletin and Course Catalog for the University of Missouri St. Louis.https://irl.umsl.edu/bulletin/1087/thumbnail.jp
    • 

    corecore