47 research outputs found

    Supervised Contrastive Learning with Nearest Neighbor Search for Speech Emotion Recognition

    Full text link
    Speech Emotion Recognition (SER) is a challenging task due to limited data and blurred boundaries of certain emotions. In this paper, we present a comprehensive approach to improve the SER performance throughout the model lifecycle, including pre-training, fine-tuning, and inference stages. To address the data scarcity issue, we utilize a pre-trained model, wav2vec2.0. During fine-tuning, we propose a novel loss function that combines cross-entropy loss with supervised contrastive learning loss to improve the model's discriminative ability. This approach increases the inter-class distances and decreases the intra-class distances, mitigating the issue of blurred boundaries. Finally, to leverage the improved distances, we propose an interpolation method at the inference stage that combines the model prediction with the output from a k-nearest neighbors model. Our experiments on IEMOCAP demonstrate that our proposed methods outperform current state-of-the-art results.Comment: Accepted by lnterspeech 2023, poste

    Zero-shot stance detection based on cross-domain feature enhancement by contrastive learning

    Full text link
    Zero-shot stance detection is challenging because it requires detecting the stance of previously unseen targets in the inference phase. The ability to learn transferable target-invariant features is critical for zero-shot stance detection. In this work, we propose a stance detection approach that can efficiently adapt to unseen targets, the core of which is to capture target-invariant syntactic expression patterns as transferable knowledge. Specifically, we first augment the data by masking the topic words of sentences, and then feed the augmented data to an unsupervised contrastive learning module to capture transferable features. Then, to fit a specific target, we encode the raw texts as target-specific features. Finally, we adopt an attention mechanism, which combines syntactic expression patterns with target-specific features to obtain enhanced features for predicting previously unseen targets. Experiments demonstrate that our model outperforms competitive baselines on four benchmark datasets

    In vitro expression and analysis of the 826 human G protein-coupled receptors

    Get PDF
    ABSTRACT G protein-coupled receptors (GPCRs) are involved in all human physiological systems where they are responsible for transducing extracellular signals into cells. GPCRs signal in response to a diverse array of stimuli including light, hormones, and lipids, where these signals affect downstream cascades to impact both health and disease states. Yet, despite their importance as therapeutic targets, detailed molecular structures of only 30 GPCRs have been determined to date. A key challenge to their structure determination is adequate protein expression. Here we report the quantification of protein expression in an insect cell expression system for all 826 human GPCRs using two different fusion constructs. Expression characteristics are analyzed in aggregate and among each of the five distinct subfamilies. These data can be used to identify trends related to GPCR expression between different fusion constructs and between different GPCR families, and to prioritize lead candidates for future structure determination feasibility

    Zero-Delay Joint Source Channel Coding for a Bivariate Gaussian Source over the Broadcast Channel with One-Bit ADC Front Ends

    No full text
    In this work, we consider the zero-delay transmission of bivariate Gaussian sources over a Gaussian broadcast channel with one-bit analog-to-digital converter (ADC) front ends. An outer bound on the conditional distortion region is derived. Focusing on the minimization of the average distortion, two types of methods are proposed to design nonparametric mappings. The first one is based on the joint optimization between the encoder and decoder with the use of an iterative algorithm. In the second method, we derive the necessary conditions to develop the optimal encoder numerically. Using these necessary conditions, an algorithm based on gradient descent search is designed. Subsequently, the characteristics of the optimized encoding mapping structure are discussed, and inspired by which, several parametric mappings are proposed. Numerical results show that the proposed parametric mappings outperform the uncoded scheme and previous parametric mappings for broadcast channels with infinite resolution ADC front ends. The nonparametric mappings succeed in outperforming the parametric mappings. The causes for the differences between the performances of two nonparametric mappings are analyzed. The average distortions of the parametric and nonparametric mappings proposed here are close to the bound for the cases with one-bit ADC front ends in low channel signal-to-noise ratio regions

    Research and implementation of license plate recognition based on android platform

    Get PDF
    This paper studies and optimizes license plate location and recognition in license plate recognition. A license plate recognition system based on Android platform is designed and implemented. Opencv and Tesseract OCR are integrated in Android studio environment. The license plate number is located by combining Laplace algorithm and HSV model. On the basis of fully understanding the principle of Tesseract OCR recognition, a large number of training pictures are generated by license plate number simulation generator, and license plate character library is generated by using jtessboxeditor tool, which realizes offline recognition of license plate number

    Research and implementation of license plate recognition based on android platform

    No full text
    This paper studies and optimizes license plate location and recognition in license plate recognition. A license plate recognition system based on Android platform is designed and implemented. Opencv and Tesseract OCR are integrated in Android studio environment. The license plate number is located by combining Laplace algorithm and HSV model. On the basis of fully understanding the principle of Tesseract OCR recognition, a large number of training pictures are generated by license plate number simulation generator, and license plate character library is generated by using jtessboxeditor tool, which realizes offline recognition of license plate number
    corecore