909 research outputs found

    FATRER: Full-Attention Topic Regularizer for Accurate and Robust Conversational Emotion Recognition

    Full text link
    This paper concentrates on the understanding of interlocutors' emotions evoked in conversational utterances. Previous studies in this literature mainly focus on more accurate emotional predictions, while ignoring model robustness when the local context is corrupted by adversarial attacks. To maintain robustness while ensuring accuracy, we propose an emotion recognizer augmented by a full-attention topic regularizer, which enables an emotion-related global view when modeling the local context in a conversation. A joint topic modeling strategy is introduced to implement regularization from both representation and loss perspectives. To avoid over-regularization, we drop the constraints on prior distributions that exist in traditional topic modeling and perform probabilistic approximations based entirely on attention alignment. Experiments show that our models obtain more favorable results than state-of-the-art models, and gain convincing robustness under three types of adversarial attacks

    Twitter Sentiment Analysis on Leicester City\u27s Phenomenal 2015/16 EPL Title Winning Season

    Get PDF
    Microblogging has become one of the most useful tools for sharing everyday life events and news, especially popular sporting events, and for expressing opinions about those events. The English Premier League (EPL), the most popular professional soccer league in the world, is talked about on Twitter every day, and the 2015/16 season, whose title underdogs Leicester City managed to win, was one for the history books to remember. As Twitter posts are short and constantly being generated, they are a great source for providing public sentiment towards events that occurred throughout the 2015/16 EPL season. In this project, we examine the effectiveness of machine learning and text sentiment analysis on classifying the sentiment of tweets about Leicester City. We accomplish this by collecting tweets containing the words “Leicester City” using the python library GetOldTweets3; manually labelling those tweets as positive, negative, or neutral; and training an SVM classifier to classify tweets about Leicester City from the 2015/16 season. Our model achieved an F1-score of 0.76. We use the sentiments returned from the classifier to find correlations between real-life events and sentiment changes throughout the whole season and during individual games. From our analysis, we discovered an increase in tweets about Leicester City but a sentiment change from positive to negative as the season progressed. We also observed a wide range of changes in sentiment during a single match involving Leicester City due to real-life events as well as other factors which we discuss in detail

    The Emerging Trends of Multi-Label Learning

    Full text link
    Exabytes of data are generated daily by humans, leading to the growing need for new efforts in dealing with the grand challenges for multi-label learning brought by big data. For example, extreme multi-label classification is an active and rapidly growing research area that deals with classification tasks with an extremely large number of classes or labels; utilizing massive data with limited supervision to build a multi-label classification model becomes valuable for practical applications, etc. Besides these, there are tremendous efforts on how to harvest the strong learning capability of deep learning to better capture the label dependencies in multi-label learning, which is the key for deep learning to address real-world classification tasks. However, it is noted that there has been a lack of systemic studies that focus explicitly on analyzing the emerging trends and new challenges of multi-label learning in the era of big data. It is imperative to call for a comprehensive survey to fulfill this mission and delineate future research directions and new applications.Comment: Accepted to TPAMI 202

    Finetuning Pre-Trained Language Models for Sentiment Classification of COVID19 Tweets

    Get PDF
    It is a common practice in today’s world for the public to use different micro-blogging and social networking platforms, predominantly Twitter, to share opinions, ideas, news, and information about many things in life. Twitter is also becoming a popular channel for information sharing during pandemic outbreaks and disaster events. The world has been suffering from economic crises ever since COVID-19 cases started to increase rapidly since January 2020. The virus has killed more than 800 thousand people ever since the discovery as per the statistics from Worldometer [1] which is the authorized tracking website. So many researchers around the globe are researching into this new virus from different perspectives. One such area is analysing micro-blogging sites like twitter to understand public sentiments. Traditional sentiment analysis methods require complex feature engineering. Many embedding representations have come these days but, their context-independent nature limits their representative power in rich context, due to which performance gets degraded in NLP tasks. Transfer learning has gained the popularity and pretrained language models like BERT(bi-directional Encoder Representations from Transformers) and XLNet which is a Generalised autoregressive model have started overtaking traditional machine learning and deep learning models like Random Forests, Naïve Bayes, Convolutional Neural Networks etc. Despite the great performance results by pretrained language models, it has been observed that finetuning a large pretrained model on downstream task with less training instances is prone to degrade the performance of the model. This research is based on a regularization technique called Mixout proposed by Lee (Lee, 2020). Mixout stochastically mixes the parameters of vanilla network and dropout network. This work is to understand the performance variations of finetuning BERT and XLNet base models on COVID-19 tweets by using Mixout regularization for sentiment classification
    corecore