Search CORE

857 research outputs found

Computational Sarcasm Analysis on Social Media: A Systematic Review

Author: Hasan Kamrul
Kabir Mohsinul
Kader Faria Binte
Mahmud Hasan
Nujat Nafisa Hossain
Sogir Tasmia Binte
Publication venue
Publication date: 20/09/2022
Field of study

Sarcasm can be defined as saying or writing the opposite of what one truly wants to express, usually to insult, irritate, or amuse someone. Because of the obscure nature of sarcasm in textual data, detecting it is difficult and of great interest to the sentiment analysis research community. Though the research in sarcasm detection spans more than a decade, some significant advancements have been made recently, including employing unsupervised pre-trained transformers in multimodal environments and integrating context to identify sarcasm. In this study, we aim to provide a brief overview of recent advancements and trends in computational sarcasm research for the English language. We describe relevant datasets, methodologies, trends, issues, challenges, and tasks relating to sarcasm that are beyond detection. Our study provides well-summarized tables of sarcasm datasets, sarcastic features and their extraction methods, and performance analysis of various approaches which can help researchers in related domains understand current state-of-the-art practices in sarcasm detection.Comment: 50 pages, 3 tables, Submitted to 'Data Mining and Knowledge Discovery' for possible publicatio

arXiv.org e-Print Archive

Twitter Sentiment Analysis via Bi-sense Emoji Embedding and Attention-based LSTM

Author: Agarwal Apoorv
Alshenqeeti Hamza
Baccianella Stefano
Clayton
Dimson Thomas
Esuli Andrea
González-Ibánez Roberto
Hu Tianran
Kouloumpis Efthymios
Li Weijian
Li Xiang
Liu Kun-Lin
Novak Petra Kralj
Pak Alexander
van der Maaten Laurens
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 06/08/2018
Field of study

Sentiment analysis on large-scale social media data is important to bridge the gaps between social media contents and real world activities including political election prediction, individual and public emotional status monitoring and analysis, and so on. Although textual sentiment analysis has been well studied based on platforms such as Twitter and Instagram, analysis of the role of extensive emoji uses in sentiment analysis remains light. In this paper, we propose a novel scheme for Twitter sentiment analysis with extra attention on emojis. We first learn bi-sense emoji embeddings under positive and negative sentimental tweets individually, and then train a sentiment classifier by attending on these bi-sense emoji embeddings with an attention-based long short-term memory network (LSTM). Our experiments show that the bi-sense embedding is effective for extracting sentiment-aware embeddings of emojis and outperforms the state-of-the-art models. We also visualize the attentions to show that the bi-sense emoji embedding provides better guidance on the attention mechanism to obtain a more robust understanding of the semantics and sentiments

arXiv.org e-Print Archive

Automatic stance detection on political discourse in Twitter

Author: Zotova Elena
Publication venue
Publication date: 01/01/2019
Field of study

The majority of opinion mining tasks in natural language processing (NLP) have been focused on sentiment analysis of texts about products and services while there is comparatively less research on automatic detection of political opinion. Almost all previous research work has been done for English, while this thesis is focused on the automatic detection of stance (whether he or she is favorable or not towards important political topic) from Twitter posts in Catalan, Spanish and English. The main objective of this work is to build and compare automatic stance detection systems using supervised both classic machine and deep learning techniques. We also study the influence of text normalization and perform experiments with differentt methods for word representations such as TF-IDF measures for unigrams, word embeddings, tweet embeddings, and contextual character-based embeddings. We obtain state-of-the-art results in the stance detection task on the IberEval 2018 dataset. Our research shows that text normalization and feature selection is important for the systems with unigram features, and does not affect the performance when working with word vector representations. Classic methods such as unigrams and SVM classifier still outperform deep learning techniques, but seem to be prone to overfitting. The classifiers trained using word vector representations and the neural network models encoded with contextual character-based vectors show greater robustness

Archivo Digital para la Docencia y la Investigación

Automatic stance detection on political discourse in Twitter

Author: Zotova Elena
Publication venue
Publication date: 23/09/2019
Field of study

Archivo Digital para la Docencia y la Investigación