178 research outputs found

    Better Safe Than Sorry: An Adversarial Approach to Improve Social Bot Detection

    Full text link
    The arm race between spambots and spambot-detectors is made of several cycles (or generations): a new wave of spambots is created (and new spam is spread), new spambot filters are derived and old spambots mutate (or evolve) to new species. Recently, with the diffusion of the adversarial learning approach, a new practice is emerging: to manipulate on purpose target samples in order to make stronger detection models. Here, we manipulate generations of Twitter social bots, to obtain - and study - their possible future evolutions, with the aim of eventually deriving more effective detection techniques. In detail, we propose and experiment with a novel genetic algorithm for the synthesis of online accounts. The algorithm allows to create synthetic evolved versions of current state-of-the-art social bots. Results demonstrate that synthetic bots really escape current detection techniques. However, they give all the needed elements to improve such techniques, making possible a proactive approach for the design of social bot detection systems.Comment: This is the pre-final version of a paper accepted @ 11th ACM Conference on Web Science, June 30-July 3, 2019, Boston, U

    Deductions from a Sub-Saharan African bank’s tweets: A sentiment analysis approach

    Get PDF
    The upsurge in social media websites has in no doubt triggered a huge source of data for mining interesting expressions on a variety of subjects. These expressions on social media websites empower firms and individuals to discover varied interpretations regarding the opinions expressed. In Sub-Saharan Africa, financial institutions are making the needed technological investments required to remain competitive in today’s challenging global business environment. Twitter as one of the digital communication tools has in recent times been integrated into the marketing communication tools of banks to augment the free flow of information. In this light, the purpose of the present study is to perform a sentiment analysis on a large dataset of tweets associated with the Ecobank Group, a prominent pan-African bank in sub-Saharan Africa using four different sentiment lexicons to determine the best lexicon based on its performance. Our results show that Valence Aware Dictionary and sEntiment Reasoner (VADER) outperforms all the other three lexicons based on accuracy and computational efficiency. Additionally, we generated a word cloud to visually examine the terms in the positive and negative sentiment categories based on VADER. Our approach demonstrates that in today’s world of empowered customers, firms need to focus on customer engagement to enhance customer experience via social media channels (e.g., Twitter) since the meaning of competitive advantage has shifted from purely competing over price and product to building loyalty and trust. In theory, the study contributes to broadening the scope of online banking given the interplay of consumer sentiments via the social media channel. Limitations and future research directions are discussed at the end of the paper. © 2020, © 2020 The Author(s). This open access article is distributed under a Creative Commons Attribution (CC-BY) 4.0 license.Tomas Bata University in Zlin [IGA/CebiaTech/2020/001

    ASPECT-BASED OPINION MINING OF PRODUCT REVIEWS IN MICROBLOGS USING MOST RELEVANT FREQUENT CLUSTERS OF TERMS

    Get PDF
    Aspect-based Opinion Mining (ABOM) systems take as input a corpus about a product and aim to mine the aspects (the features or parts) of the product and obtain the opinions of each aspect (how positive or negative the appraisal or emotions towards the aspect is). A few systems like Twitter Aspect Classifier and Twitter Summarization Framework have been proposed to perform ABOM on microblogs. However, the accuracy of these techniques are easily affected by spam posts and buzzwords. In this thesis we address this problem of removing noisy aspects in ABOM by proposing an algorithm called Microblog Aspect Miner (MAM). MAM classifies the microblog posts into subjective and objective posts, represents the frequent nouns in the subjective posts as vectors, and then clusters them to obtain relevant aspects of the product. MAM achieves a 50% improvement in accuracy in obtaining relevant aspects of products compared to previous systems

    Vader: A parsimonious rule-based model for sentiment analysis of social media text

    Get PDF
    Abstract The inherent nature of social media content poses serious challenges to practical applications of sentiment analysis. We present VADER, a simple rule-based model for general sentiment analysis, and compare its effectiveness to eleven typical state-of-practice benchmarks including LIWC, ANEW, the General Inquirer, SentiWordNet, and machine learning oriented techniques relying on Naive Bayes, Maximum Entropy, and Support Vector Machine (SVM) algorithms. Using a combination of qualitative and quantitative methods, we first construct and empirically validate a goldstandard list of lexical features (along with their associated sentiment intensity measures) which are specifically attuned to sentiment in microblog-like contexts. We then combine these lexical features with consideration for five general rules that embody grammatical and syntactical conventions for expressing and emphasizing sentiment intensity. Interestingly, using our parsimonious rule-based model to assess the sentiment of tweets, we find that VADER outperforms individual human raters (F1 Classification Accuracy = 0.96 and 0.84, respectively), and generalizes more favorably across contexts than any of our benchmarks
    • …
    corecore