Search CORE

35,030 research outputs found

Automatic Analysis of Plot for Story Rewriting

Author: Halpin Harry
Moore Johanna D.
Robertson Judy
Publication venue
Publication date: 01/01/2004
Field of study

Organized Behavior Classification of Tweet Sets using Supervised Learning Methods

Author: Foucault Nicolas
Levchuk Georgiy
Ratkiewicz Jacob
Vijayaraghavan Prashanth
Publication venue
Publication date: 29/11/2017
Field of study

During the 2016 US elections Twitter experienced unprecedented levels of propaganda and fake news through the collaboration of bots and hired persons, the ramifications of which are still being debated. This work proposes an approach to identify the presence of organized behavior in tweets. The Random Forest, Support Vector Machine, and Logistic Regression algorithms are each used to train a model with a data set of 850 records consisting of 299 features extracted from tweets gathered during the 2016 US presidential election. The features represent user and temporal synchronization characteristics to capture coordinated behavior. These models are trained to classify tweet sets among the categories: organic vs organized, political vs non-political, and pro-Trump vs pro-Hillary vs neither. The random forest algorithm performs better with greater than 95% average accuracy and f-measure scores for each category. The most valuable features for classification are identified as user based features, with media use and marking tweets as favorite to be the most dominant.Comment: 51 pages, 5 figure

arXiv.org e-Print Archive