ConStance: Modeling Annotation Contexts to Improve Stance Classification
Manual annotations are a prerequisite for many applications of machine
learning. However, weaknesses in the annotation process itself are easy to
overlook. In particular, scholars often choose what information to give to
annotators without examining these decisions empirically. For subjective tasks
such as sentiment analysis, sarcasm, and stance detection, such choices can
impact results. Here, for the task of political stance detection on Twitter, we
show that providing too little context can result in noisy and uncertain
annotations, whereas providing too strong a context may cause that context to
outweigh other signals. To characterize and reduce these biases, we develop
ConStance, a
general model for reasoning about annotations across information conditions.
Given conflicting labels produced by multiple annotators seeing the same
instances with different contexts, ConStance simultaneously estimates gold
standard labels and also learns a classifier for new instances. We show that
the classifier learned by ConStance outperforms a variety of baselines at
predicting political stance, while the model's interpretable parameters shed
light on the effects of each context.
Comment: To appear at EMNLP 201
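ConStance's full model is richer than this, but the core idea of jointly estimating gold labels and per-annotator reliabilities from conflicting labels can be sketched with a simple EM-style weighted vote. The function name and the scalar-reliability simplification are illustrative, not the paper's actual parameterization:

```python
from collections import Counter

def aggregate_labels(annotations, n_iter=10):
    """Estimate gold labels from conflicting annotations.

    annotations: dict item_id -> list of (annotator_id, label) pairs.
    Simplified sketch: each annotator gets one scalar reliability
    weight, and labels are re-estimated by weighted majority vote.
    """
    # Initialize every annotator with equal reliability.
    weights = {a: 1.0 for labels in annotations.values() for a, _ in labels}
    gold = {}
    for _ in range(n_iter):
        # E-step: weighted vote per item under current reliabilities.
        for item, labels in annotations.items():
            votes = Counter()
            for a, lab in labels:
                votes[lab] += weights[a]
            gold[item] = votes.most_common(1)[0][0]
        # M-step: reliability = agreement rate with current gold labels.
        hits, total = Counter(), Counter()
        for item, labels in annotations.items():
            for a, lab in labels:
                total[a] += 1
                hits[a] += lab == gold[item]
        weights = {a: hits[a] / total[a] for a in total}
    return gold, weights
```

In ConStance the reliability structure additionally depends on which information condition (context) the annotator saw, which this scalar-weight sketch collapses away.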
Semantic Sentiment Analysis of Twitter Data
The Internet and the proliferation of smart mobile devices have changed the way
information is created, shared, and spreads, e.g., microblogs such as Twitter,
weblogs such as LiveJournal, social networks such as Facebook, and instant
messengers such as Skype and WhatsApp are now commonly used to share thoughts
and opinions about anything in the surrounding world. This has resulted in the
proliferation of social media content, thus creating new opportunities to study
public opinion at a scale that was never possible before. Naturally, this
abundance of data has quickly attracted business and research interest from
various fields including marketing, political science, and social studies,
among many others, which are interested in questions like these: Do people like
the new Apple Watch? Do Americans support ObamaCare? How do the Scottish feel
about Brexit? Answering these questions requires studying the sentiment of the
opinions people express in social media, which has given rise to the rapid
growth of the field of sentiment analysis in social media, with Twitter
especially popular for research due to its scale, representativeness, variety
of topics discussed, and ease of public access to its messages. Here we
present an overview of work on sentiment analysis on Twitter.
Comment: Microblog sentiment analysis; Twitter opinion mining; In the
Encyclopedia on Social Network Analysis and Mining (ESNAM), Second edition.
201
Deep Memory Networks for Attitude Identification
We consider the task of identifying attitudes towards a given set of entities
from text. Conventionally, this task is decomposed into two separate subtasks:
target detection that identifies whether each entity is mentioned in the text,
either explicitly or implicitly, and polarity classification that classifies
the exact sentiment towards an identified entity (the target) into positive,
negative, or neutral.
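The conventional two-stage decomposition described above can be made concrete with a toy pipeline; the entity list and lexicon words here are hypothetical illustrations, not the paper's model:

```python
def detect_targets(text, entities):
    """Subtask 1: which candidate entities are mentioned.

    Toy version: explicit substring match only (the paper also
    considers implicit mentions, which this sketch ignores).
    """
    t = text.lower()
    return [e for e in entities if e.lower() in t]

# Hypothetical sentiment lexicon for illustration.
POS = {"love", "great", "support"}
NEG = {"hate", "terrible", "oppose"}

def classify_polarity(text, target):
    """Subtask 2: polarity toward one detected target (toy lexicon count)."""
    words = set(text.lower().split())
    score = len(words & POS) - len(words & NEG)
    return "positive" if score > 0 else "negative" if score < 0 else "neutral"

def attitude_pipeline(text, entities):
    """Run the two subtasks strictly in sequence, with no feedback."""
    return {t: classify_polarity(text, t) for t in detect_targets(text, entities)}
```

Because the stages run strictly in sequence, polarity evidence can never inform target detection; removing exactly that limitation is the motivation for the interleaved architecture the abstract goes on to describe.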
Instead, we show that attitude identification can be solved with an
end-to-end machine learning architecture, in which the two subtasks are
interleaved by a deep memory network. In this way, signals produced in target
detection provide clues for polarity classification and, conversely, the
predicted polarity provides feedback to target identification.
Moreover, the treatments for the set of targets also influence each other --
the learned representations may share the same semantics for some targets but
vary for others. The proposed deep memory network, the AttNet, outperforms
methods that do not consider the interactions between the subtasks or those
among the targets, including conventional machine learning methods and the
state-of-the-art deep learning models.
Comment: Accepted to WSDM'1
A Retrospective Analysis of the Fake News Challenge Stance Detection Task
The 2017 Fake News Challenge Stage 1 (FNC-1) shared task addressed a stance
classification task as a crucial first step towards detecting fake news. To
date, there is no in-depth analysis paper to critically discuss FNC-1's
experimental setup, reproduce the results, and draw conclusions for
next-generation stance classification methods. In this paper, we provide such
an in-depth analysis for the three top-performing systems. We first find that
FNC-1's proposed evaluation metric favors the majority class, which can be
easily classified, and thus overestimates the true discriminative power of the
methods. Therefore, we propose a new F1-based metric yielding a changed system
ranking. Next, we compare the features and architectures used, which leads to a
novel feature-rich stacked LSTM model that performs on par with the best
systems, but is superior in predicting minority classes. To understand the
methods' ability to generalize, we derive a new dataset and perform both
in-domain and cross-domain experiments. Our qualitative and quantitative study
helps interpret the original FNC-1 scores and clarifies which features improve
performance and why. Our new dataset and all source code used during the
reproduction study are publicly available for future research.
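The majority-class problem the abstract identifies can be made concrete with a macro-averaged F1, in which every class contributes equally regardless of its frequency. This is a sketch of the general idea; the paper's exact metric definition may differ:

```python
def macro_f1(gold, pred):
    """Macro-averaged F1: per-class F1 scores averaged with equal weight,
    so a majority-class baseline no longer dominates the score."""
    classes = sorted(set(gold))
    f1s = []
    for c in classes:
        tp = sum(g == c and p == c for g, p in zip(gold, pred))
        fp = sum(g != c and p == c for g, p in zip(gold, pred))
        fn = sum(g == c and p != c for g, p in zip(gold, pred))
        prec = tp / (tp + fp) if tp + fp else 0.0
        rec = tp / (tp + fn) if tp + fn else 0.0
        f1s.append(2 * prec * rec / (prec + rec) if prec + rec else 0.0)
    return sum(f1s) / len(f1s)
```

On a label distribution dominated by one easy class (as with "unrelated" in FNC-1), a predictor that always outputs the majority class scores high accuracy yet low macro F1, which is the overestimation effect the authors describe.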