Using millions of emoji occurrences to learn any-domain representations for detecting sentiment, emotion and sarcasm
NLP tasks are often limited by scarcity of manually annotated data. In social
media sentiment analysis and related tasks, researchers have therefore used
binarized emoticons and specific hashtags as forms of distant supervision. Our
paper shows that by extending the distant supervision to a more diverse set of
noisy labels, the models can learn richer representations. Through emoji
prediction on a dataset of 1246 million tweets containing one of 64 common
emojis we obtain state-of-the-art performance on 8 benchmark datasets within
sentiment, emotion and sarcasm detection using a single pretrained model. Our
analyses confirm that the diversity of our emotional labels yields a performance
improvement over previous distant supervision approaches.
Comment: Accepted at EMNLP 2017. Please include EMNLP in any citations. Minor
changes from the EMNLP camera-ready version. 9 pages + references and
supplementary material.
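The distant supervision idea in this abstract can be illustrated with a minimal sketch: an emoji found in a tweet is removed from the text and used as a noisy label. The four-emoji label list and the `make_example` helper are hypothetical stand-ins (the paper uses 64 emojis, which are not listed here), and skipping tweets with zero or several distinct label emojis is a simplifying assumption, not necessarily the authors' procedure.

```python
# Hypothetical label set for illustration only; the paper's 64 emojis are not listed here.
EMOJI_LABELS = [
    "\U0001F602",  # face with tears of joy
    "\U0001F60D",  # smiling face with heart-eyes
    "\U0001F62D",  # loudly crying face
    "\U0001F621",  # pouting face
]

def make_example(tweet):
    """Turn a tweet into a (text, label_index) pair, or None if it is unusable.

    Assumption: only tweets containing exactly one distinct label emoji are kept.
    """
    present = [e for e in EMOJI_LABELS if e in tweet]
    if len(present) != 1:
        return None
    label = present[0]
    # Remove every occurrence of the label emoji and normalize whitespace,
    # so the model cannot read the label directly from the input text.
    text = " ".join(tweet.replace(label, " ").split())
    return text, EMOJI_LABELS.index(label)
```

A classifier trained to predict `label_index` from `text` would then be the pretraining step; fine-tuning on the benchmark datasets is outside the scope of this sketch.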
Argument Strength is in the Eye of the Beholder: Audience Effects in Persuasion
Americans spend about a third of their time online, with many participating
in online conversations on social and political issues. We hypothesize that
social media arguments on such issues may be more engaging and persuasive than
traditional media summaries, and that particular types of people may be more or
less convinced by particular styles of argument, e.g. emotional arguments may
resonate with some personalities while factual arguments resonate with others.
We report a set of experiments testing at large scale how audience variables
interact with argument style to affect the persuasiveness of an argument, an
under-researched topic within natural language processing. We show that belief
change is affected by personality factors, with conscientious, open and
agreeable people being more convinced by emotional arguments.
Comment: European Chapter of the Association for Computational Linguistics
(EACL 2017)
Deep Learning for User Comment Moderation
Experimenting with a new dataset of 1.6M user comments from a Greek news
portal and existing datasets of English Wikipedia comments, we show that an RNN
outperforms the previous state of the art in moderation. A deep,
classification-specific attention mechanism further improves the overall
performance of the RNN. We also compare against a CNN and a word-list baseline,
considering both fully automatic and semi-automatic moderation.
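As a rough illustration of attention over RNN states, the sketch below pools a sequence of hidden-state vectors into one vector using softmax weights. The names `attention_pool` and `score_fn` are hypothetical; the abstract does not give the details of the authors' deep attention mechanism, so `score_fn` is only a placeholder for whatever scoring network assigns a relevance score to each hidden state.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention_pool(hidden_states, score_fn):
    """Return (pooled_vector, weights) for a list of equal-length hidden states.

    score_fn is a placeholder (assumption): it maps one hidden state to a scalar score.
    """
    weights = softmax([score_fn(h) for h in hidden_states])
    dim = len(hidden_states[0])
    # Weighted sum of the hidden states, computed one dimension at a time.
    pooled = [
        sum(w * h[i] for w, h in zip(weights, hidden_states))
        for i in range(dim)
    ]
    return pooled, weights
```

The pooled vector would typically be passed to a final classification layer that decides whether a comment should be accepted or rejected.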