19 research outputs found
An emotional mess! Deciding on a framework for building a Dutch emotion-annotated corpus
Seeing the myriad of existing emotion models, with the categorical versus dimensional opposition the most important dividing line, building an emotion-annotated corpus requires some well thought-out strategies concerning framework choice. In our work on automatic emotion detection in Dutch texts, we investigate this problem by means of two case studies. We find that the labels joy, love, anger, sadness and fear are well-suited to annotate texts coming from various domains and topics, but that the connotation of the labels strongly depends on the origin of the texts. Moreover, it seems that information is lost when an emotional state is forcedly classified in a limited set of categories, indicating that a bi-representational format is desirable when creating an emotion corpus.Seeing the myriad of existing emotion models, with the categorical versus dimensional opposition the most important dividing line, building an emotion-annotated corpus requires some well thought-out strategies concerning framework choice. In our work on automatic emotion detection in Dutch texts, we investigate this problem by means of two case studies. We find that the labels joy, love, anger, sadness and fear are well-suited to annotate texts coming from various domains and topics, but that the connotation of the labels strongly depends on the origin of the texts. Moreover, it seems that information is lost when an emotional state is forcedly classified in a limited set of categories, indicating that a bi-representational format is desirable when creating an emotion corpus.P
IEST: WASSA-2018 Implicit Emotions Shared Task
Past shared tasks on emotions use data with both overt expressions of
emotions (I am so happy to see you!) as well as subtle expressions where the
emotions have to be inferred, for instance from event descriptions. Further,
most datasets do not focus on the cause or the stimulus of the emotion. Here,
for the first time, we propose a shared task where systems have to predict the
emotions in a large automatically labeled dataset of tweets without access to
words denoting emotions. Based on this intention, we call this the Implicit
Emotion Shared Task (IEST) because the systems have to infer the emotion mostly
from the context. Every tweet has an occurrence of an explicit emotion word
that is masked. The tweets are collected in a manner such that they are likely
to include a description of the cause of the emotion - the stimulus.
Altogether, 30 teams submitted results which range from macro F1 scores of 21 %
to 71 %. The baseline (MaxEnt bag of words and bigrams) obtains an F1 score of
60 % which was available to the participants during the development phase. A
study with human annotators suggests that automatic methods outperform human
predictions, possibly by honing into subtle textual clues not used by humans.
Corpora, resources, and results are available at the shared task website at
http://implicitemotions.wassa2018.com.Comment: Accepted at Proceedings of the 9th Workshop on Computational
Approaches to Subjectivity, Sentiment and Social Media Analysi
From Independent Prediction to Re-ordered Prediction: Integrating Relative Position and Global Label Information to Emotion Cause Identification
Emotion cause identification aims at identifying the potential causes that
lead to a certain emotion expression in text. Several techniques including rule
based methods and traditional machine learning methods have been proposed to
address this problem based on manually designed rules and features. More
recently, some deep learning methods have also been applied to this task, with
the attempt to automatically capture the causal relationship of emotion and its
causes embodied in the text. In this work, we find that in addition to the
content of the text, there are another two kinds of information, namely
relative position and global labels, that are also very important for emotion
cause identification. To integrate such information, we propose a model based
on the neural network architecture to encode the three elements (, text
content, relative position and global label), in an unified and end-to-end
fashion. We introduce a relative position augmented embedding learning
algorithm, and transform the task from an independent prediction problem to a
reordered prediction problem, where the dynamic global label information is
incorporated. Experimental results on a benchmark emotion cause dataset show
that our model achieves new state-of-the-art performance and performs
significantly better than a number of competitive baselines. Further analysis
shows the effectiveness of the relative position augmented embedding learning
algorithm and the reordered prediction mechanism with dynamic global labels.Comment: Accepted by AAAI 201