128 research outputs found
Fine-Grained Emotion Classification of Chinese Microblogs Based on Graph Convolution Networks
Microblogs are widely used to express people's opinions and feelings in daily
life. Sentiment analysis (SA) can timely detect personal sentiment polarities
through analyzing text. Deep learning approaches have been broadly used in SA
but still have not fully exploited syntax information. In this paper, we
propose a syntax-based graph convolution network (GCN) model to enhance the
understanding of diverse grammatical structures of Chinese microblogs. In
addition, a pooling method based on percentile is proposed to improve the
accuracy of the model. In experiments, for Chinese microblogs emotion
classification categories including happiness, sadness, like, anger, disgust,
fear, and surprise, the F-measure of our model reaches 82.32% and exceeds the
state-of-the-art algorithm by 5.90%. The experimental results show that our
model can effectively utilize the information of dependency parsing to improve
the performance of emotion detection. What is more, we annotate a new dataset
for Chinese emotion classification, which is open to other researchers.Comment: 20 pages, 6 figures, submitted to the World Wide Web Journa
Microblog Hashtag Generation via Encoding Conversation Contexts
Automatic hashtag annotation plays an important role in content understanding
for microblog posts. To date, progress made in this field has been restricted
to phrase selection from limited candidates, or word-level hashtag discovery
using topic models. Different from previous work considering hashtags to be
inseparable, our work is the first effort to annotate hashtags with a novel
sequence generation framework via viewing the hashtag as a short sequence of
words. Moreover, to address the data sparsity issue in processing short
microblog posts, we propose to jointly model the target posts and the
conversation contexts initiated by them with bidirectional attention. Extensive
experimental results on two large-scale datasets, newly collected from English
Twitter and Chinese Weibo, show that our model significantly outperforms
state-of-the-art models based on classification. Further studies demonstrate
our ability to effectively generate rare and even unseen hashtags, which is
however not possible for most existing methods.Comment: NAACL 2019 (10 pages
Tag Recommendation by Word-Level Tag Sequence Modeling
In this paper, we transform tag recommendation into a word-based text
generation problem and introduce a sequence-to-sequence model. The model
inherits the advantages of LSTM-based encoder for sequential modeling and
attention-based decoder with local positional encodings for learning relations
globally. Experimental results on Zhihu datasets illustrate the proposed model
outperforms other state-of-the-art text classification based methods.Comment: This is a full length version of the paper in DASFAA 201
Building and Using Personal Knowledge Graph to Improve Suicidal Ideation Detection on Social Media
A large number of individuals are suffering from suicidal ideation in the
world. There are a number of causes behind why an individual might suffer from
suicidal ideation. As the most popular platform for self-expression, emotion
release, and personal interaction, individuals may exhibit a number of symptoms
of suicidal ideation on social media. Nevertheless, challenges from both data
and knowledge aspects remain as obstacles, constraining the social media-based
detection performance. Data implicitness and sparsity make it difficult to
discover the inner true intentions of individuals based on their posts.
Inspired by psychological studies, we build and unify a high-level
suicide-oriented knowledge graph with deep neural networks for suicidal
ideation detection on social media. We further design a two-layered attention
mechanism to explicitly reason and establish key risk factors to individual's
suicidal ideation. The performance study on microblog and Reddit shows that: 1)
with the constructed personal knowledge graph, the social media-based suicidal
ideation detection can achieve over 93% accuracy; and 2) among the six
categories of personal factors, post, personality, and experience are the top-3
key indicators. Under these categories, posted text, stress level, stress
duration, posted image, and ruminant thinking contribute to one's suicidal
ideation detection.Comment: Accepted to IEEE Transaction on Multimedi
Over a Decade of Social Opinion Mining: A Systematic Review
Social media popularity and importance is on the increase due to people using
it for various types of social interaction across multiple channels. This
systematic review focuses on the evolving research area of Social Opinion
Mining, tasked with the identification of multiple opinion dimensions, such as
subjectivity, sentiment polarity, emotion, affect, sarcasm and irony, from
user-generated content represented across multiple social media platforms and
in various media formats, like text, image, video and audio. Through Social
Opinion Mining, natural language can be understood in terms of the different
opinion dimensions, as expressed by humans. This contributes towards the
evolution of Artificial Intelligence which in turn helps the advancement of
several real-world use cases, such as customer service and decision making. A
thorough systematic review was carried out on Social Opinion Mining research
which totals 485 published studies and spans a period of twelve years between
2007 and 2018. The in-depth analysis focuses on the social media platforms,
techniques, social datasets, language, modality, tools and technologies, and
other aspects derived. Social Opinion Mining can be utilised in many
application areas, ranging from marketing, advertising and sales for
product/service management, and in multiple domains and industries, such as
politics, technology, finance, healthcare, sports and government. The latest
developments in Social Opinion Mining beyond 2018 are also presented together
with future research directions, with the aim of leaving a wider academic and
societal impact in several real-world applications.Comment: 170 pages, 3 figures. This is a preprint of an article published in
Artificial Intelligence Review (2021
Mining Dual Emotion for Fake News Detection
Emotion plays an important role in detecting fake news online. When
leveraging emotional signals, the existing methods focus on exploiting the
emotions of news contents that conveyed by the publishers (i.e., publisher
emotion). However, fake news often evokes high-arousal or activating emotions
of people, so the emotions of news comments aroused in the crowd (i.e., social
emotion) should not be ignored. Furthermore, it remains to be explored whether
there exists a relationship between publisher emotion and social emotion (i.e.,
dual emotion), and how the dual emotion appears in fake news. In this paper, we
verify that dual emotion is distinctive between fake and real news and propose
Dual Emotion Features to represent dual emotion and the relationship between
them for fake news detection. Further, we exhibit that our proposed features
can be easily plugged into existing fake news detectors as an enhancement.
Extensive experiments on three real-world datasets (one in English and the
others in Chinese) show that our proposed feature set: 1) outperforms the
state-of-the-art task-related emotional features; 2) can be well compatible
with existing fake news detectors and effectively improve the performance of
detecting fake news.Comment: Accepted by WWW 202
Deep Learning for Sentiment Analysis : A Survey
Deep learning has emerged as a powerful machine learning technique that
learns multiple layers of representations or features of the data and produces
state-of-the-art prediction results. Along with the success of deep learning in
many other application domains, deep learning is also popularly used in
sentiment analysis in recent years. This paper first gives an overview of deep
learning and then provides a comprehensive survey of its current applications
in sentiment analysis.Comment: 34 pages, 9 figures, 2 table
Mining Significant Microblogs for Misinformation Identification: An Attention-based Approach
With the rapid growth of social media, massive misinformation is also
spreading widely on social media, such as microblog, and bring negative effects
to human life. Nowadays, automatic misinformation identification has drawn
attention from academic and industrial communities. For an event on social
media usually consists of multiple microblogs, current methods are mainly based
on global statistical features. However, information on social media is full of
noisy and outliers, which should be alleviated. Moreover, most of microblogs
about an event have little contribution to the identification of
misinformation, where useful information can be easily overwhelmed by useless
information. Thus, it is important to mine significant microblogs for a
reliable misinformation identification method. In this paper, we propose an
Attention-based approach for Identification of Misinformation (AIM). Based on
the attention mechanism, AIM can select microblogs with largest attention
values for misinformation identification. The attention mechanism in AIM
contains two parts: content attention and dynamic attention. Content attention
is calculated based textual features of each microblog. Dynamic attention is
related to the time interval between the posting time of a microblog and the
beginning of the event. To evaluate AIM, we conduct a series of experiments on
the Weibo dataset and the Twitter dataset, and the experimental results show
that the proposed AIM model outperforms the state-of-the-art methods
Topic-Aware Neural Keyphrase Generation for Social Media Language
A huge volume of user-generated content is daily produced on social media. To
facilitate automatic language understanding, we study keyphrase prediction,
distilling salient information from massive posts. While most existing methods
extract words from source posts to form keyphrases, we propose a
sequence-to-sequence (seq2seq) based neural keyphrase generation framework,
enabling absent keyphrases to be created. Moreover, our model, being
topic-aware, allows joint modeling of corpus-level latent topic
representations, which helps alleviate the data sparsity that widely exhibited
in social media language. Experiments on three datasets collected from English
and Chinese social media platforms show that our model significantly
outperforms both extraction and generation models that do not exploit latent
topics. Further discussions show that our model learns meaningful topics, which
interprets its superiority in social media keyphrase generation.Comment: ACL 2019 (11 pages
Emotion Correlation Mining Through Deep Learning Models on Natural Language Text
Emotion analysis has been attracting researchers' attention. Most previous
works in the artificial intelligence field focus on recognizing emotion rather
than mining the reason why emotions are not or wrongly recognized. Correlation
among emotions contributes to the failure of emotion recognition. In this
paper, we try to fill the gap between emotion recognition and emotion
correlation mining through natural language text from web news. Correlation
among emotions, expressed as the confusion and evolution of emotion, is
primarily caused by human emotion cognitive bias. To mine emotion correlation
from emotion recognition through text, three kinds of features and two deep
neural network models are presented. The emotion confusion law is extracted
through orthogonal basis. The emotion evolution law is evaluated from three
perspectives, one-step shift, limited-step shifts, and shortest path transfer.
The method is validated using three datasets-the titles, the bodies, and the
comments of news articles, covering both objective and subjective texts in
varying lengths (long and short). The experimental results show that, in
subjective comments, emotions are easily mistaken as anger. Comments tend to
arouse emotion circulations of love-anger and sadness-anger. In objective news,
it is easy to recognize text emotion as love and cause fear-joy circulation.
That means, journalists may try to attract attention using fear and joy words
but arouse the emotion love instead; After news release, netizens generate
emotional comments to express their intense emotions, i.e., anger, sadness, and
love. These findings could provide insights for applications regarding
affective interaction such as network public sentiment, social media
communication, and human-computer interaction
- …