612 research outputs found
Detecting and Monitoring Hate Speech in Twitter
Social Media are sensors in the real world that can be used to measure the pulse of societies.
However, the massive and unfiltered feed of messages posted in social media is a phenomenon that
nowadays raises social alarms, especially when these messages contain hate speech targeted to a
specific individual or group. In this context, governments and non-governmental organizations
(NGOs) are concerned about the possible negative impact that these messages can have on individuals
or on the society. In this paper, we present HaterNet, an intelligent system currently being used by
the Spanish National Office Against Hate Crimes of the Spanish State Secretariat for Security that
identifies and monitors the evolution of hate speech in Twitter. The contributions of this research
are many-fold: (1) It introduces the first intelligent system that monitors and visualizes, using social
network analysis techniques, hate speech in Social Media. (2) It introduces a novel public dataset on
hate speech in Spanish consisting of 6000 expert-labeled tweets. (3) It compares several classification
approaches based on different document representation strategies and text classification models. (4)
The best approach consists of a combination of a LTSM+MLP neural network that takes as input the
tweet’s word, emoji, and expression tokens’ embeddings enriched by the tf-idf, and obtains an area
under the curve (AUC) of 0.828 on our dataset, outperforming previous methods presented in the
literatureThe work by Quijano-Sanchez was supported by the Spanish Ministry of Science and Innovation
grant FJCI-2016-28855. The research of Liberatore was supported by the Government of Spain, grant MTM2015-65803-R, and by the European Union’s Horizon 2020 Research and Innovation Programme, under the Marie Sklodowska-Curie grant agreement No. 691161 (GEOSAFE). All the financial support is gratefully acknowledge
Sentiment Analysis Based on Deep Learning: A Comparative Study
The study of public opinion can provide us with valuable information. The
analysis of sentiment on social networks, such as Twitter or Facebook, has
become a powerful means of learning about the users' opinions and has a wide
range of applications. However, the efficiency and accuracy of sentiment
analysis is being hindered by the challenges encountered in natural language
processing (NLP). In recent years, it has been demonstrated that deep learning
models are a promising solution to the challenges of NLP. This paper reviews
the latest studies that have employed deep learning to solve sentiment analysis
problems, such as sentiment polarity. Models using term frequency-inverse
document frequency (TF-IDF) and word embedding have been applied to a series of
datasets. Finally, a comparative study has been conducted on the experimental
results obtained for the different models and input feature
Social media cross-source and cross-domain sentiment classification
Due to the expansion of Internet and Web 2.0 phenomenon, there is a growing interest in the sentiment analysis of freely opinionated text. In this paper, we propose a novel cross-source cross-domain sentiment classification, in which cross-domain labeled Web sources (Amazon and Tripadvisor) are used to train supervised learning models (including two deep learning algorithms) that are tested on typically non labeled social media reviews (Facebook and Twitter). We explored a three step methodology, in which dis- tinct balanced training, text preprocessing and machine learning methods were tested, using two languages: English and Italian. The best results were achieved when using undersampling training and a Convolutional Neural Network. Interesting cross-source classification performances were achieved, in particular when using Amazon and Tripadvisor reviews to train a model that is tested on Facebook data for both English and Italian.Research carried out with the support of resources of Big&Open Data Innovation Laboratory (BODaI-Lab), the University of Brescia, granted by Fondazione Cariplo and Regione Lombardia. The work of P. Cortez was supported by FCT - Fundacao para a Ciencia e Tecnologia within the Project Scope UID/CEC/00319/2019. We would also like to thank the three anonymous reviewers for their helpful suggestions
Sentiment Analysis Using Deep Learning: A Comparison Between Chinese And English
With the increasing popularity of opinion-rich resources, opinion mining and
sentiment analysis has received increasing attention. Sentiment analysis is one of
the most effective ways to find the opinion of authors. By mining what people think,
sentiment analysis can provide the basis for decision making. Most of the objects of
analysis are text data, such as Facebook status and movie reviews. Despite many
sentiment classification models having good performance on English corpora, they
are not good at Chinese or other languages. Traditional sentiment approaches
impose many restrictions on the raw data, and they don't have enough capacity to
deal with long-distance sequential dependencies.
So, we propose a model based on recurrent neural network model using a
context vector space model. Chinese information entropy is typically higher than
English, we therefore hypothesise that context vector space model can be used to
improve the accuracy of sentiment analysis. Our algorithm represents each complex
input by a dense vector trained to translate sequence data to another sequence, like
the translation of English and French. Then we build a recurrent neural network with
the Long-Short-Term Memory model to deal the long-distance dependencies in input
data, such as movie review. The results show that our approach has promise but still
has a lot of room for improvement
- …