1,328 research outputs found
Combination of Domain Knowledge and Deep Learning for Sentiment Analysis of Short and Informal Messages on Social Media
Sentiment analysis has been emerging recently as one of the major natural
language processing (NLP) tasks in many applications. Especially, as social
media channels (e.g. social networks or forums) have become significant sources
for brands to observe user opinions about their products, this task is thus
increasingly crucial. However, when applied with real data obtained from social
media, we notice that there is a high volume of short and informal messages
posted by users on those channels. This kind of data makes the existing works
suffer from many difficulties to handle, especially ones using deep learning
approaches. In this paper, we propose an approach to handle this problem. This
work is extended from our previous work, in which we proposed to combine the
typical deep learning technique of Convolutional Neural Networks with domain
knowledge. The combination is used for acquiring additional training data
augmentation and a more reasonable loss function. In this work, we further
improve our architecture by various substantial enhancements, including
negation-based data augmentation, transfer learning for word embeddings, the
combination of word-level embeddings and character-level embeddings, and using
multitask learning technique for attaching domain knowledge rules in the
learning process. Those enhancements, specifically aiming to handle short and
informal messages, help us to enjoy significant improvement in performance once
experimenting on real datasets.Comment: A Preprint of an article accepted for publication by Inderscience in
IJCVR on September 201
The Today Tendency of Sentiment Classification
Sentiment classification has already been studied for many years because it has had many crucial contributions to many different fields in everyday life, such as in political activities, commodity production, and commercial activities. There have been many kinds of the sentiment analysis such as machine learning approaches, lexicon-based approaches, etc., for many years. The today tendency of the sentiment classification is as follows: (1) Processing many big data sets with shortening execution times (2) Having a high accuracy (3) Integrating flexibly and easily into many small machines or many different approaches. We will present each category in more details
Domain-specific Sentiment Dictionary Construction for Sentiment Classification
Sentiment dictionaries are commonly used to solve the problem of sentiment classification for customer reviews. The number of sentiment words in the generalized dictionaries such as SentiWordNet is limited and lack of many sentiment words especially domain-specific sentiment words. Different domains have different sentiment words and the sentiment of a word depends on the domain in which it is used. In this paper, an approach based on Point-wise Mutual Information (PMI) is proposed to construct a domain-specific sentiment dictionary effectively and automatically. The proposed system is evaluated on three diverse datasets from different domains by using 10-fold cross validation. Accordingly to the experimental results, the goodness of the extracted dictionary is relatively high and significantly improves the performance of sentiment classification. The experimental results show that the extracted domain-specific dictionary outperforms the generalized dictionary, SentiWordNet. The proposed method learns the domain-specific sentiment words efficiently and it is domain adaptable
A Survey on Deep Learning Techniques for Sentiment Analysis
Social media is a rich source of information nowadays. If we look into social media, sentiment analysis is one of the challenging problems. Sentiment analysis is a substantial area of research in the field of Natural Language Processing. This survey paper reviews and provides the comparative study of deep learning approaches CNN, RNN, LSTM and ensemble-based methods
Text pre-processing of multilingual for sentiment analysis based on social network data
Sentiment analysis (SA) is an enduring area for research especially in the field of text analysis. Text pre-processing is an important aspect to perform SA accurately. This paper presents a text processing model for SA, using natural language processing techniques for twitter data. The basic phases for machine learning are text collection, text cleaning, pre-processing, feature extractions in a text and then categorize the data according to the SA techniques. Keeping the focus on twitter data, the data is extracted in domain specific manner. In data cleaning phase, noisy data, missing data, punctuation, tags and emoticons have been considered. For pre-processing, tokenization is performed which is followed by stop word removal (SWR). The proposed article provides an insight of the techniques, that are used for text pre-processing, the impact of their presence on the dataset. The accuracy of classification techniques has been improved after applying text pre-processing and dimensionality has been reduced. The proposed corpus can be utilized in the area of market analysis, customer behaviour, polling analysis, and brand monitoring. The text pre-processing process can serve as the baseline to apply predictive analysis, machine learning and deep learning algorithms which can be extended according to problem definition
- …