1,328 research outputs found

    Combination of Domain Knowledge and Deep Learning for Sentiment Analysis of Short and Informal Messages on Social Media

    Full text link
    Sentiment analysis has been emerging recently as one of the major natural language processing (NLP) tasks in many applications. Especially, as social media channels (e.g. social networks or forums) have become significant sources for brands to observe user opinions about their products, this task is thus increasingly crucial. However, when applied with real data obtained from social media, we notice that there is a high volume of short and informal messages posted by users on those channels. This kind of data makes the existing works suffer from many difficulties to handle, especially ones using deep learning approaches. In this paper, we propose an approach to handle this problem. This work is extended from our previous work, in which we proposed to combine the typical deep learning technique of Convolutional Neural Networks with domain knowledge. The combination is used for acquiring additional training data augmentation and a more reasonable loss function. In this work, we further improve our architecture by various substantial enhancements, including negation-based data augmentation, transfer learning for word embeddings, the combination of word-level embeddings and character-level embeddings, and using multitask learning technique for attaching domain knowledge rules in the learning process. Those enhancements, specifically aiming to handle short and informal messages, help us to enjoy significant improvement in performance once experimenting on real datasets.Comment: A Preprint of an article accepted for publication by Inderscience in IJCVR on September 201

    The Today Tendency of Sentiment Classification

    Get PDF
    Sentiment classification has already been studied for many years because it has had many crucial contributions to many different fields in everyday life, such as in political activities, commodity production, and commercial activities. There have been many kinds of the sentiment analysis such as machine learning approaches, lexicon-based approaches, etc., for many years. The today tendency of the sentiment classification is as follows: (1) Processing many big data sets with shortening execution times (2) Having a high accuracy (3) Integrating flexibly and easily into many small machines or many different approaches. We will present each category in more details

    Domain-specific Sentiment Dictionary Construction for Sentiment Classification

    Get PDF
    Sentiment dictionaries are commonly used to solve the problem of sentiment classification for customer reviews. The number of sentiment words in the generalized dictionaries such as SentiWordNet is limited and lack of many sentiment words especially domain-specific sentiment words. Different domains have different sentiment words and the sentiment of a word depends on the domain in which it is used. In this paper, an approach based on Point-wise Mutual Information (PMI) is proposed to construct a domain-specific sentiment dictionary effectively and automatically. The proposed system is evaluated on three diverse datasets from different domains by using 10-fold cross validation. Accordingly to the experimental results, the goodness of the extracted dictionary is relatively high and significantly improves the performance of sentiment classification. The experimental results show that the extracted domain-specific dictionary outperforms the generalized dictionary, SentiWordNet. The proposed method learns the domain-specific sentiment words efficiently and it is domain adaptable

    A Survey on Deep Learning Techniques for Sentiment Analysis

    Get PDF
    Social media is a rich source of information nowadays. If we look into social media, sentiment analysis is one of the challenging problems. Sentiment analysis is a substantial area of research in the field of Natural Language Processing. This survey paper reviews and provides the comparative study of deep learning approaches CNN, RNN, LSTM and ensemble-based methods

    Text pre-processing of multilingual for sentiment analysis based on social network data

    Get PDF
    Sentiment analysis (SA) is an enduring area for research especially in the field of text analysis. Text pre-processing is an important aspect to perform SA accurately. This paper presents a text processing model for SA, using natural language processing techniques for twitter data. The basic phases for machine learning are text collection, text cleaning, pre-processing, feature extractions in a text and then categorize the data according to the SA techniques. Keeping the focus on twitter data, the data is extracted in domain specific manner. In data cleaning phase, noisy data, missing data, punctuation, tags and emoticons have been considered. For pre-processing, tokenization is performed which is followed by stop word removal (SWR). The proposed article provides an insight of the techniques, that are used for text pre-processing, the impact of their presence on the dataset. The accuracy of classification techniques has been improved after applying text pre-processing and dimensionality has been reduced. The proposed corpus can be utilized in the area of market analysis, customer behaviour, polling analysis, and brand monitoring. The text pre-processing process can serve as the baseline to apply predictive analysis, machine learning and deep learning algorithms which can be extended according to problem definition
    • …
    corecore