Search CORE

60,250 research outputs found

Combination of Domain Knowledge and Deep Learning for Sentiment Analysis of Short and Informal Messages on Social Media

Author: Mai Trung
Nguyen Mao
Nguyen Tri
Pham Dang
Quan Tho
Truong Minh
Vo Khuong
Publication venue: 'Inderscience Publishers'
Publication date: 01/01/2019
Field of study

Sentiment analysis has been emerging recently as one of the major natural language processing (NLP) tasks in many applications. Especially, as social media channels (e.g. social networks or forums) have become significant sources for brands to observe user opinions about their products, this task is thus increasingly crucial. However, when applied with real data obtained from social media, we notice that there is a high volume of short and informal messages posted by users on those channels. This kind of data makes the existing works suffer from many difficulties to handle, especially ones using deep learning approaches. In this paper, we propose an approach to handle this problem. This work is extended from our previous work, in which we proposed to combine the typical deep learning technique of Convolutional Neural Networks with domain knowledge. The combination is used for acquiring additional training data augmentation and a more reasonable loss function. In this work, we further improve our architecture by various substantial enhancements, including negation-based data augmentation, transfer learning for word embeddings, the combination of word-level embeddings and character-level embeddings, and using multitask learning technique for attaching domain knowledge rules in the learning process. Those enhancements, specifically aiming to handle short and informal messages, help us to enjoy significant improvement in performance once experimenting on real datasets.Comment: A Preprint of an article accepted for publication by Inderscience in IJCVR on September 201

arXiv.org e-Print Archive

Crossref

Domain-specific lexicon generation for emotion detection from text.

Author: Bandhakavi Anil
Publication venue
Publication date: 31/01/2018
Field of study

Emotions play a key role in effective and successful human communication. Text is popularly used on the internet and social media websites to express and share emotions, feelings and sentiments. However useful applications and services built to understand emotions from text are limited in effectiveness due to reliance on general purpose emotion lexicons that have static vocabulary and sentiment lexicons that can only interpret emotions coarsely. Thus emotion detection from text calls for methods and knowledge resources that can deal with challenges such as dynamic and informal vocabulary, domain-level variations in emotional expressions and other linguistic nuances. In this thesis we demonstrate how labelled (e.g. blogs, news headlines) and weakly-labelled (e.g. tweets) emotional documents can be harnessed to learn word-emotion lexicons that can account for dynamic and domain-specific emotional vocabulary. We model the characteristics of realworld emotional documents to propose a generative mixture model, which iteratively estimates the language models that best describe the emotional documents using expectation maximization (EM). The proposed mixture model has the ability to model both emotionally charged words and emotion-neutral words. We then generate a word-emotion lexicon using the mixture model to quantify word-emotion associations in the form of a probability vectors. Secondly we introduce novel feature extraction methods to utilize the emotion rich knowledge being captured by our word-emotion lexicon. The extracted features are used to classify text into emotion classes using machine learning. Further we also propose hybrid text representations for emotion classification that use the knowledge of lexicon based features in conjunction with other representations such as n-grams, part-of-speech and sentiment information. Thirdly we propose two different methods which jointly use an emotion-labelled corpus of tweets and emotion-sentiment mapping proposed in psychology to learn word-level numerical quantification of sentiment strengths over a positive to negative spectrum. Finally we evaluate all the proposed methods in this thesis through a variety of emotion detection and sentiment analysis tasks on benchmark data sets covering domains from blogs to news articles to tweets and incident reports

Open Access Institutional Repository at Robert Gordon University

Multitask Learning for Fine-Grained Twitter Sentiment Analysis

Author: Amini Massih-Reza
Balikas Georgios
Moura Simon
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 12/07/2017
Field of study

Traditional sentiment analysis approaches tackle problems like ternary (3-category) and fine-grained (5-category) classification by learning the tasks separately. We argue that such classification tasks are correlated and we propose a multitask approach based on a recurrent neural network that benefits by jointly learning them. Our study demonstrates the potential of multitask models on this type of problems and improves the state-of-the-art results in the fine-grained sentiment classification problem.Comment: International ACM SIGIR Conference on Research and Development in Information Retrieval 201

arXiv.org e-Print Archive

Crossref

UCL Discovery

Recommended from our members

Scientific Literacy in the digital age: tools, environments and resources for co-inquiry

Author: Achmetova Almira
Baildinova Klara
Kozhanova Saule
Kuanysheva Anar
Maukayeva Saule
Nuralinova Gulnar
Orazalina Ainash
Tusupova Karlygash
Zhanaspayev Marat
Zhunusova Aigul
Publication venue
Publication date: 01/01/2013
Field of study

This paper describes some European and International projects to promote Scientific Literacy in the digital age as well as technologies, environments and resources for co-inquiry. The aim of this research is also to describe computer applications, software tools and environments that were designed to support processes of collaborative inquiry learning to promote Scientific Literacy. These tools are analyzed by describing their interfaces and functionalities. The outcomes of this descriptive research points out some effects on student learning and competences developed known from the literature. This paper argues the importance of promoting scientific citizenship not only through schools and Universities (formal learning), but also non-credit online courses and community-based learning programmes (non-formal context), as well as daily life activities, educational open digital materials through social networks (informal scenario)

Open Research Online (The Open University)

European Scientific Journal, ESJ

European Scientific Journal (European Scientific Institute)