Search CORE

5,671 research outputs found

A comparative study of Bayesian models for unsupervised sentiment detection

Author: Everson Richard
He Yulan
Lin Chenghua
Publication venue
Publication date: 01/01/2010
Field of study

This paper presents a comparative study of three closely related Bayesian models for unsupervised document level sentiment classification, namely, the latent sentiment model (LSM), the joint sentimenttopic (JST) model, and the Reverse-JST model. Extensive experiments have been conducted on two corpora, the movie review dataset and the multi-domain sentiment dataset. It has been found that while all the three models achieve either better or comparable performance on these two corpora when compared to the existing unsupervised sentiment classification approaches, both JST and Reverse-JST are able to extract sentiment-oriented topics. In addition, Reverse-JST always performs worse than JST suggesting that the JST model is more appropriate for joint sentiment topic detection

CiteSeerX

Open Research Online (The Open University)

Sentiment Analysis of Spanish Words of Arabic Origin Related to Islam: A Social Network Analysis

Author: Mostafa Mohamed M.
Nebot Nicolás Roser
Publication venue: 'Academy Publication'
Publication date: 01/11/2017
Field of study

With the arrival of Muslims in 711 till their expulsion in the 1600s, Arabic language was present in Spain for more than eight centuries. Although social networks have become a valuable resource for mining sentiments, there is no previous research investigating the layman’s sentiment towards Spanish words of Arabic etymology related to Islamic terminology. This study aim at analyzing Spanish words of Arabic origin related to Islam. A random sample of 4586 out of 45860 tweets was used to evaluate general sentiment towards some Spanish words of Arabic origin related to Islam. An expert-predefined Spanish lexicon of around 6800 seed adjectives was used to conduct the analysis. Results indicate a generally positive sentiment towards several Spanish words of Arabic etymology related to Islam. By implementing both a qualitative and quantitative methodology to analyze tweets’ sentiments towards Spanish words of Arabic etymology, this research adds breadth and depth to the debate over Arabic linguistic influence on Spanish vocabulary

Text-based Emotion Aware Recommender

Author: Griva Igor
Kennedy William G.
Leung John Kalung
Publication venue: 'Academy and Industry Research Collaboration Center (AIRCC)'
Publication date: 28/07/2020
Field of study

We apply the concept of users' emotion vectors (UVECs) and movies' emotion vectors (MVECs) as building components of Emotion Aware Recommender System. We built a comparative platform that consists of five recommenders based on content-based and collaborative filtering algorithms. We employed a Tweets Affective Classifier to classify movies' emotion profiles through movie overviews. We construct MVECs from the movie emotion profiles. We track users' movie watching history to formulate UVECs by taking the average of all the MVECs from all the movies a user has watched. With the MVECs, we built an Emotion Aware Recommender as one of the comparative platforms' algorithms. We evaluated the top-N recommendation lists generated by these Recommenders and found the top-N list of Emotion Aware Recommender showed serendipity recommendations.Comment: 13 pages, 8 tables, International Conference on Natural Language Computing and AI (NLCAI2020) July25-26, London, United Kingdo

arXiv.org e-Print Archive

Crossref

Evaluation in Discourse: a Corpus-Based Study

Author: Asher Nicholas
Benamara Farah
Chardon Baptiste
Mathieu Yvette Yannick
Popescu Vladimir
Publication venue: University of Illinois at Chicago Library
Publication date: 01/01/2016
Field of study

This paper describes the CASOAR corpus, the first manually annotated corpus that explores the impact of discourse structure on sentiment analysis with a study of movie reviews in French and in English as well as letters to the editor in French. While annotating opinions at the expression, the sentence or the document level is a well-established task and relatively straightforward, discourse annotation remains difficult, especially for non-experts. Therefore, combining both annotations poses several methodological problems that we address here. We propose a multi-layered annotation scheme that includes: the complete discourse structure according to the Segmented Discourse Representation Theory, the opinion orientation of elementary discourse units and opinion expressions, and their associated features. We detail each layer, explore the interactions between them and discuss our results. In particular, we examine the correlation between discourse and semantic category of opinion expressions, the impact of discourse relations on both subjectivity and polarity analysis and the impact of discourse on the determination of the overall opinion of a document. Our results demonstrate that discourse is an important cue for sentiment analysis, at least for the corpus genres we have studied

University of Illinois at Chicago: Journals@UIC

Scientific Publications of the University of Toulouse II Le Mirail

Open Archive Toulouse Archive Ouverte

Hal-Diderot

Dialogue & Discourse (E-Journal - Universität Bielefeld)

Improving sentiment analysis via sentence type classification using BiLSTM-CRF and CNN

Author: Appel
Baccianella
Bengio
Brun
Carrillo-de Albornoz
Chaturvedi
Collobert
Crystal
Dong
Elman
Fernández-Gavilanes
Ganapathibhotla
González-Ibánez
Graves
Hatzivassiloglou
Hercig
Hinton
Hochreiter
Hu
Irsoy
Irsoy
Jia
Jiang
Jindal
Jindal
Kalchbrenner
Kessler
Kim
Kumar
Lafferty
Le
Liu
Liu
Liu
Liu
McCallum
Mitchell
Muhammad
Nakagawa
Narayanan
Pang
Pang
Pang
Park
Pontiki
Popescu
Poria
Poria
Qiu
Ravi
Rill
Riloff
Riloff
Ruifeng Xu
Schuster
Socher
Socher
Socher
Stoyanov
Tang
Tang
Tao Chen
Toh
Tsur
Turney
Vo
Wang
Wiebe
Wiebe
Wiebe
Xuan Wang
Yang
Yu
Yulan He
Zhang
Zhang
Álvarez López
Çetin
Publication venue: 'Elsevier BV'
Publication date: 15/04/2017
Field of study

Different types of sentences express sentiment in very different ways. Traditional sentence-level sentiment classification research focuses on one-technique-fits-all solution or only centers on one special type of sentences. In this paper, we propose a divide-and-conquer approach which first classifies sentences into different types, then performs sentiment analysis separately on sentences from each type. Specifically, we find that sentences tend to be more complex if they contain more sentiment targets. Thus, we propose to first apply a neural network based sequence model to classify opinionated sentences into three types according to the number of targets appeared in a sentence. Each group of sentences is then fed into a one-dimensional convolutional neural network separately for sentiment classification. Our approach has been evaluated on four sentiment classification datasets and compared with a wide range of baselines. Experimental results show that: (1) sentence type classification can improve the performance of sentence-level sentiment analysis; (2) the proposed approach achieves state-of-the-art results on several benchmarking datasets

Elsevier - Publisher Connector

Crossref

Aston Publications Explorer

On Being Negative

Author: AE Goldberg
B Ginsberg
B Liu
B Louw
B Pang
C Izard
C Perelman
CMIM Matthiessen
D Biber
D Biber
D Biber
D Garcia
D Kahneman
D Walton
FN Ribeiro
J Wiebe
JD Boucher
JH Greenberg
JR Martin
JR Martin
JR Martin
K Burke
L Alba-Juez
LR Horn
M Israel
M Taboada
MAK Halliday
MAK Halliday
MAK Halliday
MW Matlin
P Rozin
P Rozin
PS Dodds
R Mochales
R Trnavac
S Eggins
S Hunston
S Hunston
S Toulmin
T Borchers
V Curtis
WC Mann
WM Keith
Z Jing-Schmidt
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/04/2017
Field of study

This paper investigates the pragmatic expressions of negative evaluation (negativity) in two corpora: (i) comments posted online in response to newspaper opinion articles; and (ii) online reviews of movies, books and consumer products. We propose a taxonomy of linguistic resources that are deployed in the expression of negativity, with two broad groups at the top level of the taxonomy: resources from the lexicogrammar or from discourse semantics. We propose that rhetorical figures can be considered part of the discourse semantic resources used in the expression of negativity. Using our taxonomy as starting point, we carry out a corpus analysis, and focus on three phenomena: adverb + adjective combinations; rhetorical questions; and rhetorical figures. Although the analysis in this paper is corpus-assisted rather than corpus-driven, the final goal of our research is to make it quantitative, in extracting patterns and resources that can be detected automatically

Crossref

Simon Fraser University Institutional Repository