Search CORE

3 research outputs found

A Semantic Relatedness Measure Based on Combined Encyclopedic, Ontological and Collocational Knowledge

Author: Haralambous Yannis
Klyuev Vitaly
Publication venue
Publication date: 19/08/2011
Field of study

We describe a new semantic relatedness measure combining the Wikipedia-based Explicit Semantic Analysis measure, the WordNet path measure and the mixed collocation index. Our measure achieves the currently highest results on the WS-353 test: a Spearman rho coefficient of 0.79 (vs. 0.75 in (Gabrilovich and Markovitch, 2007)) when applying the measure directly, and a value of 0.87 (vs. 0.78 in (Agirre et al., 2009)) when using the prediction of a polynomial SVM classifier trained on our measure. In the appendix we discuss the adaptation of ESA to 2011 Wikipedia data, as well as various unsuccessful attempts to enhance ESA by filtering at word, sentence, and section level.Comment: 6 pages, 6 figures, accepted for publication at IJCNLP2011 Conferenc

arXiv.org e-Print Archive

HAL-Université de Bretagne Occidentale

HAL Descartes

On negative results when using sentiment analysis tools for software engineering research

Author: AI Rousinopoulos
AJ Viera
Alexander Serebrenik
B Pang
B Vasilescu
C Giraud-Carrier
DW Zimmerman
E Brunner
F Konietschke
F Wilcoxon
FJ Shull
G Destefanis
J Cohen
JL Fleiss
KR Gabriel
L Hubert
M Hall
M Thelwall
M Thelwall
OJ Dunn
P Pritchard
P Tonella
Proshanta Sarkar
Robbert Jongeling
Subhajit Datta
WM Rand
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

Recent years have seen an increasing attention to social aspects of software engineering, including studies of emotions and sentiments experienced and expressed by the software developers. Most of these studies reuse existing sentiment analysis tools such as SentiStrength and NLTK. However, these tools have been trained on product reviews and movie reviews and, therefore, their results might not be applicable in the software engineering domain. In this paper we study whether the sentiment analysis tools agree with the sentiment recognized by human evaluators (as reported in an earlier study) as well as with each other. Furthermore, we evaluate the impact of the choice of a sentiment analysis tool on software engineering studies by conducting a simple study of differences in issue resolution times for positive, negative and neutral texts. We repeat the study for seven datasets (issue trackers and Stack Overflow questions) and different sentiment analysis tools and observe that the disagreement between the tools can lead to diverging conclusions. Finally, we perform two replications of previously published studies and observe that the results of those studies cannot be confirmed when a different sentiment analysis tool is used

Repository TU/e

Crossref

Springer - Publisher Connector

Pure OAI Repository

Institutional Knowledge at Singapore Management University