Sentiment Analysis on Social Media Via Machine Learning

Abstract

Social media are shaping users\u27 attitudes and behaviors through spreading information anytime and anywhere. Monitoring user opinions on social media is an effective solution to measure users\u27 preferences towards brands or events. Currently, supervised machine learning-based methods dominate this area. However, as far as we know, there is no comprehensive comparison of performances of different models to figure out which model will be better for individual datasets. The focus of this thesis is to compare the performance of different supervised machine learning models. In detail, we built six classifiers, including support vector machine, random forest, neural network, Adaboost, decision tree, and Naive Bayes on two datasets and compare their performance. Furthermore, we introduced feature selection to remove unrelated attributes to preprocess the data and compare performance by building classifiers on the preprocessed data. Experimental results show that without feature selection, there is no significant difference in the performance. After feature selection, random forest outperformed other classifiers

    Similar works