5,374 research outputs found

    Twitter Activity Of Urban And Rural Colleges: A Sentiment Analysis Using The Dialogic Loop

    Get PDF
    The purpose of the present study is to ascertain if colleges are achieving their ultimate communication goals of maintaining and attracting students through their microblogging activity, which according to Dialogic Loop Theory, is directly correlated to the use of positive and negative sentiment. The study focused on a cross-section of urban and rural community colleges within the United States to identify the sentiment score of their microblogging activity. The study included a content analysis on the Twitter activity of these colleges. A data-mining process was employed to collect a census of the tweets associated with these colleges. Further processing was then applied using data linguistic software that removed all irrelevant text, word abbreviations, emoticons, and other Twitter specific classifiers. The resulting data set was then processed through a Multinomial Naive Bayes Classifier, which refers to a probability of word counts in a text. The classifier was trained using a data source of 1.5 million tweets, called Sentiment140, that qualitatively analyzed the corpus of these tweets, labeling them as positive and negative sentiment. The Multinomial Naive Bayes Classifier distinguished specific wording and phrases from the corpus, comparing the data to a specific database of sentiment word identifiers. The sentiment analysis process categorized the text as being positive or negative. Finally, statistical analysis was conducted on the outcome of the sentiment analysis. A significant contribution of the current work was extending Kent and Taylor\u27s (1998) Dialogic Loop Theory, which was designed specifically for identifying the relationship building capabilities of a Web site, to encompass the microblogging concept used in Twitter. Specifically, Dialogic Loop Theory is applied and enhanced to develop a model for social media communication to augment relationship building capabilities, which the current study established as a new form for evaluating Twitter tweets, labeled in the current body of work as Microblog Dialogic Communication. The implication is that by using Microblog Dialogic Communication, a college can address and correct their microblogging sentiment. The results of the data collected found that rural colleges tweeted more positive sentiment tweets and less negative sentiment tweets when compared to the urban colleges tweets

    An evaluation of the role of sentiment in second screen microblog search tasks

    Get PDF
    The recent prominence of the real-time web is proving both challenging and disruptive for information retrieval and web data mining research. User-generated content on the real-time web is perhaps best epitomised by content on microblogging platforms, such as Twitter. Given the substantial quantity of microblog posts that may be relevant to a user's query at a point in time, automated methods are required to sift through this information. Sentiment analysis offers a promising direction for modelling microblog content. We build and evaluate a sentiment-based filtering system using real-time user studies. We find a significant role played by sentiment in the search scenarios, observing detrimental effects in filtering out certain sentiment types. We make a series of observations regarding associations between document-level sentiment and user feedback, including associations with user profile attributes, and users' prior topic sentiment

    The Royal Birth of 2013: Analysing and Visualising Public Sentiment in the UK Using Twitter

    Full text link
    Analysis of information retrieved from microblogging services such as Twitter can provide valuable insight into public sentiment in a geographic region. This insight can be enriched by visualising information in its geographic context. Two underlying approaches for sentiment analysis are dictionary-based and machine learning. The former is popular for public sentiment analysis, and the latter has found limited use for aggregating public sentiment from Twitter data. The research presented in this paper aims to extend the machine learning approach for aggregating public sentiment. To this end, a framework for analysing and visualising public sentiment from a Twitter corpus is developed. A dictionary-based approach and a machine learning approach are implemented within the framework and compared using one UK case study, namely the royal birth of 2013. The case study validates the feasibility of the framework for analysis and rapid visualisation. One observation is that there is good correlation between the results produced by the popular dictionary-based approach and the machine learning approach when large volumes of tweets are analysed. However, for rapid analysis to be possible faster methods need to be developed using big data techniques and parallel methods.Comment: http://www.blessonv.com/research/publicsentiment/ 9 pages. Submitted to IEEE BigData 2013: Workshop on Big Humanities, October 201

    Automatic creation of stock market lexicons for sentiment analysis using StockTwits data

    Get PDF
    Sentiment analysis has been increasingly applied to the stock market domain. In particular, investor sentiment indicators can be used to model and predict stock market variables. In this context, the quality of the sentiment analysis is highly dependent of the opinion lexicon adopted. However, there is a lack of lexicons adjusted to microblogging stock market data. In this work, we propose an automatic procedure for the creation of such lexicon by exploring a large set of labeled messages from StockTwits, a popular financial microblogging service, and using four statistical measures: adaptations of the known TF-IDF, Information Gain, Class Percentage, and a newly proposed Weighted Class Probability. The obtained lexicons are competitive when compared with a set of six reference lexicons. Moreover, we verified that it is beneficial to use continuous sentiment scores instead of sentiment labels.We wish to thank StockTwits for kindly providing their data. This work has been supported by FCT - Fundacao para a Ciencia e Tecnologia within the Project Scope: PEst-OE/EEI/UI0319/2014

    Market Liquidity and Its Dimensions: Linking the Liquidity Dimensions to Sentiment Analysis through Microblogging Data

    Full text link
    [EN] Market liquidity has an immediate impact on the execution of transactions in financial markets. Informed counterparty risk is often priced into market liquidity. This study investigates whether microblogging data, as a non-financial information tool, is priced along with market liquidity dimensions. The analysis is based on the Australian Securities Exchange (ASX), and from the results, we conclude that microblogging content in pessimistic periods has a higher impact on liquidity and its dimensions. On a daily basis, pessimistic investor sentiments lead to higher trading costs, illiquidity, a larger price dispersion and a lower trading volume.Guijarro, F.; Moya Clemente, I.; Saleemi, J. (2021). Market Liquidity and Its Dimensions: Linking the Liquidity Dimensions to Sentiment Analysis through Microblogging Data. Journal of Risk and Financial Management (Online). 14(9):1-12. https://doi.org/10.3390/jrfm1409039411214

    Sentiment analysis and real-time microblog search

    Get PDF
    This thesis sets out to examine the role played by sentiment in real-time microblog search. The recent prominence of the real-time web is proving both challenging and disruptive for a number of areas of research, notably information retrieval and web data mining. User-generated content on the real-time web is perhaps best epitomised by content on microblogging platforms, such as Twitter. Given the substantial quantity of microblog posts that may be relevant to a user query at a given point in time, automated methods are required to enable users to sift through this information. As an area of research reaching maturity, sentiment analysis offers a promising direction for modelling the text content in microblog streams. In this thesis we review the real-time web as a new area of focus for sentiment analysis, with a specific focus on microblogging. We propose a system and method for evaluating the effect of sentiment on perceived search quality in real-time microblog search scenarios. Initially we provide an evaluation of sentiment analysis using supervised learning for classi- fying the short, informal content in microblog posts. We then evaluate our sentiment-based filtering system for microblog search in a user study with simulated real-time scenarios. Lastly, we conduct real-time user studies for the live broadcast of the popular television programme, the X Factor, and for the Leaders Debate during the Irish General Election. We find that we are able to satisfactorily classify positive, negative and neutral sentiment in microblog posts. We also find a significant role played by sentiment in many microblog search scenarios, observing some detrimental effects in filtering out certain sentiment types. We make a series of observations regarding associations between document-level sentiment and user feedback, including associations with user profile attributes, and users’ prior topic sentiment

    Sentiment Analysis Models for Mapping Public Engagement on Twitter Data

    Get PDF
    Unstructured data in the form of text, which is widely distributed on the internet, often has valuable information. Due to its unstructured form, an effort is needed to extract that information. Twitter is a microblogging social media platform used by many people to express their opinions or thoughts. Sentiment analysis is a way to map a sentence whether the value is positive or not. Sentiment analysis is a series of processes used to classify text documents into two classes, namely positive sentiment class and negative sentiment class. The dataset is obtained from sentiment 140 as training data to build the sentiment analysis model. To test the model, the data used by the crawler algorithm were extracted using the Twitter API. This study focuses on determining public sentiment based on their writing on Twitter. The classification model used in the study is multiclass naive Bayes. The TF-IDF method was also used to weigh the selected feature. The experimental results show that the resulting model has an accuracy of 74.16% with an average precision of 74%, a recall of 74%, and an f-measure of 74%

    Joint Distribution pada Weighted Majority Vote (WMV) untuk Peningkatan Kinerja Sentiment Analysis Tersupervisi pada Dataset Twitter

    Get PDF
    Sentiment analysis adalah teknik komputasi text mining berbasis natural language processing (NLP) untuk mengekstraksi pendapat seseorang yang diungkapkan dalam platform online, termasuk dalam platform microblogging Twitter, salah satu platform microblogging yang paling popular digunakan di Indonesia. Ada dua pendekatan yang umum digunakan dalam teknik sentiment analysis yaitu pendekatan berbasis machine learning (ML) dan pendekatan berbasis sentiment lexicon (SL). Fokus penelitian ini adalah untuk pengembangan teknik sentiment analysis berbasis machine learning yang disebut juga teknik tersupervisi pada dataset Twitter. Sebagian besar sentiment analysis pada dataset Twitter berbahasa Indonesia mengandalkan single machine learning algorithm. Penelitian ini menggabungkan kinerja berbagai algoritma/experts seraya mengurangi tingkat kesalahan klasifikasi dengan meng-update bobot secara dinamis menggunakan weighted majority vote (WMV) berbasis joint distribution dari Bayesian Network. Pada tahap pertama, data di grabbing dari Twitter dengan 3 hashtag terkait Covid-19 sebagai data eksperimen. Selanjutnya kinerja weighted majority vote secara ekstensif dibandingkan dengan 4 metode baseline sebagai pembanding, yaitu: Naïve Bayes, Gaussian Naïve Bayes, Multinomial Naïve Bayes dan Majority Vote dari ketiga single classifier tersebut. Metrics kinerja yang digunakan adalah precision, recall, fmeasure, accuracy dan Mathews correlation coeficient (MCCC). Dalam eksperimen, terbukti bahwa WMV mampu meningkatkan kinerja sentiment analysis pada ketiga topik dataset dengan evaluator berbagai metrics kinerja sentiment analysis. AbstractSentiment analysis is a computational text mining technique based on natural language processing (NLP) to extract someone's opinion expressed in online platforms, including the Twitter microblogging platform, one of the most popular microblogging platforms used in Indonesia. There are two approaches that are commonly used in sentiment analysis techniques, namely the machine learning (ML) based approach and the sentiment lexicon (SL) based approach. The focus of this research is the development of machine learning-based sentiment analysis techniques which are also called supervised techniques on the Twitter dataset. Most of the sentiment analysis on the Indonesian language Twitter dataset relies on a single machine learning algorithm. This study combines the performance of various algorithms/experts while reducing the level of misclassification by updating the weights dynamically using a joint distribution-based weighted majority vote (WMV) from the Bayesian Network. In the first stage, data was grabbed from Twitter with 3 hashtags related to Covid-19 as experimental data. Furthermore, the performance of the weighted majority vote was extensively compared with 4 baseline methods for comparison, namely: Naïve Bayes, Gaussian Naïve Bayes, Multinomial Nave Bayes and Majority Vote from the three single classifiers. Performance metrics used are precision, recall, fmeasure, accuracy and Mathews correlation coeficient. In experiments, it is proven that WMV is able to improve sentiment analysis performance on the three dataset topics with various evaluators of sentiment analysis performance metrics
    corecore