227 research outputs found

    Sentiment analytics: Lexicons construction and analysis

    Get PDF
    With the increasing amount of text data, sentiment analysis (SA) is becoming more and more important. An automated approach is needed to parse the online reviews and comments, and analyze their sentiments. Since lexicon is the most important component in SA, enhancing the quality of lexicons will improve the efficiency and accuracy of sentiment analysis. In this research, the effect of coupling a general lexicon with a specialized lexicon (for a specific domain) and its impact on sentiment analysis was presented. Two special domains and one general domain were studied. The two special domains are the petroleum domain and the biology domain. The general domain is the social network domain. The specialized lexicon for the petroleum domain was created as part of this research. The results, as expected, show that coupling a general lexicon with a specialized lexicon improves the sentiment analysis. However, coupling a general lexicon with another general lexicon does not improve the sentiment analysis --Abstract, page iii

    Machine Learning as a Tool for Wildlife Management and Research: The Case of Wild Pig-Related Content on Twitter

    Get PDF
    Wild pigs (Sus scrofa) are a non-native, invasive species that cause considerable damage and transmit a variety of diseases to livestock, people, and wildlife. We explored Twitter, the most popular social media micro-blogging platform, to demonstrate how social media data can be leveraged to investigate social identity and sentiment toward wild pigs. In doing so, we employed a sophisticated machine learning approach to investigate: (1) the overall sentiment associated with the dataset, (2) online identities via user profile descriptions, and (3) the extent to which sentiment varied by online identity. Results indicated that the largest groups of online identity represented in our dataset were females and people whose occupation was in journalism and media communication. While the majority of our data indicated a negative sentiment toward wild pigs and other related search terms, users who identified with agriculture-related occupations had more favorable sentiment. Overall, this article is an important starting point for further investigation of the use of social media data and social identity in the context of wild pigs and other invasive species

    Sentiment Analysis of News Tweets

    Get PDF
    Sentiment Analysis is a process of extracting information from a large amount of data and classifying them into different classes called sentiments. Python is a simple yet powerful, high-level, interpreted, and dynamic programming language, which is well known for its functionality of processing natural language data by using NLTK (Natural Language Toolkit). NLTK is a library of python, which provides a base for building programs and classification of data. NLTK also provides a graphical demonstration for representing various results or trends and it also provides sample data to train and test various classifiers respectively. Sentiment classification aims to automatically predict the sentiment polarity of users publishing sentiment data. Although traditional classification algorithms can be used to train sentiment classifiers from manually labeled text data, the labeling work can be time-consuming and expensive. Meanwhile, users often use different words when they express sentiment in different domains. If we directly apply a classifier trained in one domain to other domains, the performance will be very low due to the difference between these domains. In this work, we develop a general solution to sentiment classification when we do not have any labels in the target domain but have some labeled data in a different domain, regarded as the source domain. The purpose of this study is to analyze the tweets of the popular local and international news agencies and classify the tweeted news as positive, negative, or neutral categories

    Patient Health Care Opinion Systems using Ensemble Learning

    Get PDF
    The patients’ experience is considered a dominant reputation in the hospital administration and medical fields. Online patient reviews are recognized as an important criterion for evaluating hospital service quality and performance. The classical approach of evaluating service excellence is often found to be tedious. But with machine learning classifiers and opinion mining techniques the data assessing, and evaluation is made casual and its saves time. Currently, patient satisfaction and quality of service for patients in hospitals plays a major role in health care sector. In this paper a novel Ensemble Model is proposed to Analyze Patient Health Care Opinion Systems. The Classification models are used to classify patients’ feelings as positive, negative, or neutral using a machine learning approach to predict superlative models in data analysis. Ensemble techniques are used to analyze the opinions classified by the model, and the recommendation for health care is analyzed based on sentiment polarity

    Applying text mining techniques to forecast the stock market fluctuations of large it companies with twitter data: descriptive and predictive approaches to enhance the research of stock market predictions with textual and semantic data

    Get PDF
    Project Work presented as the partial requirement for obtaining a Master's degree in Information Management, specialization in Information Systems and Technologies ManagementThis research project applies advanced text mining techniques as a method to predict stock market fluctuations by merging published tweets and daily stock market prices for a set of American Information Technology companies. This project executes a systematical approach to investigate and further analyze, by using mainly R code, two main objectives: i) which are the descriptive criteria, patterns, and variables, which are correlated with the stock fluctuation and ii) does the single usage of tweets indicate moderate signal to predict with high accuracy the stock market fluctuations. The main supposition and expected output of the research work is to deliver findings about the twitter text significance and predictability power to indicate the importance of social media content in terms of stock market fluctuations by using descriptive and predictive data mining approaches, as natural language processing, topic modelling, sentiment analysis and binary classification with neural networks

    Impact of Social Media on Dubai Stock Market using Sentiment Analysis

    Get PDF
    One of the main objectives of having securities stock markets is to ensure fair trading. Our analysis of this study will show how sentiment analysis and text mining techniques can help stock markets to sense wipes in the market participants\u27 behaviors and how the market community can benefit from it. The ability to detect potential insiders and investors\u27 mood would help the stock market to take necessary actions to protect the trading environment and enhance investors’ trust in the market. In this project, we will be building a pilot proof of concept utilizing sentiment analysis on Twitter, one of the most popular social media applications, and Dubai Financial Market, one of the most active stock markets in the United Arab Emirates (UAE), in the English language. The project can grow in sophistication and coverage in the future. In this project, I am using R as a primary development tool where a statistical and visual analysis will be carried out utilizing its rich open community libraries
    • …
    corecore