    Semantic Sentiment Analysis of Twitter Data

    The Internet and the proliferation of smart mobile devices have changed the way information is created, shared, and spread: microblogs such as Twitter, weblogs such as LiveJournal, social networks such as Facebook, and instant messengers such as Skype and WhatsApp are now commonly used to share thoughts and opinions about anything in the surrounding world. This has resulted in the proliferation of social media content, creating new opportunities to study public opinion at a scale that was never possible before. Naturally, this abundance of data has quickly attracted business and research interest from various fields, including marketing, political science, and social studies, which are interested in questions like these: Do people like the new Apple Watch? Do Americans support ObamaCare? How do the Scottish feel about Brexit? Answering these questions requires studying the sentiment of the opinions people express in social media, which has driven the fast growth of the field of sentiment analysis in social media, with Twitter being especially popular for research due to its scale, representativeness, the variety of topics discussed, and the ease of public access to its messages. Here we present an overview of work on sentiment analysis on Twitter. Comment: Microblog sentiment analysis; Twitter opinion mining; In the Encyclopedia on Social Network Analysis and Mining (ESNAM), Second edition. 201

    Implicit Sentiment Identification using Aspect based Opinion Mining

    Opinion mining, or sentiment analysis, is the computational study of opinions or emotions towards aspects or things. Aspects are attributes or components of individuals, events, topics, products, and organizations. Opinion mining has been an active research area in Web mining and Natural Language Processing (NLP) in recent years. With the explosive growth of e-commerce, millions of product options are available, and people tend to consult the opinions of others before buying a product. An aspect-based opinion mining approach helps in analyzing opinions about product features and attributes. This project extracts aspects and the related customer sentiments in the tourism domain, offering an approach to discover consumer preferences about tourism products and services using statistical opinion mining. The proposed system tries to extract both explicit and implicit aspects from customer reviews, which is intended to improve how well the sentiment orientation of opinions is captured. Most previous research has been based on the explicit opinions of customers; this system also tries to retrieve implicit sentiments. Given the growing availability of unstructured reviews, the proposed system summarizes the information obtained from the reviews in order to give customers precise, concise results. DOI: 10.17762/ijritcc2321-8169.16049

    Automatic Summarization in Chinese Product Reviews

    With the increasing number of online comments, it is hard for buyers to find useful information in a short time, so it makes sense to study automatic summarization, whose fundamental work is focused on mining product reviews. Previous studies mainly focused on extracting explicit features and often ignored implicit features, which are not stated clearly but contain information necessary for analyzing comments. How to quickly and accurately mine features from web reviews is therefore important for summarization technology. In this paper, explicit features and “feature-opinion” pairs in explicit sentences are extracted with a Conditional Random Field, and implicit product features are recognized by a bipartite graph model based on a random walk algorithm. The features and corresponding opinions are then incorporated into a structured text, and the abstract is generated from the extraction results. The experimental results demonstrate that the proposed methods outperform the baselines
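
The abstract above does not spell out its bipartite graph model, so the following is only a minimal sketch of the general idea under stated assumptions: opinion words and explicit features form the two sides of a graph weighted by co-occurrence counts, and a random walk with restart from an opinion word ranks candidate features for a sentence that names no feature. The function name `random_walk_scores`, the restart probability, and the toy pairs are all hypothetical.

```python
from collections import defaultdict


def random_walk_scores(pairs, opinion, restart=0.15, iterations=50):
    """Rank features for one opinion word by a random walk with restart.

    `pairs` is an iterable of (opinion_word, feature) tuples, assumed to come
    from explicit sentences. This is an illustrative scheme, not the paper's.
    """
    # Undirected, weighted bipartite graph: ("o", word) <-> ("f", feature).
    weight = defaultdict(lambda: defaultdict(float))
    for op, feat in pairs:
        weight[("o", op)][("f", feat)] += 1.0
        weight[("f", feat)][("o", op)] += 1.0

    start = ("o", opinion)
    if start not in weight:
        return {}  # unseen opinion word: nothing to propagate

    prob = {start: 1.0}
    for _ in range(iterations):
        nxt = defaultdict(float)
        nxt[start] += restart  # restart mass returns to the query word
        for node, p in prob.items():
            total = sum(weight[node].values())
            for neighbour, w in weight[node].items():
                nxt[neighbour] += (1.0 - restart) * p * w / total
        prob = dict(nxt)

    # Keep only the feature side of the graph.
    return {name: p for (side, name), p in prob.items() if side == "f"}


pairs = [
    ("expensive", "price"), ("expensive", "price"), ("cheap", "price"),
    ("bright", "screen"), ("expensive", "screen"),
]
scores = random_walk_scores(pairs, "expensive")
best_feature = max(scores, key=scores.get)
```

With these toy pairs, a sentence such as "too expensive" would be assigned the implicit feature `price`, because that feature accumulates the most probability mass from the walk.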

    Opinion mining: Reviewed from word to document level

    Opinion mining is one of the most challenging tasks in the field of information retrieval. The research community has published a number of articles on this topic, but a significant increase in interest has been observed during the past decade, especially after the launch of several online social networks. In this paper, we provide a detailed overview of the related work on opinion mining. The following features distinguish our review from similar works: (1) it presents a different perspective on the opinion mining field by discussing the work at different granularity levels (word, sentence, and document levels), (2) it discusses the related work in terms of the challenges of the field of opinion mining, (3) its document-level discussion of the related work gives an overview of the opinion mining task in the blogosphere, one of the most popular online social networks, and (4) it highlights the importance of online social networks for the opinion mining task and other related sub-tasks

    A Survey and Taxonomy of Sequential Recommender Systems for E-commerce Product Recommendation

    E-commerce recommendation systems facilitate customers’ purchase decisions by recommending products or services of interest (e.g., Amazon). Designing a recommender system tailored toward an individual customer’s needs is crucial for retailers to increase revenue and retain customers’ loyalty. As users’ interests and preferences change with time, the time stamp of a user interaction (click, view or purchase event) is an important characteristic for learning sequential patterns from these user interactions and, hence, understanding users’ long- and short-term preferences to predict the next item(s) for recommendation. This paper presents a taxonomy of sequential recommendation systems (SRecSys) with a focus on e-commerce product recommendation as an application and classifies SRecSys under three main categories: (i) traditional approaches (sequence similarity, frequent pattern mining and sequential pattern mining), (ii) factorization and latent representation (matrix factorization and Markov models) and (iii) neural network-based approaches (deep neural networks, advanced models). This classification contributes towards enhancing the understanding of existing SRecSys in the literature with the application domain of e-commerce product recommendation and provides the current status of the solutions available along with future research directions. Furthermore, a classification of the surveyed systems according to eight key features supported by the techniques, along with their limitations, is also presented. A comparative performance analysis of the presented SRecSys based on experiments performed on e-commerce data sets (Amazon and Online Retail) showed that integrating sequential purchase patterns into the recommendation process and modeling users’ sequential behavior improves the quality of recommendations
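
Of the categories named above, Markov models are the simplest to illustrate. The sketch below is a first-order Markov chain over interaction sequences, offered as a minimal example of the idea rather than any specific system from the survey; the function names and the toy sessions are hypothetical.

```python
from collections import Counter, defaultdict


def fit_transitions(sessions):
    """Count item-to-next-item transitions in time-ordered sessions."""
    counts = defaultdict(Counter)
    for session in sessions:
        for current, following in zip(session, session[1:]):
            counts[current][following] += 1
    return counts


def recommend_next(counts, last_item, k=2):
    """Return up to k (item, probability) pairs most likely to follow last_item."""
    followers = counts.get(last_item)
    if not followers:
        return []  # item never seen as a predecessor
    total = sum(followers.values())
    return [(item, n / total) for item, n in followers.most_common(k)]


# Toy sessions, each ordered by interaction time stamp.
sessions = [
    ["phone", "case", "charger"],
    ["phone", "case"],
    ["phone", "charger"],
]
counts = fit_transitions(sessions)
suggestions = recommend_next(counts, "phone")
```

Here `suggestions` ranks `case` ahead of `charger` after a `phone` interaction. A first-order chain only looks at the last item, which is why the survey's other categories (sequential pattern mining, neural models) exist to capture longer-range behavior.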

    A Method of Cross-Language Aspect-Oriented Analysis of Statements Using Machine Learning of a Categorization Model

    Product reviews are the foremost source of information for customers and manufacturers, helping them make appropriate purchasing and production decisions. Today, the Internet has become the largest source of consumer opinion. Sentiment analysis and opinion mining is the field of study that analyzes people’s opinions, sentiments, evaluations, attitudes, and emotions from written language. In this paper, we present a study of aspect-based opinion mining using a lexicon-based approach and its adaptation to the processing of responses written in Ukrainian and English. This information helps to build systems that understand customers’ feedback and plan business strategies accordingly. It also helps in predicting the chances of product failure. The paper explains how machine learning can be used for opinion mining. The research methods used in the work are based on data mining, Web mining, machine learning, and information retrieval. The stages of the method of cross-language aspect-oriented analysis of statements are presented. The cross-language categorization of the characteristics of goods is considered. The algorithm describes model learning on cross-language virtual contextual documents.
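
A lexicon-based, cross-language pass of the kind described above can be sketched as follows. This is only an illustration under assumptions: the tiny aspect and sentiment lexicons, the shared category labels, the window size, and the function name `aspect_sentiment` are all hypothetical and are not taken from the paper.

```python
import re

# Hypothetical lexicons: terms from both languages map to one shared category.
ASPECT_TERMS = {
    "battery": "battery", "батарея": "battery",
    "screen": "screen", "екран": "screen",
}
SENTIMENT = {
    "good": 1, "great": 1, "bad": -1, "poor": -1,
    "добрий": 1, "поганий": -1,
}


def aspect_sentiment(review, window=2):
    """Sum sentiment-word polarities found within `window` tokens of each aspect term."""
    tokens = re.findall(r"\w+", review.lower())
    scores = {}
    for i, token in enumerate(tokens):
        category = ASPECT_TERMS.get(token)
        if category is None:
            continue
        nearby = tokens[max(0, i - window): i + window + 1]
        polarity = sum(SENTIMENT.get(t, 0) for t in nearby)
        scores[category] = scores.get(category, 0) + polarity
    return scores


english = aspect_sentiment("The battery is great but the screen is poor")
ukrainian = aspect_sentiment("Екран поганий")
```

Because both languages map onto the same categories, `english` and `ukrainian` can be aggregated per aspect. A real system would need lemmatization for inflected Ukrainian forms, which this sketch omits.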

    Doctor of Philosophy in Computer Science

    Over the last decade, social media has emerged as a revolutionary platform for informal communication and social interactions among people. Publicly expressing thoughts, opinions, and feelings is one of the key characteristics of social media. In this dissertation, I present research on automatically acquiring knowledge from social media that can be used to recognize people's affective state (i.e., what someone feels at a given time) in text. This research addresses two types of affective knowledge: 1) hashtag indicators of emotion consisting of emotion hashtags and emotion hashtag patterns, and 2) affective understanding of similes (a form of figurative comparison). My research introduces a bootstrapped learning algorithm for learning hashtag indicators of emotions from tweets with respect to five emotion categories: Affection, Anger/Rage, Fear/Anxiety, Joy, and Sadness/Disappointment. With a few seed emotion hashtags per emotion category, the bootstrapping algorithm iteratively learns new hashtags and more generalized hashtag patterns by analyzing emotion in tweets that contain these indicators. Emotion phrases are also harvested from the learned indicators to train additional classifiers that use the surrounding word context of the phrases as features. This is the first work to learn hashtag indicators of emotions. My research also presents a supervised classification method for classifying affective polarity of similes in Twitter. Using lexical, semantic, and sentiment properties of different simile components as features, supervised classifiers are trained to classify a simile into a positive or negative affective polarity class. The property of comparison is also fundamental to the affective understanding of similes. My research introduces a novel framework for inferring implicit properties that 1) uses syntactic constructions, statistical association, dictionary definitions and word embedding vector similarity to generate and rank candidate properties, 2) re-ranks the top properties using influence from multiple simile components, and 3) aggregates the ranks of each property from different methods to create a final ranked list of properties. The inferred properties are used to derive additional features for the supervised classifiers to further improve affective polarity recognition. Experimental results show substantial improvements in affective understanding of similes over the use of existing sentiment resources
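
The third step of the framework, aggregating ranks from different methods, can be illustrated with a simple mean-rank scheme. The dissertation abstract does not say which aggregation it uses, so the averaging rule, the penalty for missing candidates, the function name `aggregate_ranks`, and the toy rankings below are all assumptions.

```python
def aggregate_ranks(rankings):
    """Merge several ranked lists into one list ordered by mean rank.

    A property missing from a list is assigned that list's length + 1,
    i.e. one position worse than its last entry (an assumed convention).
    """
    candidates = set().union(*rankings)

    def mean_rank(prop):
        total = 0
        for ranking in rankings:
            if prop in ranking:
                total += ranking.index(prop) + 1
            else:
                total += len(ranking) + 1
        return total / len(rankings)

    # Ties are broken alphabetically so the output is deterministic.
    return sorted(candidates, key=lambda p: (mean_rank(p), p))


# Hypothetical candidate properties for a simile, one list per method.
rankings = [
    ["slow", "small", "quiet"],   # e.g. statistical association
    ["small", "slow"],            # e.g. dictionary definitions
    ["slow", "quiet", "small"],   # e.g. embedding similarity
]
final_ranking = aggregate_ranks(rankings)
# final_ranking == ["slow", "small", "quiet"]
```

Mean rank is only one option; weighted schemes or reciprocal-rank fusion would fit the same interface.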