    Cross-domain sentiment classification using a sentiment sensitive thesaurus

    Automatic classification of sentiment is important for numerous applications such as opinion mining, opinion summarization, contextual advertising, and market analysis. However, sentiment is expressed differently in different domains, and annotating corpora for every possible domain of interest is costly. Applying a sentiment classifier trained on labeled data from one domain to classify the sentiment of user reviews in a different domain often results in poor performance. We propose a method to overcome this problem in cross-domain sentiment classification. First, we create a sentiment sensitive distributional thesaurus using labeled data for the source domains and unlabeled data for both source and target domains. Sentiment sensitivity is achieved in the thesaurus by incorporating document-level sentiment labels in the context vectors used as the basis for measuring the distributional similarity between words. Next, we use the created thesaurus to expand feature vectors at both training and test time in a binary classifier. The proposed method significantly outperforms numerous baselines and returns results that are comparable with previously proposed cross-domain sentiment classification methods. We conduct an extensive empirical analysis of the proposed method on single- and multi-source domain adaptation, unsupervised and supervised domain adaptation, and numerous similarity measures for creating the sentiment sensitive thesaurus.
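
The feature-expansion step described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `expand_features`, the `EXP=` feature prefix, the parameters `k` and `weight`, and the toy thesaurus entries are all assumptions made for illustration, and the abstract does not say how expansion candidates are ranked or weighted.

```python
from collections import Counter

def expand_features(doc_tokens, thesaurus, k=2, weight=0.5):
    """Append the top-k thesaurus neighbours of each token as extra features.

    `thesaurus` maps a word to a list of (neighbour, similarity) pairs,
    sorted by decreasing similarity. All names here are hypothetical.
    """
    counts = Counter(doc_tokens)
    expanded = dict(counts)  # start from the original bag-of-words features
    for token, count in counts.items():
        for neighbour, sim in thesaurus.get(token, [])[:k]:
            key = "EXP=" + neighbour  # keep expansions apart from observed words
            expanded[key] = expanded.get(key, 0.0) + weight * sim * count
    return expanded

# Toy thesaurus with made-up similarity scores (not taken from the paper).
thesaurus = {
    "excellent": [("great", 0.9), ("superb", 0.8)],
    "boring": [("dull", 0.7)],
}
vec = expand_features(["excellent", "plot"], thesaurus)
print(vec)
```

The same function would be applied to both training and test documents, so that a target-domain word unseen in training can still activate features learned from related source-domain words.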

    Deep Memory Networks for Attitude Identification

    We consider the task of identifying attitudes towards a given set of entities from text. Conventionally, this task is decomposed into two separate subtasks: target detection, which identifies whether each entity is mentioned in the text, either explicitly or implicitly, and polarity classification, which classifies the exact sentiment towards an identified entity (the target) as positive, negative, or neutral. Instead, we show that attitude identification can be solved with an end-to-end machine learning architecture, in which the two subtasks are interleaved by a deep memory network. In this way, signals produced in target detection provide clues for polarity classification, and conversely, the predicted polarity provides feedback to the identification of targets. Moreover, the treatments for the set of targets also influence each other: the learned representations may share the same semantics for some targets but vary for others. The proposed deep memory network, AttNet, outperforms methods that do not consider the interactions between the subtasks or those among the targets, including conventional machine learning methods and state-of-the-art deep learning models. Comment: Accepted to WSDM'1

    Exploring the Mechanisms of Information Sharing

    Online product ratings offer consumers information about products. In this dissertation, I explore how the design of the rating system impacts consumers’ sharing behavior and how different players are affected by rating mechanisms. The first two chapters investigate how consumers choose to share their experiences of different attributes, how their preferences are reflected in numerical ratings and textual reviews, whether and how multi-dimensional rating systems affect consumer satisfaction through product ratings, and whether and how multi-dimensional rating systems affect the interplay between numerical ratings and textual reviews. The identification strategy of the observational study hinges on a natural experiment on TripAdvisor, which reengineered its rating system from single-dimensional to multi-dimensional in January 2009. Rating data on the same set of restaurants from Yelp were used to identify the causal effect using a difference-in-differences approach. Text mining techniques were deployed to identify potential topics from textual reviews when consumers didn’t provide dimensional ratings, in both the single-dimensional (SD) and multi-dimensional (MD) systems. Results show that ratings in a single-dimensional rating system have a downward trend and a higher dispersion, whereas ratings in a multi-dimensional rating system are significantly higher and convergent. Textual reviews under the multi-dimensional system have greater breadth and depth than those under the single-dimensional system. The third chapter tries to uncover how the introduction of monetary incentives would influence different players in the online e-commerce market in the short term and in the long run. These three studies together contribute to the understanding of rating system and mechanism design and of the different players in the online market. Dissertation/Thesis: Doctoral Dissertation, Business Administration 201
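
The difference-in-differences logic mentioned in this abstract can be sketched as simple arithmetic. This is a bare-bones illustration, not the dissertation's estimator: the function name `diff_in_diff` is hypothetical, every rating value below is made up, and a real analysis would normally use a regression with controls and standard errors.

```python
from statistics import mean

def diff_in_diff(treated_pre, treated_post, control_pre, control_post):
    """Change in the treated group minus change in the control group."""
    treated_change = mean(treated_post) - mean(treated_pre)
    control_change = mean(control_post) - mean(control_pre)
    return treated_change - control_change

# Made-up average ratings for the same restaurants on two platforms:
# the "treated" platform changes its rating system, the "control" does not.
treated_pre, treated_post = [3.5, 3.7], [4.1, 4.3]
control_pre, control_post = [3.6, 3.8], [3.7, 3.9]

did = diff_in_diff(treated_pre, treated_post, control_pre, control_post)
print(round(did, 3))
```

Subtracting the control group's change removes any trend shared by both platforms, so the remainder is attributed to the rating-system change, provided the two groups would otherwise have moved in parallel.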

    Measure and Mitigate the Dimensional Bias in Online Reviews and Ratings

    Online word-of-mouth in the form of online reviews and ratings is an increasingly important resource for consumers to acquire product information for their purchase decisions. However, dimensional review bias, which originates from consumer heterogeneity and consumers' multidimensional product preferences and experiences, has been shown to undermine the information transfer among consumers. Through a novel text mining approach, we identify and quantify two types of dimensional bias from textual reviews: dimensional preference bias and dimensional rating bias. We also introduce a quantitative method to mitigate the dimensional rating bias. We examined the effectiveness and applicability of our bias measures and de-bias method in the context of multi-dimensional and single-dimensional rating systems. Specifically, we focused on the hotel reviews on TripAdvisor.com and Expedia.com. Our preliminary results suggest promising theoretical and managerial contributions.