Skip to main content
Article thumbnail
Location of Repository

Measuring praise and criticism: Inference of semantic orientation from association

By Peter Turney and Michael Littman

Abstract

The evaluative character of a word is called its semantic orientation. Positive semantic orientation indicates praise (e.g., "honest", "intrepid") and negative semantic orientation indicates criticism (e.g., "disturbing", "superfluous"). Semantic orientation varies in both direction (positive or negative) and degree (mild to strong). An automated system for measuring semantic orientation would have application in text classification, text filtering, tracking opinions in online discussions, analysis of survey responses, and automated chat systems (chatbots). This paper introduces a method for inferring the semantic orientation of a word from its statistical association with a set of positive and negative paradigm words. Two instances of this approach are evaluated, based on two different statistical measures of word association: pointwise mutual information (PMI) and latent semantic analysis (LSA). The method is experimentally tested with 3,596 words (including adjectives, adverbs, nouns, and verbs) that have been manually labeled positive (1,614 words) and negative (1,982 words). The method attains an accuracy of 82.8% on the full test set, but the accuracy rises above 95% when the algorithm is allowed to abstain from classifying mild words

Topics: Statistical Models, Language, Computational Linguistics, Semantics, Machine Learning
Year: 2003
OAI identifier: oai:cogprints.org:3164

Suggested articles

Citations

  1. (1997). A solution to Plato’s problem: The latent semantic analysis theory of the acquisition, induction, and representation of knowledge. doi
  2. (1957). A Synopsis of Linguistic Theory 1930-1955.
  3. (1993). Accurate methods for the statistics of surprise and coincidence.
  4. (1996). An introduction to categorical data analysis.
  5. (1992). Direction-based text interpretation as an information access refinement. In
  6. (2000). Effects of adjective orientation and gradability on sentence subjectivity.
  7. (1990). Indexing by latent semantic analysis.
  8. (2001). Language games and other meaningful pursuits. Presentation slides.
  9. (1996). Matrix Computations. Third edition. Johns Hopkins
  10. (2002). On the computational basis of learning and cognition: Arguments from LSA. doi
  11. (1997). Predicting the semantic orientation of adjectives.
  12. (1993). Retrieving collocations from Text: Xtract.
  13. (2001). Semantic distance in WordNet: An experimental, application-oriented evaluation of five measures.
  14. (1957). The Measurement of Meaning. doi
  15. (2002). Thumbs up? Sentiment classification using machine learning techniques.
  16. (1995). Using information content to evaluate semantic similarity in a taxonomy.
  17. (1989). Word association norms, mutual information and lexicography.
  18. (1993). Word space.
  19. (1990). WordNet: An on-line lexical database. doi
  20. (2002). Words with attitude.

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.