Skip to main content
Article thumbnail
Location of Repository

Exploring the use of linguistic features in sentiment analysis

By M. Genereux and M. Santini


In this paper we describe some explorations of the potential of genre-revealing features on automatic sentiment analysis. In particular, we use a small subset of the ‘linguistic facets’ employed in recent experiments on automatic genre identification in combination with more traditional sentiment-revealing features on two different single-genre corpora: a corpus of English blogs and a corpus of French reviews(relectures). Although still preliminary, results show that linguistic facets might have a positive influence on sentiment analysis because 6 out of 14 facets used in the experiments are among the first 22 most important discriminative features

Topics: Q100 Linguistics
Year: 2007
OAI identifier:

Suggested articles


  1. (1999). Longman Grammar of Spoken and Written English. doi
  2. (2007). Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classification.
  3. (2001). Semantic distance in wordnet: an experimental, application-oriented evaluation of five measures.
  4. (1989). Word association norms, mutual information and lexicography. doi
  5. (2007). Sentiment Polarity Identification in Financial News: A Cohesion-based Approach.
  6. (2000). Authorship attribution with support vector machines.
  7. (1972). Universal and cultural differences in facial expression of emotion. doi
  8. (2005). Determining the semantic orientation of terms through gloss analysis. doi
  9. (2006). Sentiwordnet: A publicly available lexical resource for opinion mining.
  10. (2007). PageRanking WordNet Synsets: An Application to Opinion Mining.
  11. (2006). Towards a validated model for affective classification of texts. doi
  12. (2007). Classification de textes français subjectifs.
  13. (2007). Opinion Mining using Econometrics: A Case Study on Reputation Systems.
  14. (2000). Effects of adjective orientation and gradability on sentence subjectivity. doi
  15. (1997). Text Categorization With Support Vector Machines: Learning With Many Relevant Features. Rapport Interne Ls8-Report 23, Universität Dortmund. Ls Viii-Report.
  16. (2007). Building Lexicon for Sentiment Analysis from Massive Collection of HTML Documents.
  17. (2002). Words With Attitude.
  18. (2007). Crystal: Analyzing Predictive Opinions on the Web.
  19. (2007). Extracting Aspect-Evaluation and AspectOf Relations in Opinion Mining.
  20. (2007). Test Collection Selection and Gold Standard Generation for a Multiply-Annotated Opinion Corpus. doi
  21. (2007). Structured Models for Fine-to-Coarse Sentiment Analysis.
  22. (2007). Weakly Supervised Learning for Hedge Classification in Scientific Literature.
  23. (2007). Learning Multilingual Subjective Language via Cross-Lingual Projections.
  24. (2007). Annotating Expressions of Appraisal in English. doi
  25. (2003). Learning Extraction Patterns For Subjective Expressions. doi
  26. (2005). Linguistic Facets For Genre and Text Type Identification: A Description Of Linguistically-Motivated Features.
  27. (2007). Automatic Identification Of Genre In Web Pages. Phd Thesis,
  28. (1966). The General Inquirer: A Computerapproach To Content Analysis. doi
  29. (2004). Wordnet-Affect: An Affective Extension Of Wordnet.
  30. (2002). Thumbs up or thumbs down? semantic orientation applied to unsupervised classification of reviews. doi
  31. (2005). Data Mining: Practical Machine Learning Tools and Techniques. doi
  32. (2007). Building Emotion Lexicon from Weblog Corpora. doi

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.