Skip to main content
Article thumbnail
Location of Repository

Exploring the use of linguistic features in sentiment analysis

By M. Genereux and M. Santini


In this paper we describe some explorations of the potential of genre-revealing features on automatic sentiment analysis. In particular, we use a small subset of the ‘linguistic facets’ employed in recent experiments on automatic genre identification in combination with more traditional sentiment-revealing features on two different single-genre corpora: a corpus of English blogs and a corpus of French reviews(relectures). Although still preliminary, results show that linguistic facets might have a positive influence on sentiment analysis because 6 out of 14 facets used in the experiments are among the first 22 most important discriminative features

Topics: Q100 Linguistics
Year: 2007
OAI identifier:

Suggested articles


  1. (2007). Annotating Expressions of Appraisal in English. doi
  2. (2000). Authorship attribution with support vector machines.
  3. (2007). Automatic Identification Of Genre In Web Pages. Phd Thesis,
  4. (2007). Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classification.
  5. (2007). Building Emotion Lexicon from Weblog Corpora. doi
  6. (2007). Building Lexicon for Sentiment Analysis from Massive Collection of HTML Documents.
  7. (2007). Classification de textes français subjectifs.
  8. (2007). Crystal: Analyzing Predictive Opinions on the Web.
  9. (2005). Data Mining: Practical Machine Learning Tools and Techniques. doi
  10. (2005). Determining the semantic orientation of terms through gloss analysis. doi
  11. (2000). Effects of adjective orientation and gradability on sentence subjectivity. doi
  12. (2007). Extracting Aspect-Evaluation and AspectOf Relations in Opinion Mining.
  13. (2003). Learning Extraction Patterns For Subjective Expressions. doi
  14. (2007). Learning Multilingual Subjective Language via Cross-Lingual Projections.
  15. (2005). Linguistic Facets For Genre and Text Type Identification: A Description Of Linguistically-Motivated Features.
  16. (1999). Longman Grammar of Spoken and Written English. doi
  17. (2007). Opinion Mining using Econometrics: A Case Study on Reputation Systems.
  18. (2007). PageRanking WordNet Synsets: An Application to Opinion Mining.
  19. (2001). Semantic distance in wordnet: an experimental, application-oriented evaluation of five measures.
  20. (2007). Sentiment Polarity Identification in Financial News: A Cohesion-based Approach.
  21. (2006). Sentiwordnet: A publicly available lexical resource for opinion mining.
  22. (2007). Structured Models for Fine-to-Coarse Sentiment Analysis.
  23. (2007). Test Collection Selection and Gold Standard Generation for a Multiply-Annotated Opinion Corpus. doi
  24. (1997). Text Categorization With Support Vector Machines: Learning With Many Relevant Features. Rapport Interne Ls8-Report 23, Universität Dortmund. Ls Viii-Report.
  25. (1966). The General Inquirer: A Computerapproach To Content Analysis. doi
  26. (2002). Thumbs up or thumbs down? semantic orientation applied to unsupervised classification of reviews. doi
  27. (2006). Towards a validated model for affective classification of texts. doi
  28. (1972). Universal and cultural differences in facial expression of emotion. doi
  29. (2007). Weakly Supervised Learning for Hedge Classification in Scientific Literature.
  30. (1989). Word association norms, mutual information and lexicography. doi
  31. (2004). Wordnet-Affect: An Affective Extension Of Wordnet.
  32. (2002). Words With Attitude.

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.