Search CORE

3,287 research outputs found

Valence, arousal and dominance estimation for English, German, Greek,Portuguese and Spanish lexica using semantic models

Author: Koutsakis P.
Losif E.
Palogiannidi E.
Potamianos A.
Publication venue
Publication date: 01/01/2015
Field of study

We propose and evaluate the use of an affective-semantic model to expand the affective lexica of German, Greek, English, Spanish and Portuguese. Motivated by the assumption that semantic similarity implies affective similarity, we use word level semantic similarity scores as semantic features to estimate their corresponding affective scores. Various context-based semantic similarity metrics are investigated using contextual features that include both words and character n-grams. The model produces continuous affective ratings in three dimensions (valence, arousal and dominance) for all five languages, achieving consistent performance. We achieve classification accuracy (valence polarity task) between 85% and 91% for all five languages. For morphologically rich languages the proposed use of character n-grams is shown to improve performance

Research Repository

Finding Function in Form: Compositional Character Models for Open Vocabulary Word Representation

Author: Amir Silvio
Astudillo Ramón Fernandez
Black Alan W.
Dyer Chris
Ling Wang
Luís Tiago
Marujo Luís
Trancoso Isabel
Publication venue
Publication date: 01/01/2015
Field of study

We introduce a model for constructing vector representations of words by composing characters using bidirectional LSTMs. Relative to traditional word representation models that have independent vectors for each word type, our model requires only a single vector per character type and a fixed set of parameters for the compositional model. Despite the compactness of this model and, more importantly, the arbitrary nature of the form-function relationship in language, our "composed" word representations yield state-of-the-art results in language modeling and part-of-speech tagging. Benefits over traditional baselines are particularly pronounced in morphologically rich languages (e.g., Turkish)

arXiv.org e-Print Archive