Search CORE

38,663 research outputs found

Sentiment Recognition in Egocentric Photostreams

Author: EA Holmes
ES Dan-Glauser
P Foggia
S Poria
Y Yu
Publication venue
Publication date: 01/01/2017
Field of study

Lifelogging is a process of collecting rich source of information about daily life of people. In this paper, we introduce the problem of sentiment analysis in egocentric events focusing on the moments that compose the images recalling positive, neutral or negative feelings to the observer. We propose a method for the classification of the sentiments in egocentric pictures based on global and semantic image features extracted by Convolutional Neural Networks. We carried out experiments on an egocentric dataset, which we organized in 3 classes on the basis of the sentiment that is recalled to the user (positive, negative or neutral)

arXiv.org e-Print Archive

Crossref

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen

More cat than cute? Interpretable Prediction of Adjective-Noun Pairs

Author: Campos Victor
Chang Shih-Fu
Fernandez Delia
Giro-i-Nieto Xavier
Jou Brendan
Woodward Alejandro
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2017
Field of study

The increasing availability of affect-rich multimedia resources has bolstered interest in understanding sentiment and emotions in and from visual content. Adjective-noun pairs (ANP) are a popular mid-level semantic construct for capturing affect via visually detectable concepts such as "cute dog" or "beautiful landscape". Current state-of-the-art methods approach ANP prediction by considering each of these compound concepts as individual tokens, ignoring the underlying relationships in ANPs. This work aims at disentangling the contributions of the `adjectives' and `nouns' in the visual prediction of ANPs. Two specialised classifiers, one trained for detecting adjectives and another for nouns, are fused to predict 553 different ANPs. The resulting ANP prediction model is more interpretable as it allows us to study contributions of the adjective and noun components. Source code and models are available at https://imatge-upc.github.io/affective-2017-musa2/ .Comment: Oral paper at ACM Multimedia 2017 Workshop on Multimodal Understanding of Social, Affective and Subjective Attributes (MUSA2

arXiv.org e-Print Archive

Crossref

UPCommons. Portal del coneixement obert de la UPC

Visual Affect Around the World: A Large-scale Multilingual Visual Sentiment Ontology

Author: Balahur A.
Bautin M.
Dryer M. S.
Esuli A.
Güngördü Z.
Krizhevsky A.
Lang P.
Lee J. H.
McCarthy E. D.
Mesquita B.
Mihalcea R.
Mikolov T.
Plutchik R.
Schmid H.
Vessel E. A.
You Q.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 19/08/2015
Field of study

Every culture and language is unique. Our work expressly focuses on the uniqueness of culture and language in relation to human affect, specifically sentiment and emotion semantics, and how they manifest in social multimedia. We develop sets of sentiment- and emotion-polarized visual concepts by adapting semantic structures called adjective-noun pairs, originally introduced by Borth et al. (2013), but in a multilingual context. We propose a new language-dependent method for automatic discovery of these adjective-noun constructs. We show how this pipeline can be applied on a social multimedia platform for the creation of a large-scale multilingual visual sentiment concept ontology (MVSO). Unlike the flat structure in Borth et al. (2013), our unified ontology is organized hierarchically by multilingual clusters of visually detectable nouns and subclusters of emotionally biased versions of these nouns. In addition, we present an image-based prediction task to show how generalizable language-specific models are in a multilingual context. A new, publicly available dataset of >15.6K sentiment-biased visual concepts across 12 languages with language-specific detector banks, >7.36M images and their metadata is also released.Comment: 11 pages, to appear at ACM MM'1

arXiv.org e-Print Archive

Infoscience - École polytechnique fédérale de Lausanne

Crossref

CentralNet: a Multilayer Approach for Multimodal Fusion

Author: A Dhall
D Lahat
M Kang
N Neverova
N Neverova
PK Atrey
S Chandar
S Escalera
Y LeCun
Z Gu
Publication venue
Publication date: 22/08/2018
Field of study

This paper proposes a novel multimodal fusion approach, aiming to produce best possible decisions by integrating information coming from multiple media. While most of the past multimodal approaches either work by projecting the features of different modalities into the same space, or by coordinating the representations of each modality through the use of constraints, our approach borrows from both visions. More specifically, assuming each modality can be processed by a separated deep convolutional network, allowing to take decisions independently from each modality, we introduce a central network linking the modality specific networks. This central network not only provides a common feature embedding but also regularizes the modality specific networks through the use of multi-task learning. The proposed approach is validated on 4 different computer vision tasks on which it consistently improves the accuracy of existing multimodal fusion approaches

arXiv.org e-Print Archive

HAL - Normandie Université

Crossref

Group Affect Prediction Using Multimodal Distributions

Author: Rawat Bhanu Pratap Singh
Shamsi Saqib
Wadhwa Manya
Publication venue
Publication date: 12/03/2018
Field of study

We describe our approach towards building an efficient predictive model to detect emotions for a group of people in an image. We have proposed that training a Convolutional Neural Network (CNN) model on the emotion heatmaps extracted from the image, outperforms a CNN model trained entirely on the raw images. The comparison of the models have been done on a recently published dataset of Emotion Recognition in the Wild (EmotiW) challenge, 2017. The proposed method achieved validation accuracy of 55.23% which is 2.44% above the baseline accuracy, provided by the EmotiW organizers.Comment: This research paper has been accepted at Workshop on Computer Vision for Active and Assisted Living, WACV 201

arXiv.org e-Print Archive

Crossref