Search CORE

1,671 research outputs found

CentralNet: a Multilayer Approach for Multimodal Fusion

Author: A Dhall
D Lahat
M Kang
N Neverova
N Neverova
PK Atrey
S Chandar
S Escalera
Y LeCun
Z Gu
Publication venue
Publication date: 22/08/2018
Field of study

This paper proposes a novel multimodal fusion approach, aiming to produce best possible decisions by integrating information coming from multiple media. While most of the past multimodal approaches either work by projecting the features of different modalities into the same space, or by coordinating the representations of each modality through the use of constraints, our approach borrows from both visions. More specifically, assuming each modality can be processed by a separated deep convolutional network, allowing to take decisions independently from each modality, we introduce a central network linking the modality specific networks. This central network not only provides a common feature embedding but also regularizes the modality specific networks through the use of multi-task learning. The proposed approach is validated on 4 different computer vision tasks on which it consistently improves the accuracy of existing multimodal fusion approaches

arXiv.org e-Print Archive

HAL - Normandie Université

Crossref

Machine learning and deep learning for emotion recognition

Author: Sisquella Andrés Joan
Publication venue: Universitat Politècnica de Catalunya
Publication date: 03/10/2019
Field of study

Ús de diferents tècniques de deep learning per al reconeixement d'emocions a partir d'imatges i videos. Les diferents tècniques s'apliquen, es valoren i comparen amb l'objectiu de fer-les servir conjuntament en una aplicació final.Outgoin

UPCommons. Portal del coneixement obert de la UPC