
    Chapter From the Lab to the Real World: Affect Recognition Using Multiple Cues and Modalities

    The interdisciplinary concept of the dissipative soliton is unfolded in connection with ultrafast fibre lasers. The different mode-locking techniques as well as experimental realizations of dissipative soliton fibre lasers are surveyed briefly, with an emphasis on their energy scalability. Basic topics of dissipative soliton theory are elucidated in connection with the concepts of energy scalability and stability. It is shown that the parametric space of the dissipative soliton has a reduced dimension and a comparatively simple structure, which simplifies the analysis and optimization of ultrafast fibre lasers. The main destabilization scenarios are described, and the limits of energy scalability are connected with the impact of optical turbulence and stimulated Raman scattering. The fast and slow dynamics of vector dissipative solitons are exposed.

    Multi-modality Empowered Network For Facial Action Unit Detection

    This paper presents a new thermal empowered multi-task network (TEMT-Net) to improve facial action unit detection. Our primary goal is to leverage the situation where the training set has multi-modality data while the application scenario has only one modality. Thermal images are robust to illumination and face color. In the proposed multi-task framework, we utilize data from both modalities. Action unit detection and facial landmark detection are correlated tasks. To exploit the advantages of and the correlations between different modalities and different tasks, we propose a novel thermal empowered multi-task deep neural network learning approach for action unit detection, facial landmark detection and thermal image reconstruction simultaneously. The thermal image generator and the facial landmark detector regularize the learned features, which are derived from the same input color images. Extensive experiments are conducted on the BP4D and MMSE databases, with comparison to state-of-the-art methods. The experiments show that the multi-modality framework improves AU detection significantly.
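    The abstract describes the architecture only at a high level; the following is a minimal PyTorch-style sketch of the general idea, with a shared encoder over the color image and separate heads for AU detection, landmark regression, and thermal image reconstruction. Module names, dimensions, and loss weights are illustrative assumptions, not the authors' implementation.

```python
# Hypothetical sketch of a thermal-empowered multi-task setup (not the authors' code).
# A shared encoder over the RGB image feeds three heads: AU detection (multi-label),
# facial landmark regression, and thermal image reconstruction. The thermal head is
# only supervised at training time, when paired thermal images are available.
import torch
import torch.nn as nn

class MultiTaskNet(nn.Module):
    def __init__(self, num_aus=12, num_landmarks=49):
        super().__init__()
        self.encoder = nn.Sequential(                          # shared feature extractor
            nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d((8, 8)),
        )
        feat_dim = 64 * 8 * 8
        self.au_head = nn.Linear(feat_dim, num_aus)            # multi-label AU logits
        self.lmk_head = nn.Linear(feat_dim, num_landmarks * 2) # (x, y) per landmark
        self.thermal_head = nn.Sequential(                     # reconstruct a thermal map
            nn.ConvTranspose2d(64, 32, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(32, 1, 4, stride=2, padding=1),
        )

    def forward(self, rgb):
        fmap = self.encoder(rgb)
        flat = fmap.flatten(1)
        return self.au_head(flat), self.lmk_head(flat), self.thermal_head(fmap)

# Joint loss: AU detection (BCE), landmark regression (L2), thermal reconstruction (L1);
# equal weighting here is an assumption for illustration.
def multitask_loss(au_logits, lmk_pred, thermal_pred, au_gt, lmk_gt, thermal_gt):
    return (nn.functional.binary_cross_entropy_with_logits(au_logits, au_gt)
            + nn.functional.mse_loss(lmk_pred, lmk_gt)
            + nn.functional.l1_loss(thermal_pred, thermal_gt))
```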

    Complementary Cohort Strategy for Multimodal Face Pair Matching


    Adaptive Multimodal Fusion For Facial Action Units Recognition

    Multimodal facial action unit (AU) recognition aims to build models that are capable of processing, correlating, and integrating information from multiple modalities (i.e., 2D images from a visual sensor, 3D geometry from 3D imaging, and thermal images from an infrared sensor). Although multimodal data can provide rich information, two challenges have to be addressed when learning from it: 1) the model must capture the complex cross-modal interactions in order to use the additional and mutual information effectively; 2) the model must be robust to unexpected data corruption at test time, such as a modality being missing or noisy. In this paper, we propose a novel Adaptive Multimodal Fusion method (AMF) for AU detection, which learns to select the most relevant feature representations from different modalities through a re-sampling procedure conditioned on a feature scoring module. The feature scoring module is designed to evaluate the quality of the features learned from each modality. As a result, AMF is able to adaptively select more discriminative features, thus increasing its robustness to missing or corrupted modalities. In addition, to alleviate over-fitting and help the model generalize better to the test data, a cut-switch multimodal data augmentation method is designed, in which a random block is cut and switched across multiple modalities. We have conducted a thorough investigation on two public multimodal AU datasets, BP4D and BP4D+, and the results demonstrate the effectiveness of the proposed method. Ablation studies under various circumstances also show that our method remains robust to missing or noisy modalities during testing.
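    The abstract names two mechanisms: score-conditioned selection of modality features and a cut-switch augmentation. The PyTorch sketch below approximates both under stated assumptions: the re-sampling procedure is replaced by a soft, score-weighted fusion, and the cut-switch function assumes all modalities are stacked into one tensor with a common channel count. Names and shapes are hypothetical, not the paper's code.

```python
# Hypothetical sketch of score-based multimodal feature fusion (not the paper's code).
# Each modality is encoded separately elsewhere; a small scoring module rates each
# modality's feature vector, and the fused representation is a score-weighted sum.
import torch
import torch.nn as nn

class ScoredFusion(nn.Module):
    def __init__(self, feat_dim=128, num_aus=12):
        super().__init__()
        self.scorer = nn.Sequential(nn.Linear(feat_dim, 64), nn.ReLU(),
                                    nn.Linear(64, 1))          # quality score per modality
        self.classifier = nn.Linear(feat_dim, num_aus)         # AU logits from fused feature

    def forward(self, feats):                  # feats: (batch, num_modalities, feat_dim)
        scores = self.scorer(feats).squeeze(-1)                # (batch, num_modalities)
        weights = torch.softmax(scores, dim=1).unsqueeze(-1)   # normalize across modalities
        fused = (weights * feats).sum(dim=1)                   # (batch, feat_dim)
        return self.classifier(fused)

# Cut-switch-style augmentation (assumed form): cut one random block and swap it
# between two modalities of the same sample, so the model sees mixed-modality patches.
# Assumes x holds all modalities with a shared channel count: (B, M, C, H, W).
def cut_switch(x, modality_a=0, modality_b=1, block=16):
    _, _, _, h, w = x.shape
    top = torch.randint(0, h - block + 1, (1,)).item()
    left = torch.randint(0, w - block + 1, (1,)).item()
    x = x.clone()
    patch_a = x[:, modality_a, :, top:top+block, left:left+block].clone()
    x[:, modality_a, :, top:top+block, left:left+block] = \
        x[:, modality_b, :, top:top+block, left:left+block]
    x[:, modality_b, :, top:top+block, left:left+block] = patch_a
    return x
```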

    ET-CycleGAN: Generating thermal images from images in the visible spectrum for facial emotion recognition

    Facial thermal imaging has in recent years been shown to be an effective modality for facial emotion recognition. However, the use of deep learning in this field is still not fully exploited, given the small number and size of the current datasets. The goal of this work is to improve the performance of existing deep networks in thermal facial emotion recognition by generating new synthesized thermal images from images in the visible spectrum (RGB). To address this challenging problem, we propose an emotion-guided thermal CycleGAN (ET-CycleGAN). This Generative Adversarial Network (GAN) regularizes the training with facial and emotion priors by extracting features from Convolutional Neural Networks (CNNs) trained for face recognition and facial emotion recognition, respectively. To assess this approach, we generated synthesized images from the training set of the USTC-NVIE dataset and added the new data to the training set as a data augmentation strategy. By including images generated using the ET-CycleGAN, the accuracy of emotion recognition increased by 10.9%. Our initial findings highlight the importance of adding priors related to training set image attributes (in our case, face and emotion priors) to ensure such attributes are preserved in the generated images.
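    The prior-based regularization can be illustrated with a short PyTorch sketch: frozen, pretrained face- and emotion-recognition networks extract features from the input RGB face and the generated thermal image, and the generator is penalized when those features drift apart. The extractor modules, loss form, and weighting are assumptions for illustration, not the ET-CycleGAN implementation.

```python
# Hypothetical sketch of prior-based regularization for an RGB-to-thermal generator
# (not the authors' code). Frozen identity and emotion networks act as feature priors;
# fake_thermal is assumed to be replicated to the channel count the extractors expect.
import torch
import torch.nn as nn

class PriorLoss(nn.Module):
    def __init__(self, face_net: nn.Module, emotion_net: nn.Module):
        super().__init__()
        self.face_net, self.emotion_net = face_net, emotion_net
        for net in (self.face_net, self.emotion_net):   # priors stay frozen
            for p in net.parameters():
                p.requires_grad_(False)

    def forward(self, real_rgb, fake_thermal):
        # Penalize drift of identity and emotion features between input and output.
        loss_face = nn.functional.l1_loss(self.face_net(fake_thermal),
                                          self.face_net(real_rgb))
        loss_emotion = nn.functional.l1_loss(self.emotion_net(fake_thermal),
                                             self.emotion_net(real_rgb))
        return loss_face + loss_emotion

# In a CycleGAN-style training step this term would simply be added to the usual
# adversarial and cycle-consistency losses, e.g.
#   loss = loss_gan + lambda_cyc * loss_cycle + lambda_prior * prior_loss(rgb, fake_thermal)
```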