62,308 research outputs found
Transferable Positive/Negative Speech Emotion Recognition via Class-wise Adversarial Domain Adaptation
Speech emotion recognition plays an important role in building more
intelligent and human-like agents. Due to the difficulty of collecting speech
emotional data, an increasingly popular solution is leveraging a related and
rich source corpus to help address the target corpus. However, domain shift
between the corpora poses a serious challenge, making domain shift adaptation
difficult to function even on the recognition of positive/negative emotions. In
this work, we propose class-wise adversarial domain adaptation to address this
challenge by reducing the shift for all classes between different corpora.
Experiments on the well-known corpora EMODB and Aibo demonstrate that our
method is effective even when only a very limited number of target labeled
examples are provided.Comment: 5 pages, 3 figures, accepted to ICASSP 201
A Deep Cascade Network for Unaligned Face Attribute Classification
Humans focus attention on different face regions when recognizing face
attributes. Most existing face attribute classification methods use the whole
image as input. Moreover, some of these methods rely on fiducial landmarks to
provide defined face parts. In this paper, we propose a cascade network that
simultaneously learns to localize face regions specific to attributes and
performs attribute classification without alignment. First, a weakly-supervised
face region localization network is designed to automatically detect regions
(or parts) specific to attributes. Then multiple part-based networks and a
whole-image-based network are separately constructed and combined together by
the region switch layer and attribute relation layer for final attribute
classification. A multi-net learning method and hint-based model compression is
further proposed to get an effective localization model and a compact
classification model, respectively. Our approach achieves significantly better
performance than state-of-the-art methods on unaligned CelebA dataset, reducing
the classification error by 30.9%
- …