16 research outputs found
Shape Consistent 2D Keypoint Estimation under Domain Shift
Recent unsupervised domain adaptation methods based on deep architectures
have shown remarkable performance not only in traditional classification tasks
but also in more complex problems involving structured predictions (e.g.
semantic segmentation, depth estimation). Following this trend, in this paper
we present a novel deep adaptation framework for estimating keypoints under
domain shift}, i.e. when the training (source) and the test (target) images
significantly differ in terms of visual appearance. Our method seamlessly
combines three different components: feature alignment, adversarial training
and self-supervision. Specifically, our deep architecture leverages from
domain-specific distribution alignment layers to perform target adaptation at
the feature level. Furthermore, a novel loss is proposed which combines an
adversarial term for ensuring aligned predictions in the output space and a
geometric consistency term which guarantees coherent predictions between a
target sample and its perturbed version. Our extensive experimental evaluation
conducted on three publicly available benchmarks shows that our approach
outperforms state-of-the-art domain adaptation methods in the 2D keypoint
prediction task
Adversarial Learning and Self-Teaching Techniques for Domain Adaptation in Semantic Segmentation
Deep learning techniques have been widely used in autonomous driving systems
for the semantic understanding of urban scenes. However, they need a huge
amount of labeled data for training, which is difficult and expensive to
acquire. A recently proposed workaround is to train deep networks using
synthetic data, but the domain shift between real world and synthetic
representations limits the performance. In this work, a novel Unsupervised
Domain Adaptation (UDA) strategy is introduced to solve this issue. The
proposed learning strategy is driven by three components: a standard supervised
learning loss on labeled synthetic data; an adversarial learning module that
exploits both labeled synthetic data and unlabeled real data; finally, a
self-teaching strategy applied to unlabeled data. The last component exploits a
region growing framework guided by the segmentation confidence. Furthermore, we
weighted this component on the basis of the class frequencies to enhance the
performance on less common classes. Experimental results prove the
effectiveness of the proposed strategy in adapting a segmentation network
trained on synthetic datasets, like GTA5 and SYNTHIA, to real world datasets
like Cityscapes and Mapillary.Comment: Accepted at IEEE Transactions on Intelligent Vehicles (T-IV) 10
pages, 2 figures, 7 table