
    Understanding Gradual Domain Adaptation: Improved Analysis, Optimal Path and Beyond

    Full text link
    The vast majority of existing algorithms for unsupervised domain adaptation (UDA) focus on adapting from a labeled source domain to an unlabeled target domain directly, in a one-off way. Gradual domain adaptation (GDA), on the other hand, assumes a path of $(T-1)$ unlabeled intermediate domains bridging the source and target, and aims to provide better generalization in the target domain by leveraging the intermediate ones. Under certain assumptions, Kumar et al. (2020) proposed a simple algorithm, Gradual Self-Training, along with a generalization bound of order $e^{O(T)}\left(\varepsilon_0 + O\left(\sqrt{\log(T)/n}\right)\right)$ for the target domain error, where $\varepsilon_0$ is the source domain error and $n$ is the data size of each domain. Due to the exponential factor, this upper bound becomes vacuous when $T$ is only moderately large. In this work, we analyze gradual self-training under more general and relaxed assumptions, and prove a significantly improved generalization bound of $\widetilde{O}\left(\varepsilon_0 + T\Delta + T/\sqrt{n} + 1/\sqrt{nT}\right)$, where $\Delta$ is the average distributional distance between consecutive domains. Compared with the existing bound, which has an exponential dependency on $T$ as a multiplicative factor, our bound depends on $T$ only linearly and additively. Perhaps more interestingly, our result implies the existence of an optimal choice of $T$ that minimizes the generalization error, and it also naturally suggests an optimal way to construct the path of intermediate domains so as to minimize the accumulated path length $T\Delta$ between the source and target. To corroborate the implications of our theory, we examine gradual self-training on multiple semi-synthetic and real datasets, which confirms our findings. We believe our insights provide a path forward toward the design of future GDA algorithms.
    Comment: The code will be released at https://github.com/Haoxiang-Wang/gradual-domain-adaptatio
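    The gradual self-training procedure analyzed above is simple enough to sketch directly. Below is a minimal Python sketch under stated assumptions: the LogisticRegression classifier and the from-scratch refit at each step are illustrative choices, not the authors' exact setup. The idea is to train on the labeled source, then walk the path of unlabeled domains, pseudo-labeling each one with the current model and retraining on those pseudo-labels.

    from sklearn.linear_model import LogisticRegression

    def gradual_self_train(X_source, y_source, intermediate_domains):
        """Gradual self-training: fit on labeled source data, then adapt
        through each unlabeled domain along the path via pseudo-labels."""
        model = LogisticRegression(max_iter=1000)
        model.fit(X_source, y_source)
        for X_t in intermediate_domains:   # (T-1) unlabeled domains, then target
            pseudo = model.predict(X_t)    # pseudo-label with the current model
            model = LogisticRegression(max_iter=1000)
            model.fit(X_t, pseudo)         # retrain on the pseudo-labeled domain
        return model                       # final model is evaluated on the target

    Read against the bound above, the number of steps $T$ trades off the error that accumulates along the path (the $T\Delta + T/\sqrt{n}$ terms) against the $1/\sqrt{nT}$ term, which is what makes an optimal choice of $T$ exist.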

    Unsupervised Domain Adaptation for 3D Keypoint Estimation via View Consistency

    Full text link
    In this paper, we introduce a novel unsupervised domain adaptation technique for the task of 3D keypoint prediction from a single depth scan or image. Our key idea is to utilize the fact that predictions from different views of the same or similar objects should be consistent with each other. Such view consistency can provide effective regularization for keypoint prediction on unlabeled instances. In addition, we introduce a geometric alignment term to regularize predictions in the target domain. The resulting loss function can be effectively optimized via alternating minimization. We demonstrate the effectiveness of our approach on real datasets and present experimental results showing that our approach is superior to state-of-the-art general-purpose domain adaptation techniques.
    Comment: ECCV 201
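    As a rough illustration of the view-consistency idea (not the paper's exact formulation, and covering only the consistency term, not the geometric alignment term), the PyTorch sketch below penalizes disagreement between keypoints predicted from two views of the same object once view 1's predictions are mapped into view 2's frame; the known relative pose (R_12, t_12) is an assumption of this sketch.

    import torch

    def view_consistency_loss(kpts_view1, kpts_view2, R_12, t_12):
        """kpts_view*: (B, K, 3) keypoints predicted in each view's frame.
        R_12 (B, 3, 3), t_12 (B, 3): relative pose from view 1 to view 2."""
        # Map view-1 predictions into view 2's frame and penalize disagreement.
        kpts_1_in_2 = torch.einsum('bij,bkj->bki', R_12, kpts_view1) + t_12.unsqueeze(1)
        return torch.mean(torch.sum((kpts_1_in_2 - kpts_view2) ** 2, dim=-1))

    Because the term needs no labels, it can regularize keypoint predictions on unlabeled target instances exactly as the abstract describes.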

    Unsupervised Domain Adaptation by Backpropagation

    Full text link
    Top-performing deep architectures are trained on massive amounts of labeled data. In the absence of labeled data for a certain task, domain adaptation often provides an attractive option, given that labeled data of a similar nature but from a different domain (e.g. synthetic images) are available. Here, we propose a new approach to domain adaptation in deep architectures that can be trained on a large amount of labeled data from the source domain and a large amount of unlabeled data from the target domain (no labeled target-domain data is necessary). As training progresses, the approach promotes the emergence of "deep" features that are (i) discriminative for the main learning task on the source domain and (ii) invariant with respect to the shift between the domains. We show that this adaptation behaviour can be achieved in almost any feed-forward model by augmenting it with a few standard layers and a simple new gradient reversal layer. The resulting augmented architecture can be trained using standard backpropagation. Overall, the approach can be implemented with little effort using any of the deep-learning packages. The method performs very well in a series of image classification experiments, achieving an adaptation effect in the presence of large domain shifts and outperforming the previous state-of-the-art on the Office datasets.
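    The gradient reversal layer described above acts as the identity in the forward pass and multiplies the gradient by a negative coefficient in the backward pass. A minimal PyTorch sketch follows; the fixed coefficient lambd is a simplification of this sketch (in practice the coefficient is typically annealed during training).

    import torch

    class GradReverse(torch.autograd.Function):
        """Identity in the forward pass; scales the gradient by -lambd in
        the backward pass, so the feature extractor is trained adversarially
        against the domain classifier."""
        @staticmethod
        def forward(ctx, x, lambd):
            ctx.lambd = lambd
            return x.view_as(x)

        @staticmethod
        def backward(ctx, grad_output):
            # Reverse (and scale) the gradient flowing back into the features.
            return -ctx.lambd * grad_output, None

    def grad_reverse(x, lambd=1.0):
        return GradReverse.apply(x, lambd)

    In use, the shared features feed the label predictor directly and the domain classifier through grad_reverse, so standard backpropagation simultaneously trains the features to be discriminative for the task and invariant across domains.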