Search CORE

148,443 research outputs found

A review of domain adaptation without target labels

Author: Kouw Wouter M.
Loog Marco
Publication venue
Publication date: 01/01/2019
Field of study

Domain adaptation has become a prominent problem setting in machine learning and related fields. This review asks the question: how can a classifier learn from a source domain and generalize to a target domain? We present a categorization of approaches, divided into, what we refer to as, sample-based, feature-based and inference-based methods. Sample-based methods focus on weighting individual observations during training based on their importance to the target domain. Feature-based methods revolve around on mapping, projecting and representing features such that a source classifier performs well on the target domain and inference-based methods incorporate adaptation into the parameter estimation procedure, for instance through constraints on the optimization procedure. Additionally, we review a number of conditions that allow for formulating bounds on the cross-domain generalization error. Our categorization highlights recurring ideas and raises questions important to further research.Comment: 20 pages, 5 figure

arXiv.org e-Print Archive

Crossref

Joint Distribution Optimal Transportation for Domain Adaptation

Author: Courty Nicolas
Flamary Rémi
Habrard Amaury
Rakotomamonjy Alain
Publication venue
Publication date: 22/10/2017
Field of study

This paper deals with the unsupervised domain adaptation problem, where one wants to estimate a prediction function

f

in a given target domain without any labeled sample by exploiting the knowledge available from a source domain where labels are known. Our work makes the following assumption: there exists a non-linear transformation between the joint feature/label space distributions of the two domain

\mathcal{P}_s

and

\mathcal{P}_t

. We propose a solution of this problem with optimal transport, that allows to recover an estimated target

\mathcal{P}^f_t=(X,f(X))

by optimizing simultaneously the optimal coupling and

f

. We show that our method corresponds to the minimization of a bound on the target error, and provide an efficient algorithmic solution, for which convergence is proved. The versatility of our approach, both in terms of class of hypothesis or loss functions is demonstrated with real world classification and regression problems, for which we reach or surpass state-of-the-art results.Comment: Accepted for publication at NIPS 201

arXiv.org e-Print Archive

HAL - Normandie Université

HAL-UJM

HAL-INSU