Search CORE

25 research outputs found

Learning from Samples of Variable Quality

Author: Dehghani M.
Gouws S.
Kamps J.
Mehrjou A.
Schölkopf B.
Publication venue
Publication date: 01/05/2019
Field of study

Training labels are expensive to obtain and may be of varying quality, as some may be from trusted expert labelers while others might be from heuristics or other sources of weak supervision such as crowd-sourcing. This creates a fundamental quality-versus-quantity trade-off in the learning process. Do we learn from the small amount of high-quality data or the potentially large amount of weakly-labeled data? We argue that if the learner could somehow know and take the label-quality into account, we could get the best of both worlds. To this end, we introduce “fidelity-weighted learning” (FWL), a semi-supervised student-teacher approach for training deep neural networks using weakly-labeled data. FWL modulates the parameter updates to a student network, trained on the task we care about on a per-sample basis according to the posterior confidence of its label-quality estimated by a teacher, who has access to limited samples with high-quality labels

International Migration, Integration and Social Cohesion online publications

Learning from Samples of Variable Quality

Author: Dehghani M.
Gouws S.
Kamps J.
Mehrjou A.
Schölkopf B.
Publication venue
Publication date: 01/05/2019
Field of study

International Migration, Integration and Social Cohesion online publications

Nonstationary GANs: Analysis as Nonautonomous Dynamical Systems

Author: Mehrjou A.
Schölkopf B.
Publication venue
Publication date: 01/01/2018
Field of study

MPG.PuRe

Deep Nonlinear Non-Gaussian Filtering for Dynamical Systems

Author: Mehrjou A.
Schölkopf B.
Publication venue
Publication date: 01/01/2018
Field of study

MPG.PuRe

Efficient Encoding of Dynamical Systems through Local Approximations

Author: Mehrjou A.
Schölkopf B.
Solowjow F.
Trimpe S.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2018
Field of study

MPG.PuRe

Tempered Adversarial Networks

Author: Mehrjou A.
Parascandolo G.
Sajjadi M.
Schölkopf B.
Publication venue
Publication date: 01/01/2018
Field of study

MPG.PuRe

Counterfactuals uncover the modular structure of deep generative models

Author: Besserve M.
Mehrjou A.
Schölkopf B.
Sun R.
Publication venue
Publication date: 01/04/2020
Field of study

Deep generative models can emulate the perceptual properties of complex image datasets, providing a latent representation of the data. However, manipulating such representation to perform meaningful and controllable transformations in the data space remains challenging without some form of supervision. While previous work has focused on exploiting statistical independence to \textit{disentangle} latent factors, we argue that such requirement can be advantageously relaxed and propose instead a non-statistical framework that relies on identifying a modular organization of the network, based on counterfactual manipulations. Our experiments support that modularity between groups of channels is achieved to a certain degree on a variety of generative models. This allowed the design of targeted interventions on complex image datasets, opening the way to applications such as computationally efficient style transfer and the automated assessment of robustness to contextual changes in pattern recognition systems

MPG.PuRe

Tempered Adversarial Networks

Author: Mehrjou A.
Parascandolo G.
Sajjadi M.
Schölkopf B.
Publication venue
Publication date: 01/01/2018
Field of study

MPG.PuRe

Counterfactuals uncover the modular structure of deep generative models

Author: Besserve M.
Mehrjou A.
Schoelkopf B.
Sun R.
Publication venue
Publication date: 01/01/2020
Field of study

MPG.PuRe

The Incomplete Rosetta Stone problem: Identifiability results for Multi-view Nonlinear ICA

Author: Gresele L.
Locatello F.
Mehrjou A.
Rubenstein P.
Schölkopf B.
Publication venue
Publication date: 01/01/2019
Field of study

We consider the problem of recovering a common latent source with independent components from multiple views. This applies to settings in which a variable is measured with multiple experimental modalities, and where the goal is to synthesize the disparate measurements into a single unified representation. We consider the case that the observed views are a nonlinear mixing of component-wise corruptions of the sources. When the views are considered separately, this reduces to nonlinear Independent Component Analysis (ICA) for which it is provably impossible to undo the mixing. We present novel identifiability proofs that this is possible when the multiple views are considered jointly, showing that the mixing can theoretically be undone using function approximators such as deep neural networks. In contrast to known identifiability results for nonlinear ICA, we prove that independent latent sources with arbitrary mixing can be recovered as long as multiple, sufficiently different noisy views are available

MPG.PuRe