5 research outputs found
GREAT3 results I: systematic errors in shear estimation and the impact of real galaxy morphology
We present first results from the third GRavitational lEnsing Accuracy
Testing (GREAT3) challenge, the third in a sequence of challenges for testing
methods of inferring weak gravitational lensing shear distortions from
simulated galaxy images. GREAT3 was divided into experiments to test three
specific questions, and included simulated space- and ground-based data with
constant or cosmologically-varying shear fields. The simplest (control)
experiment included parametric galaxies with a realistic distribution of
signal-to-noise, size, and ellipticity, and a complex point spread function
(PSF). The other experiments tested the additional impact of realistic galaxy
morphology, multiple exposure imaging, and the uncertainty about a
spatially-varying PSF; the last two questions will be explored in Paper II. The
24 participating teams competed to estimate lensing shears to within systematic
error tolerances for upcoming Stage-IV dark energy surveys, making 1525
submissions overall. GREAT3 saw considerable variety and innovation in the
types of methods applied. Several teams now meet or exceed the targets in many
of the tests conducted (to within the statistical errors). We conclude that the
presence of realistic galaxy morphology in simulations changes shear
calibration biases by per cent for a wide range of methods. Other
effects such as truncation biases due to finite galaxy postage stamps, and the
impact of galaxy type as measured by the S\'{e}rsic index, are quantified for
the first time. Our results generalize previous studies regarding sensitivities
to galaxy size and signal-to-noise, and to PSF properties such as seeing and
defocus. Almost all methods' results support the simple model in which additive
shear biases depend linearly on PSF ellipticity.Comment: 32 pages + 15 pages of technical appendices; 28 figures; submitted to
MNRAS; latest version has minor updates in presentation of 4 figures, no
changes in content or conclusion
Multi-source domain adaptation through dataset dictionary learning in wasserstein space
International audienceThis paper seeks to solve Multi-Source Domain Adaptation (MSDA), which aims to mitigate data distribution shifts when transferring knowledge from multiple labeled source domains to an unlabeled target domain. We propose a novel MSDA framework based on dictionary learning and optimal transport. We interpret each domain in MSDA as an empirical distribution. As such, we express each domain as a Wasserstein barycenter of dictionary atoms, which are empirical distributions. We propose a novel algorithm, DaDiL, for learning via mini-batches: (i) atom distributions; (ii) a matrix of barycentric coordinates. Based on our dictionary, we propose two novel methods for MSDA: DaDil-R, based on the reconstruction of labeled samples in the target domain, and DaDiL-E, based on the ensembling of classifiers learned on atom distributions. We evaluate our methods in 3 benchmarks: Caltech-Office, Office 31, and CRWU, where we improved previous state-of-the-art by 3.15%, 2.29%, and 7.71% in classification performance. Finally, we show that interpolations in the Wasserstein hull of learned atoms provide data that can generalize to the target domain
Cross-domain fault diagnosis through optimal transport for a CSTR process
Publisher Copyright: © 2022 Elsevier B.V.. All rights reserved.Fault diagnosis is a key task for developing safer control systems, especially in chemical plants. Nonetheless, acquiring good labeled fault data involves sampling from dangerous system conditions. A possible workaround to this limitation is to use simulation data for training data-driven fault diagnosis systems. However, due to modelling errors or unknown factors, simulation data may differ in distribution from real-world data. This setting is known as cross-domain fault diagnosis (CDFD). We use optimal transport for: (i) exploring how modelling errors relate to the distance between simulation (source) and real-world (target) data distributions, and (ii) matching source and target distributions through the framework of optimal transport for domain adaptation (OTDA), resulting in new training data that follows the target distribution. Comparisons show that OTDA outperforms other CDFD methods.Peer reviewe