145 research outputs found
Encouraging Intra-Class Diversity Through a Reverse Contrastive Loss for Better Single-Source Domain Generalization
Traditional deep learning algorithms often fail to generalize when they are
tested outside of the domain of training data. Because data distributions can
change dynamically in real-life applications once a learned model is deployed,
in this paper we are interested in single-source domain generalization (SDG)
which aims to develop deep learning algorithms able to generalize from a single
training domain where no information about the test domain is available at
training time. Firstly, we design two simple MNISTbased SDG benchmarks, namely
MNIST Color SDG-MP and MNIST Color SDG-UP, which highlight the two different
fundamental SDG issues of increasing difficulties: 1) a class-correlated
pattern in the training domain is missing (SDG-MP), or 2) uncorrelated with the
class (SDG-UP), in the testing data domain. This is in sharp contrast with the
current domain generalization (DG) benchmarks which mix up different
correlation and variation factors and thereby make hard to disentangle success
or failure factors when benchmarking DG algorithms. We further evaluate several
state-of-the-art SDG algorithms through our simple benchmark, namely MNIST
Color SDG-MP, and show that the issue SDG-MP is largely unsolved despite of a
decade of efforts in developing DG algorithms. Finally, we also propose a
partially reversed contrastive loss to encourage intra-class diversity and find
less strongly correlated patterns, to deal with SDG-MP and show that the
proposed approach is very effective on our MNIST Color SDG-MP benchmark
Risk of cancer in the vicinity of municipal solid waste incinerators: importance of using a flexible modelling strategy
<p>Abstract</p> <p>Background</p> <p>We conducted an ecological study in four French administrative departments and highlighted an excess risk in cancer morbidity for residents around municipal solid waste incinerators. The aim of this paper is to show how important are advanced tools and statistical techniques to better assess weak associations between the risk of cancer and past environmental exposures.</p> <p>Methods</p> <p>The steps to evaluate the association between the risk of cancer and the exposure to incinerators, from the assessment of exposure to the definition of the confounding variables and the statistical analysis carried out are detailed and discussed. Dispersion modelling was used to assess exposure to sixteen incinerators. A geographical information system was developed to define an index of exposure at the IRIS level that is the geographical unit we considered.</p> <p>Population density, rural/urban status, socio-economic deprivation, exposure to air pollution from traffic and from other industries were considered as potential confounding factors and defined at the IRIS level. Generalized additive models and Bayesian hierarchical models were used to estimate the association between the risk of cancer and the index of exposure to incinerators accounting for the confounding factors.</p> <p>Results</p> <p>Modelling to assess the exposure to municipal solid waste incinerators allowed accounting for factors known to influence the exposure (meteorological data, point source characteristics, topography). The statistical models defined allowed modelling extra-Poisson variability and also non-linear relationships between the risk of cancer and the exposure to incinerators and the confounders.</p> <p>Conclusion</p> <p>In most epidemiological studies distance is still used as a proxy for exposure. This can lead to significant exposure misclassification. Additionally, in geographical correlation studies the non-linear relationships are usually not accounted for in the statistical analysis. In studies of weak associations it is important to use advanced methods to better assess dose-response relationships with disease risk.</p
- …