26 research outputs found
Comparison of meta-learners for estimating multi-valued treatment heterogeneous effects
Conditional Average Treatment Effects (CATE) estimation is one of the main
challenges in causal inference with observational data. In addition to Machine
Learning based-models, nonparametric estimators called meta-learners have been
developed to estimate the CATE with the main advantage of not restraining the
estimation to a specific supervised learning method. This task becomes,
however, more complicated when the treatment is not binary as some limitations
of the naive extensions emerge. This paper looks into meta-learners for
estimating the heterogeneous effects of multi-valued treatments. We consider
different meta-learners, and we carry out a theoretical analysis of their error
upper bounds as functions of important parameters such as the number of
treatment levels, showing that the naive extensions do not always provide
satisfactory results. We introduce and discuss meta-learners that perform well
as the number of treatments increases. We empirically confirm the strengths and
weaknesses of those methods with synthetic and semi-synthetic datasets.Comment: 42 pages, 9 figures, to appear in ICML 2023 conferenc
Dynamique et trame du peuplement dans la région de Fréjus (Var, France) entre le Vème et le VIIIème siècles de n. è.
International audienc
Robust prediction interval estimation for Gaussian processes by cross-validation method
International audienc
Robust Prediction Interval estimation for Gaussian Processes by Cross-Validation method
Probabilistic regression models typically use the Maximum Likelihood
Estimation or Cross-Validation to fit parameters. These methods can give an
advantage to the solutions that fit observations on average, but they do not
pay attention to the coverage and the width of Prediction Intervals. A robust
two-step approach is used to address the problem of adjusting and calibrating
Prediction Intervals for Gaussian Processes Regression. First, the covariance
hyperparameters are determined by a standard Cross-Validation or Maximum
Likelihood Estimation method. A Leave-One-Out Coverage Probability is
introduced as a metric to adjust the covariance hyperparameters and assess the
optimal type II Coverage Probability to a nominal level. Then a relaxation
method is applied to choose the hyperparameters that minimize the Wasserstein
distance between the Gaussian distribution with the initial hyperparameters
(obtained by Cross-Validation or Maximum Likelihood Estimation) and the
proposed Gaussian distribution with the hyperparameters that achieve the
desired Coverage Probability. The method gives Prediction Intervals with
appropriate coverage probabilities and small widths.Comment: Revised versio
Comparison of meta-learners for estimating multi-valued treatment heterogeneous effects
International audienceConditional Average Treatment Effects (CATE) estimation is one of the main challenges in causal inference with observational data. In addition to Machine Learning based-models, nonparametric estimators called meta-learners have been developed to estimate the CATE with the main advantage of not restraining the estimation to a specific supervised learning method. This task becomes, however, more complicated when the treatment is not binary as some limitations of the naive extensions emerge. This paper looks into meta-learners for estimating the heterogeneous effects of multivalued treatments. We consider different metalearners, and we carry out a theoretical analysis of their error upper bounds as functions of important parameters such as the number of treatment levels, showing that the naive extensions do not always provide satisfactory results. We introduce and discuss meta-learners that perform well as the number of treatments increases. We empirically confirm the strengths and weaknesses of those methods with synthetic and semi-synthetic datasets
Heterogeneous Treatment Effects Estimation: When Machine Learning meets multiple treatments regime
International audienc
Comparison of meta-learners for estimating multi-valued treatment heterogeneous effects
International audienceConditional Average Treatment Effects (CATE) estimation is one of the main challenges in causal inference with observational data. In addition to Machine Learning based-models, nonparametric estimators called meta-learners have been developed to estimate the CATE with the main advantage of not restraining the estimation to a specific supervised learning method. This task becomes, however, more complicated when the treatment is not binary as some limitations of the naive extensions emerge. This paper looks into meta-learners for estimating the heterogeneous effects of multivalued treatments. We consider different metalearners, and we carry out a theoretical analysis of their error upper bounds as functions of important parameters such as the number of treatment levels, showing that the naive extensions do not always provide satisfactory results. We introduce and discuss meta-learners that perform well as the number of treatments increases. We empirically confirm the strengths and weaknesses of those methods with synthetic and semi-synthetic datasets
Geostatistics on stratigraphic grids
International audienceFor stratigraphic grids, the traditional method used to model properties does not consider the volume of the cells and ignores the volumetric distortion between physical and depositional space. Therefore, it introduces some biases. A sampling based method allows to solve this problem for variogram-based geostatistics