9,080 research outputs found
De-confounding Representation Learning for Counterfactual Inference on Continuous Treatment via Generative Adversarial Network
Counterfactual inference for continuous rather than binary treatment
variables is more common in real-world causal inference tasks. While there are
already some sample reweighting methods based on Marginal Structural Model for
eliminating the confounding bias, they generally focus on removing the
treatment's linear dependence on confounders and rely on the accuracy of the
assumed parametric models, which are usually unverifiable. In this paper, we
propose a de-confounding representation learning (DRL) framework for
counterfactual outcome estimation of continuous treatment by generating the
representations of covariates disentangled with the treatment variables. The
DRL is a non-parametric model that eliminates both linear and nonlinear
dependence between treatment and covariates. Specifically, we train the
correlations between the de-confounded representations and the treatment
variables against the correlations between the covariate representations and
the treatment variables to eliminate confounding bias. Further, a
counterfactual inference network is embedded into the framework to make the
learned representations serve both de-confounding and trusted inference.
Extensive experiments on synthetic datasets show that the DRL model performs
superiorly in learning de-confounding representations and outperforms
state-of-the-art counterfactual inference models for continuous treatment
variables. In addition, we apply the DRL model to a real-world medical dataset
MIMIC and demonstrate a detailed causal relationship between red cell width
distribution and mortality.Comment: 15 pages,4 figure
Learning Counterfactual Representations for Estimating Individual Dose-Response Curves
Estimating what would be an individual's potential response to varying levels
of exposure to a treatment is of high practical relevance for several important
fields, such as healthcare, economics and public policy. However, existing
methods for learning to estimate counterfactual outcomes from observational
data are either focused on estimating average dose-response curves, or limited
to settings with only two treatments that do not have an associated dosage
parameter. Here, we present a novel machine-learning approach towards learning
counterfactual representations for estimating individual dose-response curves
for any number of treatments with continuous dosage parameters with neural
networks. Building on the established potential outcomes framework, we
introduce performance metrics, model selection criteria, model architectures,
and open benchmarks for estimating individual dose-response curves. Our
experiments show that the methods developed in this work set a new
state-of-the-art in estimating individual dose-response
CausaLM: Causal Model Explanation Through Counterfactual Language Models
Understanding predictions made by deep neural networks is notoriously
difficult, but also crucial to their dissemination. As all ML-based methods,
they are as good as their training data, and can also capture unwanted biases.
While there are tools that can help understand whether such biases exist, they
do not distinguish between correlation and causation, and might be ill-suited
for text-based models and for reasoning about high level language concepts. A
key problem of estimating the causal effect of a concept of interest on a given
model is that this estimation requires the generation of counterfactual
examples, which is challenging with existing generation technology. To bridge
that gap, we propose CausaLM, a framework for producing causal model
explanations using counterfactual language representation models. Our approach
is based on fine-tuning of deep contextualized embedding models with auxiliary
adversarial tasks derived from the causal graph of the problem. Concretely, we
show that by carefully choosing auxiliary adversarial pre-training tasks,
language representation models such as BERT can effectively learn a
counterfactual representation for a given concept of interest, and be used to
estimate its true causal effect on model performance. A byproduct of our method
is a language representation model that is unaffected by the tested concept,
which can be useful in mitigating unwanted bias ingrained in the data.Comment: Our code and data are available at:
https://amirfeder.github.io/CausaLM/ Under review for the Computational
Linguistics journa
- …