40,600 research outputs found
Counting and Sampling from Markov Equivalent DAGs Using Clique Trees
A directed acyclic graph (DAG) is the most common graphical model for
representing causal relationships among a set of variables. When restricted to
using only observational data, the structure of the ground truth DAG is
identifiable only up to Markov equivalence, based on conditional independence
relations among the variables. Therefore, the number of DAGs equivalent to the
ground truth DAG is an indicator of the causal complexity of the underlying
structure--roughly speaking, it shows how many interventions or how much
additional information is further needed to recover the underlying DAG. In this
paper, we propose a new technique for counting the number of DAGs in a Markov
equivalence class. Our approach is based on the clique tree representation of
chordal graphs. We show that in the case of bounded degree graphs, the proposed
algorithm is polynomial time. We further demonstrate that this technique can be
utilized for uniform sampling from a Markov equivalence class, which provides a
stochastic way to enumerate DAGs in the equivalence class and may be needed for
finding the best DAG or for causal inference given the equivalence class as
input. We also extend our counting and sampling method to the case where prior
knowledge about the underlying DAG is available, and present applications of
this extension in causal experiment design and estimating the causal effect of
joint interventions
Geometry of the faithfulness assumption in causal inference
Many algorithms for inferring causality rely heavily on the faithfulness
assumption. The main justification for imposing this assumption is that the set
of unfaithful distributions has Lebesgue measure zero, since it can be seen as
a collection of hypersurfaces in a hypercube. However, due to sampling error
the faithfulness condition alone is not sufficient for statistical estimation,
and strong-faithfulness has been proposed and assumed to achieve uniform or
high-dimensional consistency. In contrast to the plain faithfulness assumption,
the set of distributions that is not strong-faithful has nonzero Lebesgue
measure and in fact, can be surprisingly large as we show in this paper. We
study the strong-faithfulness condition from a geometric and combinatorial
point of view and give upper and lower bounds on the Lebesgue measure of
strong-faithful distributions for various classes of directed acyclic graphs.
Our results imply fundamental limitations for the PC-algorithm and potentially
also for other algorithms based on partial correlation testing in the Gaussian
case.Comment: Published in at http://dx.doi.org/10.1214/12-AOS1080 the Annals of
Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical
Statistics (http://www.imstat.org
Causal Discovery with Continuous Additive Noise Models
We consider the problem of learning causal directed acyclic graphs from an
observational joint distribution. One can use these graphs to predict the
outcome of interventional experiments, from which data are often not available.
We show that if the observational distribution follows a structural equation
model with an additive noise structure, the directed acyclic graph becomes
identifiable from the distribution under mild conditions. This constitutes an
interesting alternative to traditional methods that assume faithfulness and
identify only the Markov equivalence class of the graph, thus leaving some
edges undirected. We provide practical algorithms for finitely many samples,
RESIT (Regression with Subsequent Independence Test) and two methods based on
an independence score. We prove that RESIT is correct in the population setting
and provide an empirical evaluation
- …