Search CORE

4,174 research outputs found

Model selection and local geometry

Author: Evans Robin J.
Publication venue
Publication date: 05/12/2019
Field of study

We consider problems in model selection caused by the geometry of models close to their points of intersection. In some cases---including common classes of causal or graphical models, as well as time series models---distinct models may nevertheless have identical tangent spaces. This has two immediate consequences: first, in order to obtain constant power to reject one model in favour of another we need local alternative hypotheses that decrease to the null at a slower rate than the usual parametric

n^{-1/2}

(typically we will require

n^{-1/4}

or slower); in other words, to distinguish between the models we need large effect sizes or very large sample sizes. Second, we show that under even weaker conditions on their tangent cones, models in these classes cannot be made simultaneously convex by a reparameterization. This shows that Bayesian network models, amongst others, cannot be learned directly with a convex method similar to the graphical lasso. However, we are able to use our results to suggest methods for model selection that learn the tangent space directly, rather than the model itself. In particular, we give a generic algorithm for learning Bayesian network models

arXiv.org e-Print Archive

Oxford University Research Archive

Graphical methods for inequality constraints in marginalized DAGs

Author: Evans Robin J.
Publication venue
Publication date: 01/01/2012
Field of study

We present a graphical approach to deriving inequality constraints for directed acyclic graph (DAG) models, where some variables are unobserved. In particular we show that the observed distribution of a discrete model is always restricted if any two observed variables are neither adjacent in the graph, nor share a latent parent; this generalizes the well known instrumental inequality. The method also provides inequalities on interventional distributions, which can be used to bound causal effects. All these constraints are characterized in terms of a new graphical separation criterion, providing an easy and intuitive method for their derivation.Comment: A final version will appear in the proceedings of the 22nd Workshop on Machine Learning and Signal Processing, 201

arXiv.org e-Print Archive

Oxford University Research Archive

Graphs for margins of Bayesian networks

Author: Evans Robin J.
Publication venue: 'Wiley'
Publication date: 21/08/2015
Field of study

Directed acyclic graph (DAG) models, also called Bayesian networks, impose conditional independence constraints on a multivariate probability distribution, and are widely used in probabilistic reasoning, machine learning and causal inference. If latent variables are included in such a model, then the set of possible marginal distributions over the remaining (observed) variables is generally complex, and not represented by any DAG. Larger classes of mixed graphical models, which use multiple edge types, have been introduced to overcome this; however, these classes do not represent all the models which can arise as margins of DAGs. In this paper we show that this is because ordinary mixed graphs are fundamentally insufficiently rich to capture the variety of marginal models. We introduce a new class of hyper-graphs, called mDAGs, and a latent projection operation to obtain an mDAG from the margin of a DAG. We show that each distinct marginal of a DAG model is represented by at least one mDAG, and provide graphical results towards characterizing when two such marginal models are the same. Finally we show that mDAGs correctly capture the marginal structure of causally-interpreted DAGs under interventions on the observed variables

arXiv.org e-Print Archive

CiteSeerX

Margins of discrete Bayesian networks

Author: Evans Robin J.
Publication venue: 'Institute of Mathematical Statistics'
Publication date: 30/01/2017
Field of study

Bayesian network models with latent variables are widely used in statistics and machine learning. In this paper we provide a complete algebraic characterization of Bayesian network models with latent variables when the observed variables are discrete and no assumption is made about the state-space of the latent variables. We show that it is algebraically equivalent to the so-called nested Markov model, meaning that the two are the same up to inequality constraints on the joint probabilities. In particular these two models have the same dimension. The nested Markov model is therefore the best possible description of the latent variable model that avoids consideration of inequalities, which are extremely complicated in general. A consequence of this is that the constraint finding algorithm of Tian and Pearl (UAI 2002, pp519-527) is complete for finding equality constraints. Latent variable models suffer from difficulties of unidentifiable parameters and non-regular asymptotics; in contrast the nested Markov model is fully identifiable, represents a curved exponential family of known dimension, and can easily be fitted using an explicit parameterization.Comment: 41 page

arXiv.org e-Print Archive

Oxford University Research Archive

Predicting and controlling the dynamics of infectious diseases

Author: Evans Robin J.
Mammadov Musa
Publication venue
Publication date: 01/01/2015
Field of study

This paper introduces a new optimal control model to describe and control the dynamics of infectious diseases. In the present model, the average time of isolation (i.e. hospitalization) of infectious population is the main time-dependent parameter that defines the spread of infection. All the preventive measures aim to decrease the average time of isolation under given constraints

arXiv.org e-Print Archive

Crossref

Deakin Research Online

Federation ResearchOnline

Dynamics of Ebola epidemics in West Africa 2014

Author: Evans Robin J.
Mammadov Musa
Publication venue
Publication date: 01/01/2014
Field of study

This paper investigates the dynamics of Ebola virus transmission in West Africa during 2014. The reproduction numbers for the total period of epidemic and for different consequent time intervals are estimated based on a newly suggested linear model. It contains one major variable - the average time of infectiousness (time from onset to hospitalization) that is considered as a parameter for controlling the future dynamics of epidemics. Numerical implementations are carried out on data collected from three countries Guinea, Sierra Leone and Liberia as well as the total data collected worldwide. Predictions are provided by considering different scenarios involving the average times of infectiousness for the next few months and the end of the current epidemic is estimated according to each scenario

arXiv.org e-Print Archive

Deakin Research Online

Directory of Open Access Journals

University of Melbourne Institutional Repository

Smooth, identifiable supermodels of discrete DAG models with latent variables

Author: Evans Robin J.
Richardson Thomas S.
Publication venue
Publication date: 30/01/2017
Field of study

We provide a parameterization of the discrete nested Markov model, which is a supermodel that approximates DAG models (Bayesian network models) with latent variables. Such models are widely used in causal inference and machine learning. We explicitly evaluate their dimension, show that they are curved exponential families of distributions, and fit them to data. The parameterization avoids the irregularities and unidentifiability of latent variable models. The parameters used are all fully identifiable and causally-interpretable quantities.Comment: 30 page

arXiv.org e-Print Archive