Search CORE

4,230 research outputs found

Invariant theory and scaling algorithms for maximum likelihood estimation

Author: Améndola Carlos
Kohn Kathlén
Reichenbach Philipp
Seigal Anna
Publication venue
Publication date: 01/01/2021
Field of study

We uncover connections between maximum likelihood estimation in statistics and norm minimization over a group orbit in invariant theory. We focus on Gaussian transformation families, which include matrix normal models and Gaussian graphical models given by transitive directed acyclic graphs. We use stability under group actions to characterize boundedness of the likelihood, and existence and uniqueness of the maximum likelihood estimate. Our approach reveals promising consequences of the interplay between invariant theory and statistics. In particular, existing scaling algorithms from statistics can be used in invariant theory, and vice versa.Comment: 34 pages; minor changes in comparison to version 2. The discrete part on log-linear models from version 1 is contained in the companion paper arXiv:2012.0779

arXiv.org e-Print Archive

Oxford University Research Archive

Sequences of regressions and their independences

Author: K. Sadeghi
Kayvan Sadeghi
N. Wermuth
N. Wermuth
Nanny Wermuth
Publication venue
Publication date: 01/01/2011
Field of study

Ordered sequences of univariate or multivariate regressions provide statistical models for analysing data from randomized, possibly sequential interventions, from cohort or multi-wave panel studies, but also from cross-sectional or retrospective studies. Conditional independences are captured by what we name regression graphs, provided the generated distribution shares some properties with a joint Gaussian distribution. Regression graphs extend purely directed, acyclic graphs by two types of undirected graph, one type for components of joint responses and the other for components of the context vector variable. We review the special features and the history of regression graphs, derive criteria to read all implied independences of a regression graph and prove criteria for Markov equivalence that is to judge whether two different graphs imply the same set of independence statements. Knowledge of Markov equivalence provides alternative interpretations of a given sequence of regressions, is essential for machine learning strategies and permits to use the simple graphical criteria of regression graphs on graphs for which the corresponding criteria are in general more complex. Under the known conditions that a Markov equivalent directed acyclic graph exists for any given regression graph, we give a polynomial time algorithm to find one such graph.Comment: 43 pages with 17 figures The manuscript is to appear as an invited discussion paper in the journal TES

arXiv.org e-Print Archive

CiteSeerX

Chalmers Research

Chalmers Publication Library

Mixed Cumulative Distribution Networks

Author: Blundell Charles
Silva Ricardo
Teh Yee Whye
Publication venue
Publication date: 31/08/2010
Field of study

Directed acyclic graphs (DAGs) are a popular framework to express multivariate probability distributions. Acyclic directed mixed graphs (ADMGs) are generalizations of DAGs that can succinctly capture much richer sets of conditional independencies, and are especially useful in modeling the effects of latent variables implicitly. Unfortunately there are currently no good parameterizations of general ADMGs. In this paper, we apply recent work on cumulative distribution networks and copulas to propose one one general construction for ADMG models. We consider a simple parameter estimation approach, and report some encouraging experimental results.Comment: 11 pages, 4 figure

arXiv.org e-Print Archive

CiteSeerX

UCL Discovery

Oxford University Research Archive

Distributional Equivalence and Structure Learning for Bow-free Acyclic Path Diagrams

Author: Bühlmann Peter
Evans Robin J.
Maathuis Marloes H.
Nowzohour Christopher
Publication venue
Publication date: 01/01/2017
Field of study

We consider the problem of structure learning for bow-free acyclic path diagrams (BAPs). BAPs can be viewed as a generalization of linear Gaussian DAG models that allow for certain hidden variables. We present a first method for this problem using a greedy score-based search algorithm. We also prove some necessary and some sufficient conditions for distributional equivalence of BAPs which are used in an algorithmic ap- proach to compute (nearly) equivalent model structures. This allows us to infer lower bounds of causal effects. We also present applications to real and simulated datasets using our publicly available R-package

arXiv.org e-Print Archive

Oxford University Research Archive