9,807 research outputs found
A convex pseudo-likelihood framework for high dimensional partial correlation estimation with convergence guarantees
Sparse high dimensional graphical model selection is a topic of much interest
in modern day statistics. A popular approach is to apply l1-penalties to either
(1) parametric likelihoods, or, (2) regularized regression/pseudo-likelihoods,
with the latter having the distinct advantage that they do not explicitly
assume Gaussianity. As none of the popular methods proposed for solving
pseudo-likelihood based objective functions have provable convergence
guarantees, it is not clear if corresponding estimators exist or are even
computable, or if they actually yield correct partial correlation graphs. This
paper proposes a new pseudo-likelihood based graphical model selection method
that aims to overcome some of the shortcomings of current methods, but at the
same time retain all their respective strengths. In particular, we introduce a
novel framework that leads to a convex formulation of the partial covariance
regression graph problem, resulting in an objective function comprised of
quadratic forms. The objective is then optimized via a coordinate-wise
approach. The specific functional form of the objective function facilitates
rigorous convergence analysis leading to convergence guarantees; an important
property that cannot be established using standard results, when the dimension
is larger than the sample size, as is often the case in high dimensional
applications. These convergence guarantees ensure that estimators are
well-defined under very general conditions, and are always computable. In
addition, the approach yields estimators that have good large sample properties
and also respect symmetry. Furthermore, application to simulated/real data,
timing comparisons and numerical convergence is demonstrated. We also present a
novel unifying framework that places all graphical pseudo-likelihood methods as
special cases of a more general formulation, leading to important insights
Inferring Multiple Graphical Structures
Gaussian Graphical Models provide a convenient framework for representing
dependencies between variables. Recently, this tool has received a high
interest for the discovery of biological networks. The literature focuses on
the case where a single network is inferred from a set of measurements, but, as
wetlab data is typically scarce, several assays, where the experimental
conditions affect interactions, are usually merged to infer a single network.
In this paper, we propose two approaches for estimating multiple related
graphs, by rendering the closeness assumption into an empirical prior or group
penalties. We provide quantitative results demonstrating the benefits of the
proposed approaches. The methods presented in this paper are embeded in the R
package 'simone' from version 1.0-0 and later
Factorial graphical lasso for dynamic networks
Dynamic networks models describe a growing number of important scientific
processes, from cell biology and epidemiology to sociology and finance. There
are many aspects of dynamical networks that require statistical considerations.
In this paper we focus on determining network structure. Estimating dynamic
networks is a difficult task since the number of components involved in the
system is very large. As a result, the number of parameters to be estimated is
bigger than the number of observations. However, a characteristic of many
networks is that they are sparse. For example, the molecular structure of genes
make interactions with other components a highly-structured and therefore
sparse process.
Penalized Gaussian graphical models have been used to estimate sparse
networks. However, the literature has focussed on static networks, which lack
specific temporal constraints. We propose a structured Gaussian dynamical
graphical model, where structures can consist of specific time dynamics, known
presence or absence of links and block equality constraints on the parameters.
Thus, the number of parameters to be estimated is reduced and accuracy of the
estimates, including the identification of the network, can be tuned up. Here,
we show that the constrained optimization problem can be solved by taking
advantage of an efficient solver, logdetPPA, developed in convex optimization.
Moreover, model selection methods for checking the sensitivity of the inferred
networks are described. Finally, synthetic and real data illustrate the
proposed methodologies.Comment: 30 pp, 5 figure
- …