The inference of networks of dependencies by Gaussian Graphical models on
high-throughput data is an open issue in modern molecular biology. In this
paper we provide a comparative study of three methods to obtain small sample
and high dimension estimates of partial correlation coefficients: the
Moore-Penrose pseudoinverse (PINV), residual correlation (RCM) and
covariance-regularized method (ℓ2C). We first compare them on simulated
datasets and we find that PINV is less stable in terms of AUC performance when
the number of variables changes. The two regularized methods have comparable
performances but ℓ2C is much faster than RCM. Finally, we present the
results of an application of ℓ2C for the inference of a gene network
for isoprenoid biosynthesis pathways in Arabidopsis thaliana.Comment: 7 pages, 1 figure, RevTex4, version to appear in the proceedings of
1st International Workshop on Pattern Recognition, Proteomics, Structural
Biology and Bioinformatics: PR PS BB 2011, Ravenna, Italy, 13 September 201