3,691 research outputs found
Detection of regulator genes and eQTLs in gene networks
Genetic differences between individuals associated to quantitative phenotypic
traits, including disease states, are usually found in non-coding genomic
regions. These genetic variants are often also associated to differences in
expression levels of nearby genes (they are "expression quantitative trait
loci" or eQTLs for short) and presumably play a gene regulatory role, affecting
the status of molecular networks of interacting genes, proteins and
metabolites. Computational systems biology approaches to reconstruct causal
gene networks from large-scale omics data have therefore become essential to
understand the structure of networks controlled by eQTLs together with other
regulatory genes, and to generate detailed hypotheses about the molecular
mechanisms that lead from genotype to phenotype. Here we review the main
analytical methods and softwares to identify eQTLs and their associated genes,
to reconstruct co-expression networks and modules, to reconstruct causal
Bayesian gene and module networks, and to validate predicted networks in
silico.Comment: minor revision with typos corrected; review article; 24 pages, 2
figure
Pathologic gene network rewiring implicates PPP1R3A as a central regulator in pressure overload heart failure
Heart failure is a leading cause of mortality, yet our understanding of the genetic interactions underlying this disease remains incomplete. Here, we harvest 1352 healthy and failing human hearts directly from transplant center operating rooms, and obtain genome-wide genotyping and gene expression measurements for a subset of 313. We build failing and non-failing cardiac regulatory gene networks, revealing important regulators and cardiac expression quantitative trait loci (eQTLs). PPP1R3A emerges as a regulator whose network connectivity changes significantly between health and disease. RNA sequencing after PPP1R3A knockdown validates network-based predictions, and highlights metabolic pathway regulation associated with increased cardiomyocyte size and perturbed respiratory metabolism. Mice lacking PPP1R3A are protected against pressure-overload heart failure. We present a global gene interaction map of the human heart failure transition, identify previously unreported cardiac eQTLs, and demonstrate the discovery potential of disease-specific networks through the description of PPP1R3A as a central regulator in heart failure
ENNET: inferring large gene regulatory networks from expression data using gradient boosting
Background
The regulation of gene expression by transcription factors is a key determinant of cellular phenotypes. Deciphering genome-wide networks that capture which transcription factors regulate which genes is one of the major efforts towards understanding and accurate modeling of living systems. However, reverse-engineering the network from gene expression profiles remains a challenge, because the data are noisy, high dimensional and sparse, and the regulation is often obscured by indirect connections. Results
We introduce a gene regulatory network inference algorithm ENNET, which reverse-engineers networks of transcriptional regulation from a variety of expression profiles with a superior accuracy compared to the state-of-the-art methods. The proposed method relies on the boosting of regression stumps combined with a relative variable importance measure for the initial scoring of transcription factors with respect to each gene. Then, we propose a technique for using a distribution of the initial scores and information about knockouts to refine the predictions. We evaluated the proposed method on the DREAM3, DREAM4 and DREAM5 data sets and achieved higher accuracy than the winners of those competitions and other established methods. Conclusions
Superior accuracy achieved on the three different benchmark data sets shows that ENNET is a top contender in the task of network inference. It is a versatile method that uses information about which gene was knocked-out in which experiment if it is available, but remains the top performer even without such information. ENNET is available for download fromhttps://github.com/slawekj/ennet webcite under the GNU GPLv3 license
Statistical modelling of transcript profiles of differentially regulated genes
Background: The vast quantities of gene expression profiling data produced in microarray studies, and
the more precise quantitative PCR, are often not statistically analysed to their full potential. Previous
studies have summarised gene expression profiles using simple descriptive statistics, basic analysis of
variance (ANOVA) and the clustering of genes based on simple models fitted to their expression profiles
over time. We report the novel application of statistical non-linear regression modelling techniques to
describe the shapes of expression profiles for the fungus Agaricus bisporus, quantified by PCR, and for E.
coli and Rattus norvegicus, using microarray technology. The use of parametric non-linear regression models
provides a more precise description of expression profiles, reducing the "noise" of the raw data to
produce a clear "signal" given by the fitted curve, and describing each profile with a small number of
biologically interpretable parameters. This approach then allows the direct comparison and clustering of
the shapes of response patterns between genes and potentially enables a greater exploration and
interpretation of the biological processes driving gene expression.
Results: Quantitative reverse transcriptase PCR-derived time-course data of genes were modelled. "Splitline"
or "broken-stick" regression identified the initial time of gene up-regulation, enabling the classification
of genes into those with primary and secondary responses. Five-day profiles were modelled using the
biologically-oriented, critical exponential curve, y(t) = A + (B + Ct)Rt + ε. This non-linear regression
approach allowed the expression patterns for different genes to be compared in terms of curve shape,
time of maximal transcript level and the decline and asymptotic response levels. Three distinct regulatory
patterns were identified for the five genes studied. Applying the regression modelling approach to
microarray-derived time course data allowed 11% of the Escherichia coli features to be fitted by an
exponential function, and 25% of the Rattus norvegicus features could be described by the critical
exponential model, all with statistical significance of p < 0.05.
Conclusion: The statistical non-linear regression approaches presented in this study provide detailed
biologically oriented descriptions of individual gene expression profiles, using biologically variable data to
generate a set of defining parameters. These approaches have application to the modelling and greater
interpretation of profiles obtained across a wide range of platforms, such as microarrays. Through careful
choice of appropriate model forms, such statistical regression approaches allow an improved comparison
of gene expression profiles, and may provide an approach for the greater understanding of common
regulatory mechanisms between genes
Netter: re-ranking gene network inference predictions using structural network properties
Background: Many algorithms have been developed to infer the topology of gene regulatory networks from gene expression data. These methods typically produce a ranking of links between genes with associated confidence scores, after which a certain threshold is chosen to produce the inferred topology. However, the structural properties of the predicted network do not resemble those typical for a gene regulatory network, as most algorithms only take into account connections found in the data and do not include known graph properties in their inference process. This lowers the prediction accuracy of these methods, limiting their usability in practice.
Results: We propose a post-processing algorithm which is applicable to any confidence ranking of regulatory interactions obtained from a network inference method which can use, inter alia, graphlets and several graph-invariant properties to re-rank the links into a more accurate prediction. To demonstrate the potential of our approach, we re-rank predictions of six different state-of-the-art algorithms using three simple network properties as optimization criteria and show that Netter can improve the predictions made on both artificially generated data as well as the DREAM4 and DREAM5 benchmarks. Additionally, the DREAM5 E. coli. community prediction inferred from real expression data is further improved. Furthermore, Netter compares favorably to other post-processing algorithms and is not restricted to correlation-like predictions. Lastly, we demonstrate that the performance increase is robust for a wide range of parameter settings. Netter is available at http://bioinformatics. intec. ugent. be.
Conclusions: Network inference from high-throughput data is a long-standing challenge. In this work, we present Netter, which can further refine network predictions based on a set of user-defined graph properties. Netter is a flexible system which can be applied in unison with any method producing a ranking from omics data. It can be tailored to specific prior knowledge by expert users but can also be applied in general uses cases. Concluding, we believe that Netter is an interesting second step in the network inference process to further increase the quality of prediction
Data- and knowledge-based modeling of gene regulatory networks: an update
Gene regulatory network inference is a systems biology approach which predicts interactions between genes with the help of high-throughput data. In this review, we present current and updated network inference methods focusing on novel techniques for data acquisition, network inference assessment, network inference for interacting species and the integration of prior knowledge. After the advance of Next-Generation-Sequencing of cDNAs derived from RNA samples (RNA-Seq) we discuss in detail its application to network inference. Furthermore, we present progress for large-scale or even full-genomic network inference as well as for small-scale condensed network inference and review advances in the evaluation of network inference methods by crowdsourcing. Finally, we reflect the current availability of data and prior knowledge sources and give an outlook for the inference of gene regulatory networks that reflect interacting species, in particular pathogen-host interactions
- …