43 research outputs found

    NEAT: An efficient network enrichment analysis test

    Get PDF
    Background: Network enrichment analysis is a powerful method, which allows to integrate gene enrichment analysis with the information on relationships between genes that is provided by gene networks. Existing tests for network enrichment analysis deal only with undirected networks, they can be computationally slow and are based on normality assumptions. Results: We propose NEAT, a test for network enrichment analysis. The test is based on the hypergeometric distribution, which naturally arises as the null distribution in this context. NEAT can be applied not only to undirected, but to directed and partially directed networks as well. Our simulations indicate that NEAT is considerably faster than alternative resampling-based methods, and that its capacity to detect enrichments is at least as good as the one of alternative tests. We discuss applications of NEAT to network analyses in yeast by testing for enrichment of the Environmental Stress Response target gene set with GO Slim and KEGG functional gene sets, and also by inspecting associations between functional sets themselves. Conclusions: NEAT is a flexible and efficient test for network enrichment analysis that aims to overcome some limitations of existing resampling-based tests. The method is implemented in the R package neat, which can be freely downloaded from CRAN ( https://cran.r-project.org/package=neat )

    Integrating Flux Balance Analysis into Kinetic Models to Decipher the Dynamic Metabolism of Shewanella oneidensis MR-1

    Get PDF
    Shewanella oneidensis MR-1 sequentially utilizes lactate and its waste products (pyruvate and acetate) during batch culture. To decipher MR-1 metabolism, we integrated genome-scale flux balance analysis (FBA) into a multiple-substrate Monod model to perform the dynamic flux balance analysis (dFBA). The dFBA employed a static optimization approach (SOA) by dividing the batch time into small intervals (i.e., ∼400 mini-FBAs), then the Monod model provided time-dependent inflow/outflow fluxes to constrain the mini-FBAs to profile the pseudo-steady-state fluxes in each time interval. The mini-FBAs used a dual-objective function (a weighted combination of “maximizing growth rate” and “minimizing overall flux”) to capture trade-offs between optimal growth and minimal enzyme usage. By fitting the experimental data, a bi-level optimization of dFBA revealed that the optimal weight in the dual-objective function was time-dependent: the objective function was constant in the early growth stage, while the functional weight of minimal enzyme usage increased significantly when lactate became scarce. The dFBA profiled biologically meaningful dynamic MR-1 metabolisms: 1. the oxidative TCA cycle fluxes increased initially and then decreased in the late growth stage; 2. fluxes in the pentose phosphate pathway and gluconeogenesis were stable in the exponential growth period; and 3. the glyoxylate shunt was up-regulated when acetate became the main carbon source for MR-1 growth

    Evidence for Community Transmission of Community-Associated but Not Health-Care-Associated Methicillin-Resistant Staphylococcus Aureus Strains Linked to Social and Material Deprivation: Spatial Analysis of Cross-sectional Data

    Get PDF
    Supporting information for a paper titled "Evidence for Community Transmission of Community-Associated but Not Health-Care-Associated Methicillin-Resistant Staphylococcus Aureus Strains Linked to Social and Material Deprivation: Spatial Analysis of Cross-sectional Data". This includes: an MS Word document that describes the modelling approach (S1 Methods), Summary statistics for area-level variables in 513 LSOAs within catchment areas of the hospital cohort (S1 Table), data from the 2011 England and Wales census that outlines population structure of 513 LSOAs within catchment areas of the hospital cohort. (S2 Table), Individual patient-level metadata (S1 Text), and LSOA-level aggregated metadata (S2 Text

    Unsupervised GRN ensemble

    No full text
    Inferring gene regulatory networks from expression data is a very challenging problem that has raised the interest of the scientific community. Different algorithms have been proposed to try to solve this issue, but it has been shown that different methods have some particular biases and strengths, and none of them is the best across all types of data and datasets. As a result, the idea of aggregating various network inferences through a consensus mechanism naturally arises. In this chapter, a common framework to standardize already proposed consensus methods is presented, and based on this framework different proposals are introduced and analyzed in two different scenarios: Homogeneous and Heterogeneous. The first scenario reflects situations where the networks to be aggregated are rather similar because they are obtained with inference algorithms working on the same data, whereas the second scenario deals with very diverse networks because various sources of data are used to generate the individual networks. A procedure for combining multiple network inference algorithms is analyzed in a systematic way. The results show that there is a very significant difference between these two scenarios, and that the best way to combine networks in the Heterogeneous scenario is not the most commonly used. We show in particular that aggregation in the Heterogeneous scenario can be very beneficial if the individual networks are combined with our new proposed method ScaleLSum.Peer reviewe

    Unsupervised GRN Ensemble

    No full text
    Inferring gene regulatory networks from expression data is a very challenging problem that has raised the interest of the scientific community. Different algorithms have been proposed to try to solve this issue, but it has been shown that different methods have some particular biases and strengths, and none of them is the best across all types of data and datasets. As a result, the idea of aggregating various network inferences through a consensus mechanism naturally arises. In this chapter, a common framework to standardize already proposed consensus methods is presented, and based on this framework different proposals are introduced and analyzed in two different scenarios: Homogeneous and Heterogeneous. The first scenario reflects situations where the networks to be aggregated are rather similar because they are obtained with inference algorithms working on the same data, whereas the second scenario deals with very diverse networks because various sources of data are used to generate the individual networks. A procedure for combining multiple network inference algorithms is analyzed in a systematic way. The results show that there is a very significant difference between these two scenarios, and that the best way to combine networks in the Heterogeneous scenario is not the most commonly used. We show in particular that aggregation in the Heterogeneous scenario can be very beneficial if the individual networks are combined with our new proposed method ScaleLSum.Peer ReviewedPostprint (published version

    Network inference from single-cell transcriptomic data

    No full text
    Recent technological breakthroughs in single-cell RNA sequencing are revolutionizing modern experimental design in biology. The increasing size of the single-cell expression data from which networks can be inferred allows identifying more complex, non-linear dependencies between genes. Moreover, the inter-cellular variability that is observed in single-cell expression data can be used to infer not only one global network representing all the cells, but also numerous regulatory networks that are more specific to certain conditions. By experimentally perturbing certain genes, the deconvolution of the true contribution of these genes can also be greatly facilitated. In this chapter, we will therefore tackle the advantages of single-cell transcriptomic data and show how new methods exploit this novel data type to enhance the inference of gene regulatory networks

    Tissue-specific regulatory circuits reveal variable modular perturbations across complex diseases.

    Get PDF
    Mapping perturbed molecular circuits that underlie complex diseases remains a great challenge. We developed a comprehensive resource of 394 cell type- and tissue-specific gene regulatory networks for human, each specifying the genome-wide connectivity among transcription factors, enhancers, promoters and genes. Integration with 37 genome-wide association studies (GWASs) showed that disease-associated genetic variants--including variants that do not reach genome-wide significance--often perturb regulatory modules that are highly specific to disease-relevant cell types or tissues. Our resource opens the door to systematic analysis of regulatory programs across hundreds of human cell types and tissues (http://regulatorycircuits.org)
    corecore