128 research outputs found
Metabolic Networks Analysis using Convex Optimization
Metabolic networks map the biochemical reactions in a living cell to the flow of various chemical substances in the cell, which are called metabolites. A standard model of a metabolic network is given as a linear map from the reaction rates to the change in metabolites concentrations. We study two problems related to the analysis of metabolic networks, the minimal network problem and the minimal knockout problem
Recommended from our members
Identification of focally amplified lineage-specific super-enhancers in human epithelial cancers
Whole genome analysis approaches are revealing recurrent cancer-associated somatic alterations in non-coding DNA regions. We combined somatic copy number analysis of 12 tumor types with tissue-specific epigenetic profiling to identify significant regions of focal amplification harboring super-enhancers. Copy-number gains of non-coding regions harboring super-enhancers near KLF5, USP12, PARD6B and MYC are associated with over-expression of these cancer-related genes. We show that two distinct focal amplifications of super-enhancers 3′ to MYC in lung adenocarcinoma (MYC-LASE) and endometrial carcinoma (MYC-ECSE), are physically associated with the MYC promoter and correlate with MYC over-expression. CRISPR/Cas9-mediated repression or deletion of a constituent enhancer within the MYC-LASE region led to significant reductions in the expression of MYC and its target genes, and to the impairment of anchorage-independent and clonogenic growth, consistent with an oncogenic function. Our results demonstrate that genomic amplification of super-enhancers represents a common mechanism to activate cancer driver genes in multiple cancer types
Exploiting the pathway structure of metabolism to reveal high-order epistasis
<p>Abstract</p> <p>Background</p> <p>Biological robustness results from redundant pathways that achieve an essential objective, e.g. the production of biomass. As a consequence, the biological roles of many genes can only be revealed through multiple knockouts that identify a <it>set </it>of genes as essential for a given function. The identification of such "epistatic" essential relationships between network components is critical for the understanding and eventual manipulation of robust systems-level phenotypes.</p> <p>Results</p> <p>We introduce and apply a network-based approach for genome-scale metabolic knockout design. We apply this method to uncover over 11,000 minimal knockouts for biomass production in an <it>in silico </it>genome-scale model of <it>E. coli</it>. A large majority of these "essential sets" contain 5 or more reactions, and thus represent complex epistatic relationships between components of the <it>E. coli </it>metabolic network.</p> <p>Conclusion</p> <p>The complex minimal biomass knockouts discovered with our approach illuminate robust essential systems-level roles for reactions in the <it>E. coli </it>metabolic network. Unlike previous approaches, our method yields results regarding high-order epistatic relationships and is applicable at the genome-scale.</p
Dynamics of Prolyl hydroxylases levels during disease progression in experimental colitis
Hypoxia inducible factor (HIF)-prolyl hydroxylase (PHD) inhibitors are shown to be protective in several models of inflammatory bowel disease (IBD). However, these non-selective inhibitors are known to inhibit all the three isoforms of PHD, i.e. PHD-1, PHD-2 and PHD-3. In the present report, we investigated the associated changes in levels of PHDs during the development and recovery of chemically induced colitis in mice. The results indicated that in the experimental model of murine colitis, levels of both, PHD-1 and PHD-2 were found to be increased with the progression of the disease; however, the level of PHD-3 remained the same in group of healthy controls and mice with colitis. Thus, the findings advocated that inhibitors, which inhibited all three isoforms of PHD could not be ideal therapeutics for IBD since PHD-3 is required for normal gut function. Hence, this necessitates the development of new compounds capable of selectively inhibiting PHD-1 and PHD-2 for effective treatment of IBD
Patterns of somatic structural variation in human cancer genomes.
A key mutational process in cancer is structural variation, in which rearrangements delete, amplify or reorder genomic segments that range in size from kilobases to whole chromosomes1-7. Here we develop methods to group, classify and describe somatic structural variants, using data from the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA), which aggregated whole-genome sequencing data from 2,658Â cancers across 38Â tumour types8. Sixteen signatures of structural variation emerged. Deletions have a multimodal size distribution, assort unevenly across tumour types and patients, are enriched in late-replicating regions and correlate with inversions. Tandem duplications also have a multimodal size distribution, but are enriched in early-replicating regions-as are unbalanced translocations. Replication-based mechanisms of rearrangement generate varied chromosomal structures with low-level copy-number gains and frequent inverted rearrangements. One prominent structure consists of 2-7Â templates copied from distinct regions of the genome strung together within one locus. Such cycles of templated insertions correlate with tandem duplications, and-in liver cancer-frequently activate the telomerase gene TERT. A wide variety of rearrangement processes are active in cancer, which generate complex configurations of the genome upon which selection can act
Recommended from our members
Mutational heterogeneity in cancer and the search for new cancer genes
Major international projects are now underway aimed at creating a comprehensive catalog of all genes responsible for the initiation and progression of cancer. These studies involve sequencing of matched tumor–normal samples followed by mathematical analysis to identify those genes in which mutations occur more frequently than expected by random chance. Here, we describe a fundamental problem with cancer genome studies: as the sample size increases, the list of putatively significant genes produced by current analytical methods burgeons into the hundreds. The list includes many implausible genes (such as those encoding olfactory receptors and the muscle protein titin), suggesting extensive false positive findings that overshadow true driver events. Here, we show that this problem stems largely from mutational heterogeneity and provide a novel analytical methodology, MutSigCV, for resolving the problem. We apply MutSigCV to exome sequences from 3,083 tumor-normal pairs and discover extraordinary variation in (i) mutation frequency and spectrum within cancer types, which shed light on mutational processes and disease etiology, and (ii) mutation frequency across the genome, which is strongly correlated with DNA replication timing and also with transcriptional activity. By incorporating mutational heterogeneity into the analyses, MutSigCV is able to eliminate most of the apparent artefactual findings and allow true cancer genes to rise to attention
Association Analysis of the FTO Gene with Obesity in Children of Caucasian and African Ancestry Reveals a Common Tagging SNP
Recently an association was demonstrated between the single nucleotide polymorphism (SNP), rs9939609, within the FTO locus and obesity as a consequence of a genome wide association (GWA) study of type 2 diabetes in adults. We examined the effects of two perfect surrogates for this SNP plus 11 other SNPs at this locus with respect to our childhood obesity cohort, consisting of both Caucasians and African Americans (AA). Utilizing data from our ongoing GWA study in our cohort of 418 Caucasian obese children (BMI≥95th percentile), 2,270 Caucasian controls (BMI<95th percentile), 578 AA obese children and 1,424 AA controls, we investigated the association of the previously reported variation at the FTO locus with the childhood form of this disease in both ethnicities. The minor allele frequencies (MAF) of rs8050136 and rs3751812 (perfect surrogates for rs9939609 i.e. both r2 = 1) in the Caucasian cases were 0.448 and 0.443 respectively while they were 0.391 and 0.386 in Caucasian controls respectively, yielding for both an odds ratio (OR) of 1.27 (95% CI 1.08–1.47; P = 0.0022). Furthermore, the MAFs of rs8050136 and rs3751812 in the AA cases were 0.449 and 0.115 respectively while they were 0.436 and 0.090 in AA controls respectively, yielding an OR of 1.05 (95% CI 0.91–1.21; P = 0.49) and of 1.31 (95% CI 1.050–1.643; P = 0.017) respectively. Investigating all 13 SNPs present on the Illumina HumanHap550 BeadChip in this region of linkage disequilibrium, rs3751812 was the only SNP conferring significant risk in AA. We have therefore replicated and refined the association in an AA cohort and distilled a tag-SNP, rs3751812, which captures the ancestral origin of the actual mutation. As such, variants in the FTO gene confer a similar magnitude of risk of obesity to children as to their adult counterparts and appear to have a global impact
A Genome-Wide Meta-Analysis of Six Type 1 Diabetes Cohorts Identifies Multiple Associated Loci
Diabetes impacts approximately 200 million people worldwide, of whom approximately 10% are affected by type 1 diabetes (T1D). The application of genome-wide association studies (GWAS) has robustly revealed dozens of genetic contributors to the pathogenesis of T1D, with the most recent meta-analysis identifying in excess of 40 loci. To identify additional genetic loci for T1D susceptibility, we examined associations in the largest meta-analysis to date between the disease and ∼2.54 million SNPs in a combined cohort of 9,934 cases and 16,956 controls. Targeted follow-up of 53 SNPs in 1,120 affected trios uncovered three new loci associated with T1D that reached genome-wide significance. The most significantly associated SNP (rs539514, P = 5.66×10−11) resides in an intronic region of the LMO7 (LIM domain only 7) gene on 13q22. The second most significantly associated SNP (rs478222, P = 3.50×10−9) resides in an intronic region of the EFR3B (protein EFR3 homolog B) gene on 2p23; however, the region of linkage disequilibrium is approximately 800 kb and harbors additional multiple genes, including NCOA1, C2orf79, CENPO, ADCY3, DNAJC27, POMC, and DNMT3A. The third most significantly associated SNP (rs924043, P = 8.06×10−9) lies in an intergenic region on 6q27, where the region of association is approximately 900 kb and harbors multiple genes including WDR27, C6orf120, PHF10, TCTE3, C6orf208, LOC154449, DLL1, FAM120B, PSMB1, TBP, and PCD2. These latest associated regions add to the growing repertoire of gene networks predisposing to T1D
- …