17 research outputs found

    Backup machinery of yeast transcriptional regulatory network

    Several studies have suggested the existence of backup machinery in transcriptional regulatory networks (TRNs). Here, we quantify the backup machinery of the TRNs of yeast genes under five different conditions in terms of alternate paths, and show that a statistically significantly (p<0.0001) stronger backup is maintained for endogenous processes (ENPs) than for exogenous processes (EXPs). A number of biologically important genes (SUC2, MF(ALPHA)2, CLN2, etc.) are observed to maintain a higher backup. Hub and random transcription factor (TF) knockouts in TRNs show that ENPs are more robust to deletion than EXPs. While the higher average connectivity of TFs in EXPs than in ENPs cannot explain the higher robustness of ENPs, we find that the latter are densely interconnected, which explains their specialized architecture that may have evolved under evolutionary pressure. Some non-hub TFs identified here are more likely to be essential and, if not essential, have a larger impact on fitness.
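The abstract measures backup "in terms of alternate paths" but does not say how paths are counted. As a rough illustration only, the sketch below assumes that backup for a gene is the number of simple directed paths from a regulator to that gene beyond the first one. The function name, the toy network, and the node labels are all hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def count_simple_paths(edges, source, target):
    """Count simple (no repeated node) directed paths from source to target."""
    graph = defaultdict(list)
    for regulator, regulated in edges:
        graph[regulator].append(regulated)

    def dfs(node, visited):
        if node == target:
            return 1
        total = 0
        for nxt in graph[node]:
            if nxt not in visited:
                total += dfs(nxt, visited | {nxt})
        return total

    return dfs(source, {source})


# Hypothetical toy network: TF_A reaches GENE_X directly and through TF_B and TF_C.
toy_edges = [
    ("TF_A", "GENE_X"),
    ("TF_A", "TF_B"),
    ("TF_B", "GENE_X"),
    ("TF_A", "TF_C"),
    ("TF_C", "TF_B"),
]

n_paths = count_simple_paths(toy_edges, "TF_A", "GENE_X")
# Assumed definition: every path beyond the first is a "backup" route.
backup_paths = max(n_paths - 1, 0)
```

Exhaustive enumeration of simple paths grows quickly with network size, so a real analysis would likely bound the path length or use a different counting scheme.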

    Missing and spurious interactions and the reconstruction of complex networks

    Network analysis is currently used in a myriad of contexts: from identifying potential drug targets to predicting the spread of epidemics and designing vaccination strategies, and from finding friends to uncovering criminal activity. Despite the promise of the network approach, the reliability of network data is a source of great concern in all fields where complex networks are studied. Here, we present a general mathematical and computational framework to deal with the problem of data reliability in complex networks. In particular, we are able to reliably identify both missing and spurious interactions in noisy network observations. Remarkably, our approach also enables us to obtain, from those noisy observations, network reconstructions that yield estimates of the true network properties that are more accurate than those provided by the observations themselves. Our approach has the potential to guide experiments, to better characterize network data sets, and to drive new discoveries

    A network inference method for large-scale unsupervised identification of novel drug-drug interactions

    Characterizing interactions between drugs is important to avoid potentially harmful combinations, to reduce off-target effects of treatments and to fight antibiotic resistant pathogens, among others. Here we present a network inference algorithm to predict uncharacterized drug-drug interactions. Our algorithm takes, as its only input, sets of previously reported interactions, and does not require any pharmacological or biochemical information about the drugs, their targets or their mechanisms of action. Because the models we use are abstract, our approach can deal with adverse interactions, synergistic/antagonistic/suppressing interactions, or any other type of drug interaction. We show that our method is able to accurately predict interactions, both in exhaustive pairwise interaction data between small sets of drugs, and in large-scale databases. We also demonstrate that our algorithm can be used efficiently to discover interactions of new drugs as part of the drug discovery process

    EFICAz²: enzyme function inference by a combined approach enhanced by machine learning

    ©2009 Arakaki et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The electronic version of this article is the complete one and can be found online at: http://www.biomedcentral.com/1471-2105/10/107
    doi:10.1186/1471-2105-10-107
    Background: We previously developed EFICAz, an enzyme function inference approach that combines predictions from non-completely overlapping component methods. Two of the four components in the original EFICAz are based on the detection of functionally discriminating residues (FDRs). FDRs distinguish between members of an enzyme family that are homofunctional (classified under the EC number of interest) or heterofunctional (annotated with another EC number or lacking enzymatic activity). Each of the two FDR-based components is associated with one of two specific kinds of enzyme families. EFICAz exhibits high precision, except when the maximal test to training sequence identity (MTTSI) is lower than 30%. To improve EFICAz's performance in this regime, we: i) increased the number of predictive components and ii) took advantage of consensual information from the different components to make the final EC number assignment.
    Results: We have developed two new EFICAz components, analogous to the two FDR-based components, where the discrimination between homo- and heterofunctional members is based on the evaluation, via Support Vector Machine models, of all the aligned positions between the query sequence and the multiple sequence alignments associated with the enzyme families. Benchmark results indicate that: i) the new SVM-based components outperform their FDR-based counterparts, and ii) both SVM-based and FDR-based components generate unique predictions. We developed classification tree models to optimally combine the results from the six EFICAz components into a final EC number prediction. The new implementation of our approach, EFICAz², exhibits a highly improved prediction precision at MTTSI < 30% compared to the original EFICAz, with only a slight decrease in prediction recall. A comparative analysis of enzyme function annotation of the human proteome by EFICAz² and KEGG shows that: i) when both sources make EC number assignments for the same protein sequence, the assignments tend to be consistent and ii) EFICAz² generates considerably more unique assignments than KEGG.
    Conclusion: Performance benchmarks and the comparison with KEGG demonstrate that EFICAz² is a powerful and precise tool for enzyme function annotation, with multiple applications in genome analysis and metabolic pathway reconstruction. The EFICAz² web service is available at: http://cssb.biology.gatech.edu/skolnick/webservice/EFICAz2/index.htm

    A side-effect free method for identifying cancer drug targets

    Identifying effective drug targets, with little or no side effects, remains an ever-challenging task. A potential pitfall is that failing to uncover the correct drug targets, owing to the side effects of pleiotropic genes, might lead to the potential drugs being declared illicit and withdrawn. Simplifying disease complexity, for the investigation of the mechanistic aspects and the identification of effective drug targets, has been done through several approaches of protein interactome analysis. Of these, centrality measures have always gained importance in identifying candidate drug targets. Here, we put forward an integrated method of analysing a complex network of cancer and depict the importance of k-core, functional connectivity and centrality (KFC) for identifying effective drug targets. Essentially, we have extracted the proteins involved in the pathways leading to cancer from the pathway databases that list real experimental datasets. The interactions between these proteins were mapped to build an interactome. Integrative analyses of the interactome enabled us to unearth plausible reasons for drugs being withdrawn, thereby giving future scope to pharmaceutical industries to potentially avoid them (e.g. ESR1, HDAC2, F2, PLG, PPARA, RXRA, etc.). Based upon our KFC criteria, we have shortlisted ten proteins (GRB2, FYN, PIK3R1, CBL, JAK2, LCK, LYN, SYK, JAK1 and SOCS3) as effective candidates for drug development.
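The k-core named in the KFC criteria is a standard graph notion: the maximal subgraph in which every node has at least k neighbours within the subgraph. A minimal sketch of the usual peeling procedure follows, assuming an undirected interactome stored as an adjacency dictionary. The protein labels P1 to P5 are hypothetical placeholders, and the abstract does not say how the authors computed or thresholded the k-core.

```python
def k_core(adjacency, k):
    """Return the node set of the k-core of an undirected graph.

    adjacency maps each node to the set of its neighbours (symmetric).
    Nodes with fewer than k remaining neighbours are peeled off repeatedly.
    """
    remaining = {node: set(nbrs) for node, nbrs in adjacency.items()}
    changed = True
    while changed:
        changed = False
        low_degree = [n for n, nbrs in remaining.items() if len(nbrs) < k]
        for node in low_degree:
            for nbr in remaining.pop(node):
                if nbr in remaining:
                    remaining[nbr].discard(node)
            changed = True
    return set(remaining)


# Hypothetical toy interactome: a triangle (P1, P2, P3) with a tail P1-P4-P5.
toy_interactome = {
    "P1": {"P2", "P3", "P4"},
    "P2": {"P1", "P3"},
    "P3": {"P1", "P2"},
    "P4": {"P1", "P5"},
    "P5": {"P4"},
}

core_2 = k_core(toy_interactome, 2)
```

In this toy graph the tail nodes are peeled away and only the triangle survives in the 2-core, which is the sense in which k-core membership picks out densely embedded proteins.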

    Two-stage flux balance analysis of metabolic networks for drug target identification

    Background: Efficient identification of drug targets is one of the major challenges for drug discovery and drug development. Traditional approaches to drug target identification include literature search-based target prioritization and in vitro binding assays, which are both time-consuming and labor-intensive. Computational integration of different knowledge sources is a more effective alternative. The wealth of omics data generated from genomic, proteomic and metabolomic techniques changes the way researchers view drug targets and provides unprecedented opportunities for drug target identification.
    Results: In this paper, we develop a method based on flux balance analysis (FBA) of metabolic networks to identify potential drug targets. This method consists of two linear programming (LP) models: the first finds the steady optimal fluxes of reactions and the mass flows of metabolites in the pathologic state, and the second determines the fluxes and mass flows in the medication state with the minimal side effect caused by the medication. Drug targets are identified by comparing the fluxes of reactions in both states and examining the change of reaction fluxes. We give an illustrative example to show that the drug target identification problem can be solved effectively by our method, then apply it to a hyperuricemia-related purine metabolic pathway. Known drug targets for hyperuricemia are correctly identified by our two-stage FBA method, and the side effects of these targets are also taken into account. A number of other promising drug targets are found to be both effective and safe.
    Conclusions: Our method is an efficient procedure for drug target identification through flux balance analysis of large-scale metabolic networks. It can generate testable predictions, provide insights into drug action mechanisms and guide experimental design of drug discovery.

    Metabolic pathway alignment between species using a comprehensive and flexible similarity measure

    Comparative analysis of metabolic networks in multiple species yields important information on their evolution, and has great practical value in metabolic engineering, human disease analysis, drug design, etc. In this work, we aim to systematically search for conserved pathways in two species, quantify their similarities, and focus on the variations between them.

    Path finding methods accounting for stoichiometry in metabolic networks

    Graph-based methods have been widely used for the analysis of biological networks. Their application to metabolic networks has been much discussed, in particular noting that an important weakness in such methods is that reaction stoichiometry is neglected. In this study, we show that reaction stoichiometry can be incorporated into path-finding approaches via mixed-integer linear programming. This major advance at the modeling level results in improved prediction of topological and functional properties in metabolic networks

    EFICAz2: enzyme function inference by a combined approach enhanced by machine learning

    Background: We previously developed EFICAz, an enzyme function inference approach that combines predictions from non-completely overlapping component methods. Two of the four components in the original EFICAz are based on the detection of functionally discriminating residues (FDRs). FDRs distinguish between members of an enzyme family that are homofunctional (classified under the EC number of interest) or heterofunctional (annotated with another EC number or lacking enzymatic activity). Each of the two FDR-based components is associated with one of two specific kinds of enzyme families. EFICAz exhibits high precision, except when the maximal test to training sequence identity (MTTSI) is lower than 30%. To improve EFICAz's performance in this regime, we: i) increased the number of predictive components and ii) took advantage of consensual information from the different components to make the final EC number assignment.
    Results: We have developed two new EFICAz components, analogous to the two FDR-based components, where the discrimination between homo- and heterofunctional members is based on the evaluation, via Support Vector Machine models, of all the aligned positions between the query sequence and the multiple sequence alignments associated with the enzyme families. Benchmark results indicate that: i) the new SVM-based components outperform their FDR-based counterparts, and ii) both SVM-based and FDR-based components generate unique predictions. We developed classification tree models to optimally combine the results from the six EFICAz components into a final EC number prediction. The new implementation of our approach, EFICAz², exhibits a highly improved prediction precision at MTTSI < 30% compared to the original EFICAz, with only a slight decrease in prediction recall. A comparative analysis of enzyme function annotation of the human proteome by EFICAz² and KEGG shows that: i) when both sources make EC number assignments for the same protein sequence, the assignments tend to be consistent and ii) EFICAz² generates considerably more unique assignments than KEGG.
    Conclusion: Performance benchmarks and the comparison with KEGG demonstrate that EFICAz² is a powerful and precise tool for enzyme function annotation, with multiple applications in genome analysis and metabolic pathway reconstruction. The EFICAz² web service is available at: http://cssb.biology.gatech.edu/skolnick/webservice/EFICAz2/index.html