584 research outputs found

    anNET: a tool for network-embedded thermodynamic analysis of quantitative metabolome data

    Get PDF
    Background: Compared to other omics techniques, quantitative metabolomics is still at its infancy. Complex sample preparation and analytical procedures render exact quantification extremely difficult. Furthermore, not only the actual measurement but also the subsequent interpretation of quantitative metabolome data to obtain mechanistic insights is still lacking behind the current expectations. Recently, the method of network-embedded thermodynamic (NET) analysis was introduced to address some of these open issues. Building upon principles of thermodynamics, this method allows for a quality check of measured metabolite concentrations and enables to spot metabolic reactions where active regulation potentially controls metabolic flux. So far, however, widespread application of NET analysis in metabolomics labs was hindered by the absence of suitable software. Results: We have developed in Matlab a generalized software called 'anNET' that affords a user-friendly implementation of the NET analysis algorithm. anNET supports the analysis of any metabolic network for which a stoichiometric model can be compiled. The model size can span from a single reaction to a complete genome-wide network reconstruction including compartments. anNET can (i) test quantitative data sets for thermodynamic consistency, (ii) predict metabolite concentrations beyond the actually measured data, (iii) identify putative sites of active regulation in the metabolic reaction network, and (iv) help in localizing errors in data sets that were found to be thermodynamically infeasible. We demonstrate the application of anNET with three published Escherichia coli metabolome data sets. Conclusion: Our user-friendly and generalized implementation of the NET analysis method in the software anNET allows users to rapidly integrate quantitative metabolome data obtained from virtually any organism. We envision that use of anNET in labs working on quantitative metabolomics will provide the systems biology and metabolic engineering communities with a mean to proof the quality of metabolome data sets and with all further benefits of the NET analysis approach.

    In-silico-Systemanalyse von Biopathways

    Get PDF
    Chen M. In silico systems analysis of biopathways. Bielefeld (Germany): Bielefeld University; 2004.In the past decade with the advent of high-throughput technologies, biology has migrated from a descriptive science to a predictive one. A vast amount of information on the metabolism have been produced; a number of specific genetic/metabolic databases and computational systems have been developed, which makes it possible for biologists to perform in silico analysis of metabolism. With experimental data from laboratory, biologists wish to systematically conduct their analysis with an easy-to-use computational system. One major task is to implement molecular information systems that will allow to integrate different molecular database systems, and to design analysis tools (e.g. simulators of complex metabolic reactions). Three key problems are involved: 1) Modeling and simulation of biological processes; 2) Reconstruction of metabolic pathways, leading to predictions about the integrated function of the network; and 3) Comparison of metabolism, providing an important way to reveal the functional relationship between a set of metabolic pathways. This dissertation addresses these problems of in silico systems analysis of biopathways. We developed a software system to integrate the access to different databases, and exploited the Petri net methodology to model and simulate metabolic networks in cells. It develops a computer modeling and simulation technique based on Petri net methodology; investigates metabolic networks at a system level; proposes a markup language for biological data interchange among diverse biological simulators and Petri net tools; establishes a web-based information retrieval system for metabolic pathway prediction; presents an algorithm for metabolic pathway alignment; recommends a nomenclature of cellular signal transduction; and attempts to standardize the representation of biological pathways. Hybrid Petri net methodology is exploited to model metabolic networks. Kinetic modeling strategy and Petri net modeling algorithm are applied to perform the processes of elements functioning and model analysis. The proposed methodology can be used for all other metabolic networks or the virtual cell metabolism. Moreover, perspectives of Petri net modeling and simulation of metabolic networks are outlined. A proposal for the Biology Petri Net Markup Language (BioPNML) is presented. The concepts and terminology of the interchange format, as well as its syntax (which is based on XML) are introduced. BioPNML is designed to provide a starting point for the development of a standard interchange format for Bioinformatics and Petri nets. The language makes it possible to exchange biology Petri net diagrams between all supported hardware platforms and versions. It is also designed to associate Petri net models and other known metabolic simulators. A web-based metabolic information retrieval system, PathAligner, is developed in order to predict metabolic pathways from rudimentary elements of pathways. It extracts metabolic information from biological databases via the Internet, and builds metabolic pathways with data sources of genes, sequences, enzymes, metabolites, etc. The system also provides a navigation platform to investigate metabolic related information, and transforms the output data into XML files for further modeling and simulation of the reconstructed pathway. An alignment algorithm to compare the similarity between metabolic pathways is presented. A new definition of the metabolic pathway is proposed. The pathway defined as a linear event sequence is practical for our alignment algorithm. The algorithm is based on strip scoring the similarity of 4-hierarchical EC numbers involved in the pathways. The algorithm described has been implemented and is in current use in the context of the PathAligner system. Furthermore, new methods for the classification and nomenclature of cellular signal transductions are recommended. For each type of characterized signal transduction, a unique ST number is provided. The Signal Transduction Classification Database (STCDB), based on the proposed classification and nomenclature, has been established. By merging the ST numbers with EC numbers, alignments of biopathways are possible. Finally, a detailed model of urea cycle that includes gene regulatory networks, metabolic pathways and signal transduction is demonstrated by using our approaches. A system biological interpretation of the observed behavior of the urea cycle and its related transcriptomics information is proposed to provide new insights for metabolic engineering and medical care

    Artificial intelligence in cancer target identification and drug discovery

    Get PDF
    Artificial intelligence is an advanced method to identify novel anticancer targets and discover novel drugs from biology networks because the networks can effectively preserve and quantify the interaction between components of cell systems underlying human diseases such as cancer. Here, we review and discuss how to employ artificial intelligence approaches to identify novel anticancer targets and discover drugs. First, we describe the scope of artificial intelligence biology analysis for novel anticancer target investigations. Second, we review and discuss the basic principles and theory of commonly used network-based and machine learning-based artificial intelligence algorithms. Finally, we showcase the applications of artificial intelligence approaches in cancer target identification and drug discovery. Taken together, the artificial intelligence models have provided us with a quantitative framework to study the relationship between network characteristics and cancer, thereby leading to the identification of potential anticancer targets and the discovery of novel drug candidates

    Metabolic modelling and 13C flux analysis : application to biotechnologically important yeasts and a fungus

    Get PDF
    All bioconversions in cells derive from metabolism. Microbial metabolisms contain potential for bioconversions from simple source molecules to unlimited number of biochemicals and for degradation of even detrimental compounds. Metabolic fluxes are rates of consumption and production of compounds in metabolic reactions. Fluxes emerge as an ultimate phenotype of an organism from an integrated regulatory function of the underlying networks of complex and dynamic biochemical interactions. Since the fluxes are time-dependent, they have to be inferred from other, measurable, quantities by modelling and computational analysis. 13C-labelling is crucial for quantitative analysis of fluxes through intracellular alternative pathways. Local flux ratio analysis utilises uniform 13C-labelling experiments, where the carbon source contains a fraction of uniformly 13C-labelled molecules. Carbon-carbon bonds are cleaved and formed in metabolic reactions depending on the in vivo fluxes. 13C-labelling patterns of metabolites or macromolecule components can be detected by mass spectrometry (MS) or nuclear magnetic resonance (NMR) spectroscopy. Local flux ratio analysis utilises directly the 13C-labelling data and metabolic network models to solve ratios of converging fluxes. In this thesis the local flux ratio analysis has been extended and applied to analysis of phenotypes of biotechnologically important yeasts Saccharomyces cerevisiae and Pichia pastoris, and a fungus Trichoderma reesei. Oxygen dependence of in vivo net flux distribution of S. cerevisiae was quantified by using local flux ratios as additional constraints to the stoichiometric model of the central carbon metabolism. The distribution of fluxes in the pyruvate branching point turned out to be most responsive to different oxygen availabilities. The distribution of fluxes was observed to vary not only between the fully respiratory, respiro-fermentative and fermentative metabolic states but also between different respiro-fermentative states. The local flux ratio analysis was extended to the case of two-carbon source of glycerol and methanol co-utilisation by P. pastoris. The fraction of methanol in the carbon source did not have as profound effect on the distribution of fluxes as the growth rate. The effect of carbon catabolite repression (CCR) on fluxes of T. reesei was studied by reconstructing amino acid biosynthetic pathways and by performing local flux ratio analysis. T. reesei was observed to primarily utilise respiratory metabolism also in conditions of CCR. T. reesei metabolism was further studied and L-threo-3-deoxy-hexulosonate was identified as L-galactonate dehydratase reaction product by using NMR spectroscopy. L-galactonate dehydratase reaction is part of the fungal pathway for D-galacturonic acid catabolism

    Network-Based Biomarker Discovery : Development of Prognostic Biomarkers for Personalized Medicine by Integrating Data and Prior Knowledge

    Get PDF
    Advances in genome science and technology offer a deeper understanding of biology while at the same time improving the practice of medicine. The expression profiling of some diseases, such as cancer, allows for identifying marker genes, which could be able to diagnose a disease or predict future disease outcomes. Marker genes (biomarkers) are selected by scoring how well their expression levels can discriminate between different classes of disease or between groups of patients with different clinical outcome (e.g. therapy response, survival time, etc.). A current challenge is to identify new markers that are directly related to the underlying disease mechanism

    Design of new algorithms for gene network reconstruction applied to in silico modeling of biomedical data

    Get PDF
    Programa de Doctorado en Biotecnología, Ingeniería y Tecnología QuímicaLínea de Investigación: Ingeniería, Ciencia de Datos y BioinformáticaClave Programa: DBICódigo Línea: 111The root causes of disease are still poorly understood. The success of current therapies is limited because persistent diseases are frequently treated based on their symptoms rather than the underlying cause of the disease. Therefore, biomedical research is experiencing a technology-driven shift to data-driven holistic approaches to better characterize the molecular mechanisms causing disease. Using omics data as an input, emerging disciplines like network biology attempt to model the relationships between biomolecules. To this effect, gene co- expression networks arise as a promising tool for deciphering the relationships between genes in large transcriptomic datasets. However, because of their low specificity and high false positive rate, they demonstrate a limited capacity to retrieve the disrupted mechanisms that lead to disease onset, progression, and maintenance. Within the context of statistical modeling, we dove deeper into the reconstruction of gene co-expression networks with the specific goal of discovering disease-specific features directly from expression data. Using ensemble techniques, which combine the results of various metrics, we were able to more precisely capture biologically significant relationships between genes. We were able to find de novo potential disease-specific features with the help of prior biological knowledge and the development of new network inference techniques. Through our different approaches, we analyzed large gene sets across multiple samples and used gene expression as a surrogate marker for the inherent biological processes, reconstructing robust gene co-expression networks that are simple to explore. By mining disease-specific gene co-expression networks we come up with a useful framework for identifying new omics-phenotype associations from conditional expression datasets.In this sense, understanding diseases from the perspective of biological network perturbations will improve personalized medicine, impacting rational biomarker discovery, patient stratification and drug design, and ultimately leading to more targeted therapies.Universidad Pablo de Olavide de Sevilla. Departamento de Deporte e Informátic

    Reconstruction and systems analysis of metabolism in apicomplexan parasites Toxoplasma gondii and Plasmodium falciparum

    Get PDF
    Understanding of metabolism in disease-causing microorganisms promotes drug design through the identification of the enzymes whose activity is indispensable for important cellular functions of the pathogens. Nowadays such understanding arises from experimental as well as computational studies. These two approaches, long considered as rather orthogonal, in recent years began to converge and form a new field, where they are utilized as complementary. In this thesis I present my endeavors in bringing closer the fields of infection and systems biology with a particular focus on large-scale metabolic models and their analysis. Integrative, interdisciplinary nature of my project also included multiple experimental inputs as well as original experimental efforts on investigating model-derived hypotheses. In the scope of this thesis I explored metabolism of two of the most experimentally amenable apicomplexan species â human parasites Plasmodium falciparum and Toxoplasma gondii. As a foundation for the studies included in this thesis I used standard as well as recently developed computational algorithms, existing experimental datasets and innovative context- specific assumptions. I produced an extensive survey of the modeling efforts previously applied for studying metabolism of P. falciparum and available large-scale experimental datasets in comparison with the similar efforts made in other species. Further, I curated an existing model of metabolism in Plasmodium falciparum with respect to an up-to-date primary literature on metabolism of the parasite and addressed several important assumptions implicitly made in this model. Using a state-of-the-art approach, I reconstructed de novo a comprehensive metabolic model of T. gondii, and performed an extensive computational analysis to explore its metabolic needs and capabilities. I identified and classified the minimal set of substrates the parasite utilizes for growth, along with the genes and pairs of genes that are essential for cellular functions such as growth and energy metabolism. Subsequently, several of the model-driven hypotheses were confirmed experimentally, while for validation of the majority of the computational predictions forthcoming high-throughput approaches shall be used. Every confirmed hypothesis expands the scope of our knowledge on peculiarities of metabolism in apicomplexan parasites and hence can serve as an input for the pipeline of developing novel medicines

    Computational methods for augmenting association-based gene mapping

    Get PDF
    The context and motivation for this thesis is gene mapping, the discovery of genetic variants that affect susceptibility to disease. The goals of gene mapping research include understanding of disease mechanisms, evaluating individual disease risks and ultimately developing new medicines and treatments. Traditional genetic association mapping methods test each measured genetic variant independently for association with the disease. One way to improve the power of detecting disease-affecting variants is to base the tests on haplotypes, strings of adjacent variants that are inherited together, instead of individual variants. To enable haplotype analyses in large-scale association studies, this thesis introduces two novel statistical models and gives an efficient algorithm for haplotype reconstruction, jointly called HaloRec. HaploRec is based on modeling local regularities of variable length in the haplotypes of the studied population and using the obtained model to statistically reconstruct the most probable haplotypes for each studied individual. Our experiments demonstrate that HaploRec is especially well suited to data sets with a large number or markers and subjects, such as those typically used in currently popular genome-wide association studies. Public biological databases contain large amounts of data that can help in determining the relevance of putative associations. In this thesis, we introduce Biomine, a database and search engine that integrates data from several such databases under a uniform graph representation. The graph database is used to derive a general proximity measure for biological entities represented as graph nodes, based on a novel scheme of weighting individual graph edges based on their informativeness and type. The resulting proximity measure can be used as a basis for various data analysis tasks, such as ranking putative disease genes and visualization of gene relationships. Our experiments show that relevant disease genes can be identified from among the putative ones with a reasonable accuracy using Biomine. Best accuracy is obtained when a pre-known reference set of disease genes is available, but experiments using a novel clustering-based method demonstrate that putative disease genes can also be ranked without a reference set under suitable conditions. An important complementary use of Biomine is the search and visualization of indirect relationships between graph nodes, which can be used e.g. to characterize the relationship of putative disease genes to already known disease genes. We provide two methods for selecting subgraphs to be visualized: one based on weights of the edges on the paths connecting query nodes, and one based on using context free grammars to define the types of paths to be displayed. Both of these query interfaces to Biomine are available online.Tämän väitöskirjan aihealue on geenikartoitus, tautialttiuteen vaikuttavien perinnöllisten muunnosten paikantaminen. Geenikartoituksen käytännöllisiä päämääriä ovat tautimekanismien ymmärtäminen, yksilöllisten tautiriskien arviointi sekä uusien lääkitysten kehittäminen. Tässä työssä on kehitetty laskennallisia menetelmiä joita voidaan käyttää parantamaan olemassaolevien geenikartoitusmenetelmien tehoa sekä analysoimaan niiden antamia alustavia tuloksia. Geenikartoitusmenetelmät perustuvat ns. markereihin, jotka ovat yksilöllistä vaihtelua sisältäviä kohtia perimässä. Tyypillisesti käytetyt menetelmät mittaavat kussakin markerissa esiintyvien muunnosten yhteyttä tautiin erikseen, huomioimatta muita markereita. Kartoituksen tarkkuutta voidaan parantaa käyttämällä testaamisen yksikkönä yksittäisten markerien sijaan haplotyyppejä, lähekkäisissä markereissa esiintyvien muunnosten muodostamia säännönmukaisia jaksoja jotka periytyvät yhdessä. Laboratoriomenelmät eivät suoraan tuota tietoa siitä, miten kunkin yksilön perimästä mitatut muunnokset jakautuvat tämän kahdelta vanhemmalta perimiin haplotyyppeihin. Tämän väitöskirjan alkupuolella esitetään laskennallinen menetelmä, joilla haplotyypit voidaan rekonstruoida tilastollisesti, perustuen niiden paikallisiin säännönmukaisuuksiin. Kehitetty menetelmä on laskennallisesti tehokas ja soveltuu erityisesti genominlaajuisiin tutkimuksiin, joissa sekä tutkittujen yksilöiden että markereiden määrät ovat suuria, ja markerit sijaitsevat kohtuullisen etäällä toisistaan. Yksittäisten muunnosten vaikutukset tauteihin ovat usein suhteellisen heikkoja, ja kun testataan suuri joukko markereita, tuloksiin tulee yleensä sattumalta mukaan myös muunnoksia joilla ei ole todellista vaikutusta tautiin. Julkiset biologiset tietokannat sisältävät paljon tietoa joka voi auttaa alustavien geenikartoitustulosten merkityksen arvioimista. Työn toisessa osassa esitellään Biomine, tietokanta jossa on yhdistetty tietoa joukosta tällaisia tietokantoja käyttäen painotettua verkkomallia joka kuvaa mm. geenien, proteiinien ja tautien välisiä tunnettuja yhteyksiä. Verkon solmujen välisten epäsuorien yhteyksien voimakkuuden mittaamiseen esitetään uusi menetelmä. Tätä menetelmää voidaan hyödyntää mm. geenikartoituksella löydettyjen kandidaattigeenien priorisointiin, perustuen siihen että mitataan kandidaattigeenien ja entuudestaan tunnettujen tautigeenien välisten yhteyksien voimakkuutta, tai kandidaattigeenien keskinäisten yhteyksien voimakkuutta. Työssä esitetään myös menetelmiä verkkotietokannan solmujen välisten epäsuorien yhteyksien visualisointiin, perustuen kulloinkin kiinnostuksen kohteena olevien solmujen yhteyttä parhaiten kuvaavan pienen aliverkon eristämiseen tietokannasta. Aliverkon valintaan esitetään kaksi laskennallisesti tehokasta menetelmää: toinen perustuen yhteyksien voimakkuuden arvioimiseen, ja toinen perustuen yhdistävien polkujen sisältämien linkkien tyyppeihin. Nämä visualisointimenetelmät ovat myös käytettävissä julkisessa verkkopalvelussa jossa voi tehdä kyselyjä Biomine-tietokantaan
    corecore