888 research outputs found

    Predicting protein function by machine learning on amino acid sequences – a critical evaluation

    Get PDF
    Copyright @ 2007 Al-Shahib et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.Background: Predicting the function of newly discovered proteins by simply inspecting their amino acid sequence is one of the major challenges of post-genomic computational biology, especially when done without recourse to experimentation or homology information. Machine learning classifiers are able to discriminate between proteins belonging to different functional classes. Until now, however, it has been unclear if this ability would be transferable to proteins of unknown function, which may show distinct biases compared to experimentally more tractable proteins. Results: Here we show that proteins with known and unknown function do indeed differ significantly. We then show that proteins from different bacterial species also differ to an even larger and very surprising extent, but that functional classifiers nonetheless generalize successfully across species boundaries. We also show that in the case of highly specialized proteomes classifiers from a different, but more conventional, species may in fact outperform the endogenous species-specific classifier. Conclusion: We conclude that there is very good prospect of successfully predicting the function of yet uncharacterized proteins using machine learning classifiers trained on proteins of known function

    GeneRank: Using search engine technology for the analysis of microarray experiments

    Get PDF
    Copyright @ 2005 Morrison et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.Background: Interpretation of simple microarray experiments is usually based on the fold-change of gene expression between a reference and a "treated" sample where the treatment can be of many types from drug exposure to genetic variation. Interpretation of the results usually combines lists of differentially expressed genes with previous knowledge about their biological function. Here we evaluate a method – based on the PageRank algorithm employed by the popular search engine Google – that tries to automate some of this procedure to generate prioritized gene lists by exploiting biological background information. Results: GeneRank is an intuitive modification of PageRank that maintains many of its mathematical properties. It combines gene expression information with a network structure derived from gene annotations (gene ontologies) or expression profile correlations. Using both simulated and real data we find that the algorithm offers an improved ranking of genes compared to pure expression change rankings. Conclusion: Our modification of the PageRank algorithm provides an alternative method of evaluating microarray experimental results which combines prior knowledge about the underlying network. GeneRank offers an improvement compared to assessing the importance of a gene based on its experimentally observed fold-change alone and may be used as a basis for further analytical developments

    A structured approach for the engineering of biochemical network models, illustrated for signalling pathways

    Get PDF
    http://dx.doi.org/10.1093/bib/bbn026Quantitative models of biochemical networks (signal transduction cascades, metabolic pathways, gene regulatory circuits) are a central component of modern systems biology. Building and managing these complex models is a major challenge that can benefit from the application of formal methods adopted from theoretical computing science. Here we provide a general introduction to the field of formal modelling, which emphasizes the intuitive biochemical basis of the modelling process, but is also accessible for an audience with a background in computing science and/or model engineering. We show how signal transduction cascades can be modelled in a modular fashion, using both a qualitative approach { Qualitative Petri nets, and quantitative approaches { Continuous Petri Nets and Ordinary Differential Equations. We review the major elementary building blocks of a cellular signalling model, discuss which critical design decisions have to be made during model building, and present ..

    Metabolic modeling and analysis of the metabolic switch in Streptomyces coelicolor

    Get PDF
    Background The transition from exponential to stationary phase in Streptomyces coelicolor is accompanied by a major metabolic switch and results in a strong activation of secondary metabolism. Here we have explored the underlying reorganization of the metabolome by combining computational predictions based on constraint-based modeling and detailed transcriptomics time course observations. Results We reconstructed the stoichiometric matrix of S. coelicolor, including the major antibiotic biosynthesis pathways, and performed flux balance analysis to predict flux changes that occur when the cell switches from biomass to antibiotic production. We defined the model input based on observed fermenter culture data and used a dynamically varying objective function to represent the metabolic switch. The predicted fluxes of many genes show highly significant correlation to the time series of the corresponding gene expression data. Individual mispredictions identify novel links between antibiotic production and primary metabolism. Conclusion Our results show the usefulness of constraint-based modeling for providing a detailed interpretation of time course gene expression data

    Assessing polymer-surface adhesion with a polymer collection

    Get PDF
    Polymer modification plays an important role in the construction of devices, but the lack of fundamental understanding on polymer-surface adhesion limits the development of miniaturized devices. In this work, a thermoplastic polymer collection was established using the combinatorial laser-induced forward transfer technique as a research platform, to assess the adhesion of polymers to substrates of different wettability. Furthermore, it also revealed the influence of adhesion on dewetting phenomena during the laser transfer and relaxation process, resulting in polymer spots of various morphologies. This gives a general insight into polymer-surface adhesion and connects it with the generation of defined polymer microstructures, which can be a valuable reference for the rational use of polymers

    Liver Enzymes: Interaction Analysis of Smoking with Alcohol Consumption or BMI, Comparing AST and ALT to γ-GT

    Get PDF
    A detrimental interaction between smoking and alcohol consumption with respect serum γ-glutamyltransferase (γ-GT) has recently been described. The underlying mechanisms remain unknown. The present work aimed to provide further insights by examining similar interactions pertaining to aspartate and alanine transaminase (AST, ALT), routine liver markers less prone to enzyme induction.<0.0001). The interactions all were in the same directions as for γ-GT, i.e. synergistic with alcohol and opposite with BMI.The patterns of interaction between smoking and alcohol consumption or BMI with respect to AST and ALT resembled those observed for γ-GT. This renders enzyme induction a less probable mechanism for these associations, whereas it might implicate exacerbated hepatocellular vulnerability and injury

    Metabolomics to unveil and understand phenotypic diversity between pathogen populations

    Get PDF
    Visceral leishmaniasis is caused by a parasite called Leishmania donovani, which every year infects about half a million people and claims several thousand lives. Existing treatments are now becoming less effective due to the emergence of drug resistance. Improving our understanding of the mechanisms used by the parasite to adapt to drugs and achieve resistance is crucial for developing future treatment strategies. Unfortunately, the biological mechanism whereby Leishmania acquires drug resistance is poorly understood. Recent years have brought new technologies with the potential to increase greatly our understanding of drug resistance mechanisms. The latest mass spectrometry techniques allow the metabolome of parasites to be studied rapidly and in great detail. We have applied this approach to determine the metabolome of drug-sensitive and drug-resistant parasites isolated from patients with leishmaniasis. The data show that there are wholesale differences between the isolates and that the membrane composition has been drastically modified in drug-resistant parasites compared with drug-sensitive parasites. Our findings demonstrate that untargeted metabolomics has great potential to identify major metabolic differences between closely related parasite strains and thus should find many applications in distinguishing parasite phenotypes of clinical relevance

    Computational models for inferring biochemical networks

    Get PDF
    Biochemical networks are of great practical importance. The interaction of biological compounds in cells has been enforced to a proper understanding by the numerous bioinformatics projects, which contributed to a vast amount of biological information. The construction of biochemical systems (systems of chemical reactions), which include both topology and kinetic constants of the chemical reactions, is NP-hard and is a well-studied system biology problem. In this paper, we propose a hybrid architecture, which combines genetic programming and simulated annealing in order to generate and optimize both the topology (the network) and the reaction rates of a biochemical system. Simulations and analysis of an artificial model and three real models (two models and the noisy version of one of them) show promising results for the proposed method.The Romanian National Authority for Scientific Research, CNDI–UEFISCDI, Project No. PN-II-PT-PCCA-2011-3.2-0917

    Interpreting microarray experiments via co-expressed gene groups analysis

    Get PDF
    International audienceMicroarray technology produces vast amounts of data by measuring simultaneously the expression levels of thousands of genes under hundreds of biological conditions. Nowadays, one of the principal challenges in bioinformatics is the interpretation of huge data using different sources of information. We propose a novel data analysis method named CGGA (Co-expressed Gene Groups Analysis) that automatically finds groups of genes that are functionally enriched, i.e. have the same functional annotations, and are co- expressed. CGGA automatically integrates the information of microarrays, i.e. gene expression profiles, with the functional annotations of the genes obtained by the genome-wide information sources such as Gene Ontology (GO)1. By applying CGGA to well-known microarray experiments, we have identified the principal functionally enriched and co-expressed gene groups, and we have shown that this approach enhances and accelerates the interpretation of DNA microarray experiments
    corecore