128 research outputs found

    Accelerated search for biomolecular network models to interpret high-throughput experimental data

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The functions of human cells are carried out by biomolecular networks, which include proteins, genes, and regulatory sites within DNA that encode and control protein expression. Models of biomolecular network structure and dynamics can be inferred from high-throughput measurements of gene and protein expression. We build on our previously developed fuzzy logic method for bridging quantitative and qualitative biological data to address the challenges of noisy, low resolution high-throughput measurements, i.e., from gene expression microarrays. We employ an evolutionary search algorithm to accelerate the search for hypothetical fuzzy biomolecular network models consistent with a biological data set. We also develop a method to estimate the probability of a potential network model fitting a set of data by chance. The resulting metric provides an estimate of both model quality and dataset quality, identifying data that are too noisy to identify meaningful correlations between the measured variables.</p> <p>Results</p> <p>Optimal parameters for the evolutionary search were identified based on artificial data, and the algorithm showed scalable and consistent performance for as many as 150 variables. The method was tested on previously published human cell cycle gene expression microarray data sets. The evolutionary search method was found to converge to the results of exhaustive search. The randomized evolutionary search was able to converge on a set of similar best-fitting network models on different training data sets after 30 generations running 30 models per generation. Consistent results were found regardless of which of the published data sets were used to train or verify the quantitative predictions of the best-fitting models for cell cycle gene dynamics.</p> <p>Conclusion</p> <p>Our results demonstrate the capability of scalable evolutionary search for fuzzy network models to address the problem of inferring models based on complex, noisy biomolecular data sets. This approach yields multiple alternative models that are consistent with the data, yielding a constrained set of hypotheses that can be used to optimally design subsequent experiments.</p

    Predicting Protein Phenotypes Based on Protein-Protein Interaction Network

    Get PDF
    BACKGROUND: Identifying associated phenotypes of proteins is a challenge of the modern genetics since the multifactorial trait often results from contributions of many proteins. Besides the high-through phenotype assays, the computational methods are alternative ways to identify the phenotypes of proteins. METHODOLOGY/PRINCIPAL FINDINGS: Here, we proposed a new method for predicting protein phenotypes in yeast based on protein-protein interaction network. Instead of only the most likely phenotype, a series of possible phenotypes for the query protein were generated and ranked according to the tethering potential score. As a result, the first order prediction accuracy of our method achieved 65.4% evaluated by Jackknife test of 1,267 proteins in budding yeast, much higher than the success rate (15.4%) of a random guess. And the likelihood of the first 3 predicted phenotypes including all the real phenotypes of the proteins was 70.6%. CONCLUSIONS/SIGNIFICANCE: The candidate phenotypes predicted by our method provided useful clues for the further validation. In addition, the method can be easily applied to the prediction of protein associated phenotypes in other organisms

    Functions of Some Capsular Polysaccharide Biosynthetic Genes in Klebsiella pneumoniae NTUH K-2044

    Get PDF
    The growing number of Klebsiella pneumoniae infections, commonly acquired in hospitals, has drawn great concern. It has been shown that the K1 and K2 capsular serotypes are the most detrimental strains, particularly to those with diabetes. The K1 cps (capsular polysaccharide) locus in the NTUH-2044 strain of the pyogenic liver abscess (PLA) K. pneumoniae has been identified recently, but little is known about the functions of the genes therein. Here we report characterization of a group of cps genes and their roles in the pathogenesis of K1 K. pneumoniae. By sequential gene deletion, the cps gene cluster was first re-delimited between genes galF and ugd, which serve as up- and down-stream ends, respectively. Eight gene products were characterized in vitro and in vivo to be involved in the syntheses of UDP-glucose, UDP-glucuronic acid and GDP-fucose building units. Twelve genes were identified as virulence factors based on the observation that their deletion mutants became avirulent or lost K1 antigenicity. Furthermore, deletion of kp3706, kp3709 or kp3712 (ΔwcaI, ΔwcaG or Δatf, respectively), which are all involved in fucose biosynthesis, led to a broad range of transcriptional suppression for 52 upstream genes. The genes suppressed include those coding for unknown regulatory membrane proteins and six multidrug efflux system proteins, as well as proteins required for the K1 CPS biosynthesis. In support of the suppression of multidrug efflux genes, we showed that these three mutants became more sensitive to antibiotics. Taken together, the results suggest that kp3706, kp3709 or kp3712 genes are strongly related to the pathogenesis of K. pneumoniae K1

    Genetic Interaction Maps in Escherichia coli Reveal Functional Crosstalk among Cell Envelope Biogenesis Pathways

    Get PDF
    As the interface between a microbe and its environment, the bacterial cell envelope has broad biological and clinical significance. While numerous biosynthesis genes and pathways have been identified and studied in isolation, how these intersect functionally to ensure envelope integrity during adaptive responses to environmental challenge remains unclear. To this end, we performed high-density synthetic genetic screens to generate quantitative functional association maps encompassing virtually the entire cell envelope biosynthetic machinery of Escherichia coli under both auxotrophic (rich medium) and prototrophic (minimal medium) culture conditions. The differential patterns of genetic interactions detected among >235,000 digenic mutant combinations tested reveal unexpected condition-specific functional crosstalk and genetic backup mechanisms that ensure stress-resistant envelope assembly and maintenance. These networks also provide insights into the global systems connectivity and dynamic functional reorganization of a universal bacterial structure that is both broadly conserved among eubacteria (including pathogens) and an important target

    Failure of SOX9 Regulation in 46XY Disorders of Sex Development with SRY, SOX9 and SF1 Mutations

    Get PDF
    In human embryogenesis, loss of SRY (sex determining region on Y), SOX9 (SRY-related HMG box 9) or SF1 (steroidogenic factor 1) function causes disorders of sex development (DSD). A defining event of vertebrate sex determination is male-specific upregulation and maintenance of SOX9 expression in gonadal pre-Sertoli cells, which is preceded by transient SRY expression in mammals. In mice, Sox9 regulation is under the transcriptional control of SRY, SF1 and SOX9 via a conserved testis-specific enhancer of Sox9 (TES). Regulation of SOX9 in human sex determination is however poorly understood.We show that a human embryonal carcinoma cell line (NT2/D1) can model events in presumptive Sertoli cells that initiate human sex determination. SRY associates with transcriptionally active chromatin in NT2/D1 cells and over-expression increases endogenous SOX9 expression. SRY and SF1 co-operate to activate the human SOX9 homologous TES (hTES), a process dependent on phosphorylated SF1. SOX9 also activates hTES, augmented by SF1, suggesting a mechanism for maintenance of SOX9 expression by auto-regulation. Analysis of mutant SRY, SF1 and SOX9 proteins encoded by thirteen separate 46,XY DSD gonadal dysgenesis individuals reveals a reduced ability to activate hTES.We demonstrate how three human sex-determining factors are likely to function during gonadal development around SOX9 as a hub gene, with different genetic causes of 46,XY DSD due a common failure to upregulate SOX9 transcription

    A Genome-Wide Gene Function Prediction Resource for Drosophila melanogaster

    Get PDF
    Predicting gene functions by integrating large-scale biological data remains a challenge for systems biology. Here we present a resource for Drosophila melanogaster gene function predictions. We trained function-specific classifiers to optimize the influence of different biological datasets for each functional category. Our model predicted GO terms and KEGG pathway memberships for Drosophila melanogaster genes with high accuracy, as affirmed by cross-validation, supporting literature evidence, and large-scale RNAi screens. The resulting resource of prioritized associations between Drosophila genes and their potential functions offers a guide for experimental investigations

    Universal Artifacts Affect the Branching of Phylogenetic Trees, Not Universal Scaling Laws

    Get PDF
    The superficial resemblance of phylogenetic trees to other branching structures allows searching for macroevolutionary patterns. However, such trees are just statistical inferences of particular historical events. Recent meta-analyses report finding regularities in the branching pattern of phylogenetic trees. But is this supported by evidence, or are such regularities just methodological artifacts? If so, is there any signal in a phylogeny?In order to evaluate the impact of polytomies and imbalance on tree shape, the distribution of all binary and polytomic trees of up to 7 taxa was assessed in tree-shape space. The relationship between the proportion of outgroups and the amount of imbalance introduced with them was assessed applying four different tree-building methods to 100 combinations from a set of 10 ingroup and 9 outgroup species, and performing covariance analyses. The relevance of this analysis was explored taking 61 published phylogenies, based on nucleic acid sequences and involving various taxa, taxonomic levels, and tree-building methods.All methods of phylogenetic inference are quite sensitive to the artifacts introduced by outgroups. However, published phylogenies appear to be subject to a rather effective, albeit rather intuitive control against such artifacts. The data and methods used to build phylogenetic trees are varied, so any meta-analysis is subject to pitfalls due to their uneven intrinsic merits, which translate into artifacts in tree shape. The binary branching pattern is an imposition of methods, and seldom reflects true relationships in intraspecific analyses, yielding artifactual polytomies in short trees. Above the species level, the departure of real trees from simplistic random models is caused at least by two natural factors--uneven speciation and extinction rates; and artifacts such as choice of taxa included in the analysis, and imbalance introduced by outgroups and basal paraphyletic taxa. This artifactual imbalance accounts for tree shape convergence of large trees.There is no evidence for any universal scaling in the tree of life. Instead, there is a need for improved methods of tree analysis that can be used to discriminate the noise due to outgroups from the phylogenetic signal within the taxon of interest, and to evaluate realistic models of evolution, correcting the retrospective perspective and explicitly recognizing extinction as a driving force. Artifacts are pervasive, and can only be overcome through understanding the structure and biological meaning of phylogenetic trees. Catalan Abstract in Translation S1

    The role of peptides in bone healing and regeneration: A systematic review

    Get PDF
    Background: Bone tissue engineering and the research surrounding peptides has expanded significantly over the last few decades. Several peptides have been shown to support and stimulate the bone healing response and have been proposed as therapeutic vehicles for clinical use. The aim of this comprehensive review is to present the clinical and experimental studies analysing the potential role of peptides for bone healing and bone regeneration. Methods: A systematic review according to PRISMA guidelines was conducted. Articles presenting peptides capable of exerting an upregulatory effect on osteoprogenitor cells and bone healing were included in the study. Results: Based on the available literature, a significant amount of experimental in vitro and in vivo evidence exists. Several peptides were found to upregulate the bone healing response in experimental models and could act as potential candidates for future clinical applications. However, from the available peptides that reached the level of clinical trials, the presented results are limited. Conclusion: Further research is desirable to shed more light into the processes governing the osteoprogenitor cellular responses. With further advances in the field of biomimetic materials and scaffolds, new treatment modalities for bone repair will emerge

    Insights into mammalian transcription control by systematic analysis of ChIP sequencing data

    Get PDF
    Abstract Background Transcription regulation is a major controller of gene expression dynamics during development and disease, where transcription factors (TFs) modulate expression of genes through direct or indirect DNA interaction. ChIP sequencing has become the most widely used technique to get a genome wide view of TF occupancy in a cell type of interest, mainly due to established standard protocols and a rapid decrease in the cost of sequencing. The number of available ChIP sequencing data sets in public domain is therefore ever increasing, including data generated by individual labs together with consortia such as the ENCODE project. Results A total of 1735 ChIP-sequencing datasets in mouse and human cell types and tissues were used to perform bioinformatic analyses to unravel diverse features of transcription control. 1- We used the Heat*seq webtool to investigate global relations across the ChIP-seq samples. 2- We demonstrated that factors have a specific genomic location preferences that are, for most factors, conserved across species. 3- Promoter proximal binding of factors was more conserved across cell types while the distal binding sites are more cell type specific. 4- We identified combinations of factors preferentially acting together in a cellular context. 5- Finally, by integrating the data with disease-associated gene loci from GWAS studies, we highlight the value of this data to associate novel regulators to disease. Conclusion In summary, we demonstrate how ChIP sequencing data integration and analysis is powerful to get new insights into mammalian transcription control and demonstrate the utility of various bioinformatic tools to generate novel testable hypothesis using this public resource
    corecore