46 research outputs found

    Identifying genomic and metabolic features that can underlie early successional and opportunistic lifestyles of human gut symbionts

    Get PDF
    We lack a deep understanding of genetic and metabolic attributes specializing in microbial consortia for initial and subsequent waves of colonization of our body habitats. Here we show that phylogenetically interspersed bacteria in Clostridium cluster XIVa, an abundant group of bacteria in the adult human gut also known as the Clostridium coccoides or Eubacterium rectale group, contains species that have evolved distribution patterns consistent with either early successional or stable gut communities. The species that specialize to the infant gut are more likely to associate with systemic infections and can reach high abundances in individuals with Inflammatory Bowel Disease (IBD), indicating that a subset of the microbiota that have adapted to pioneer/opportunistic lifestyles may do well in both early development and with disease. We identified genes likely selected during adaptation to pioneer/opportunistic lifestyles as those for which early succession association and not phylogenetic relationships explain genomic abundance. These genes reveal potential mechanisms by which opportunistic gut bacteria tolerate osmotic and oxidative stress and potentially important aspects of their metabolism. These genes may not only be biomarkers of properties associated with adaptation to early succession and disturbance, but also leads for developing therapies aimed at promoting reestablishment of stable gut communities following physiologic or pathologic disturbances

    EGenBio : a data management system for evolutionary genomics and biodiversity

    Get PDF
    From The Third Annual Conference of the MidSouth Computational Biology and Bioinformatics Society Baton Rouge, Louisiana. 2–4 March, 2006.© 2006 Nahum et al; licensee BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.EGenBio is a system for manipulation and filtering of large numbers of sequences, integrating curated sequence alignments and phylogenetic trees, managing evolutionary analyses, and visualizing their output. EGenBio is organized into three conceptual divisions, Evolution, Genomics, and Biodiversity. The Genomics division includes tools for selecting pre-aligned sequences from different genes and species, and for modifying and filtering these alignments for further analysis. Species searches are handled through queries that can be modified based on a tree-based navigation system and saved. The Biodiversity division contains tools for analyzing individual sequences or sequence alignments, whereas the Evolution division contains tools involving phylogenetic trees. Alignments are annotated with analytical results and modification history using our PRAED format. A miscellaneous Tools section and Help framework are also available. EGenBio was developed around our comparative genomic research and a prototype database of mtDNA genomes. It utilizes MySQL-relational databases and dynamic page generation, and calls numerous custom programs.This work was partly funded by the National Institutes of Health (R22/R33 Innovation and Development grant to David Pollock), the National Science Foundation (CBM2/EPSCOR), and the State of Louisiana (Biological Computation and Visualization Center, Governor's iotechnology Initiative, and startup funds to David Pollock)

    Large-Scale Mapping and Validation of Escherichia coli Transcriptional Regulation from a Compendium of Expression Profiles

    Get PDF
    Machine learning approaches offer the potential to systematically identify transcriptional regulatory interactions from a compendium of microarray expression profiles. However, experimental validation of the performance of these methods at the genome scale has remained elusive. Here we assess the global performance of four existing classes of inference algorithms using 445 Escherichia coli Affymetrix arrays and 3,216 known E. coli regulatory interactions from RegulonDB. We also developed and applied the context likelihood of relatedness (CLR) algorithm, a novel extension of the relevance networks class of algorithms. CLR demonstrates an average precision gain of 36% relative to the next-best performing algorithm. At a 60% true positive rate, CLR identifies 1,079 regulatory interactions, of which 338 were in the previously known network and 741 were novel predictions. We tested the predicted interactions for three transcription factors with chromatin immunoprecipitation, confirming 21 novel interactions and verifying our RegulonDB-based performance estimates. CLR also identified a regulatory link providing central metabolic control of iron transport, which we confirmed with real-time quantitative PCR. The compendium of expression data compiled in this study, coupled with RegulonDB, provides a valuable model system for further improvement of network inference algorithms using experimental data

    Bridging the knowledge gap: from microbiome composition to function

    No full text
    Despite the wealth of metagenomic sequencing data, the functions of most bacterial genes from the mammalian microbiota have remained poorly understood. In their recent study (Yaung et al 2015), Wang, Gerber, and colleagues present a platform which allows functional mining of bacterial genomes for genes that contribute to fitness in vivo and holds great potential for forward engineering microbes with enhanced colonization abilities in the microbiota

    2.1. Mechanisms of RNA regulation.................................................... 68

    No full text
    Microarray technologies, which enable the simultaneous measurement of all RNA transcripts in a cell, have spawned the development of algorithms for reverse-engineering transcription control networks. In this article, we classify the algorithms into two general strategies: physical modeling and influence modeling. We discuss the biological and computational principles underlying each strategy, and provide leading examples of each. We als

    Lightweight genome viewer: portable software for browsing genomics data in its chromosomal context

    No full text
    Abstract Background Lightweight genome viewer (lwgv) is a web-based tool for visualization of sequence annotations in their chromosomal context. It performs most of the functions of larger genome browsers, while relying on standard flat-file formats and bypassing the database needs of most visualization tools. Visualization as an aide to discovery requires display of novel data in conjunction with static annotations in their chromosomal context. With database-based systems, displaying dynamic results requires temporary tables that need to be tracked for removal. Results lwgv simplifies the visualization of user-generated results on a local computer. The dynamic results of these analyses are written to transient files, which can import static content from a more permanent file. lwgv is currently used in many different applications, from whole genome browsers to single-gene RNAi design visualization, demonstrating its applicability in a large variety of contexts and scales. Conclusion lwgv provides a lightweight alternative to large genome browsers for visualizing biological annotations and dynamic analyses in their chromosomal context. It is particularly suited for applications ranging from short sequences to medium-sized genomes when the creation and maintenance of a large software and database infrastructure is not necessary or desired.</p
    corecore