1,815 research outputs found
Recommended from our members
Interactive tools for functional annotation of bacterial genomes
Automated annotations of protein functions are error-prone because of our lack of knowledge of protein functions. For example, it is often impossible to predict the correct substrate for an enzyme or a transporter. Furthermore, much of the knowledge that we do have about the functions of proteins is missing from the underlying databases. We discuss how to use interactive tools to quickly find different kinds of information relevant to a protein's function. Many of these tools are available via PaperBLAST (http://papers.genomics.lbl.gov). Combining these tools often allows us to infer a protein's function. Ideally, accurate annotations would allow us to predict a bacterium's capabilities from its genome sequence, but in practice, this remains challenging. We describe interactive tools that infer potential capabilities from a genome sequence or that search a genome to find proteins that might perform a specific function of interest. Database URL: http://papers.genomics.lbl.gov
FastTree 2 – Approximately Maximum-Likelihood Trees for Large Alignments
Background: We recently described FastTree, a tool for inferring phylogenies for alignments with up to hundreds of thousands of sequences. Here, we describe improvements to FastTree that improve its accuracy without sacrificing scalability. Methodology/Principal Findings: Where FastTree 1 used nearest-neighbor interchanges (NNIs) and the minimum-evolution criterion to improve the tree, FastTree 2 adds minimum-evolution subtree-pruning-regrafting (SPRs) and maximumlikelihood NNIs. FastTree 2 uses heuristics to restrict the search for better trees and estimates a rate of evolution for each site (the ‘‘CAT’ ’ approximation). Nevertheless, for both simulated and genuine alignments, FastTree 2 is slightly more accurate than a standard implementation of maximum-likelihood NNIs (PhyML 3 with default settings). Although FastTree 2 is not quite as accurate as methods that use maximum-likelihood SPRs, most of the splits that disagree are poorly supported, and for large alignments, FastTree 2 is 100–1,000 times faster. FastTree 2 inferred a topology and likelihood-based local support values for 237,882 distinct 16S ribosomal RNAs on a desktop computer in 22 hours and 5.8 gigabytes of memory. Conclusions/Significance: FastTree 2 allows the inference of maximum-likelihood phylogenies for huge alignments
Social Capital and the Campus Community
Investigating colleges’ and universities’ social capital through its five dimensions—civic engagement, norms and trust, collective action, bonding capital, and bridging capital—provides a powerful way of thinking about organizational and faculty development. Four very different institutions of higher learning have promoted their organizational development through efforts that build social capital. We seek to inspire additional application of and research into this topic by demonstrating that confronting the complexities of social capital within diverse campus communities can help faculty developers understand those communities with greater nuance and in ways that improve their ability to design and implement development initiatives
Horizontal gene transfer and the evolution of transcriptional regulation in Escherichia coli
Most Escherichia coli transcription factors have paralogs, but these usually arose by horizontal gene transfer rather than by duplication within the E. coli lineage, as previously believed
OpWise: Operons aid the identification of differentially expressed genes in bacterial microarray experiments
BACKGROUND: Differentially expressed genes are typically identified by analyzing the variation between replicate measurements. These procedures implicitly assume that there are no systematic errors in the data even though several sources of systematic error are known. RESULTS: OpWise estimates the amount of systematic error in bacterial microarray data by assuming that genes in the same operon have matching expression patterns. OpWise then performs a Bayesian analysis of a linear model to estimate significance. In simulations, OpWise corrects for systematic error and is robust to deviations from its assumptions. In several bacterial data sets, significant amounts of systematic error are present, and replicate-based approaches overstate the confidence of the changers dramatically, while OpWise does not. Finally, OpWise can identify additional changers by assigning genes higher confidence if they are consistent with other genes in the same operon. CONCLUSION: Although microarray data can contain large amounts of systematic error, operons provide an external standard and allow for reasonable estimates of significance. OpWise is available at
The genetic basis of energy conservation in the sulfate-reducing bacterium Desulfovibrio alaskensis G20.
Sulfate-reducing bacteria play major roles in the global carbon and sulfur cycles, but it remains unclear how reducing sulfate yields energy. To determine the genetic basis of energy conservation, we measured the fitness of thousands of pooled mutants of Desulfovibrio alaskensis G20 during growth in 12 different combinations of electron donors and acceptors. We show that ion pumping by the ferredoxin:NADH oxidoreductase Rnf is required whenever substrate-level phosphorylation is not possible. The uncharacterized complex Hdr/flox-1 (Dde_1207:13) is sometimes important alongside Rnf and may perform an electron bifurcation to generate more reduced ferredoxin from NADH to allow further ion pumping. Similarly, during the oxidation of malate or fumarate, the electron-bifurcating transhydrogenase NfnAB-2 (Dde_1250:1) is important and may generate reduced ferredoxin to allow additional ion pumping by Rnf. During formate oxidation, the periplasmic [NiFeSe] hydrogenase HysAB is required, which suggests that hydrogen forms in the periplasm, diffuses to the cytoplasm, and is used to reduce ferredoxin, thus providing a substrate for Rnf. During hydrogen utilization, the transmembrane electron transport complex Tmc is important and may move electrons from the periplasm into the cytoplasmic sulfite reduction pathway. Finally, mutants of many other putative electron carriers have no clear phenotype, which suggests that they are not important under our growth conditions, although we cannot rule out genetic redundancy
Orthologous Transcription Factors in Bacteria Have Different Functions and Regulate Different Genes
Transcription factors (TFs) form large paralogous gene families and have complex evolutionary histories. Here, we ask whether putative orthologs of TFs, from bidirectional best BLAST hits (BBHs), are evolutionary orthologs with conserved functions. We show that BBHs of TFs from distantly related bacteria are usually not evolutionary orthologs. Furthermore, the false orthologs usually respond to different signals and regulate distinct pathways, while the few BBHs that are evolutionary orthologs do have conserved functions. To test the conservation of regulatory interactions, we analyze expression patterns. We find that regulatory relationships between TFs and their regulated genes are usually not conserved for BBHs in Escherichia coli K12 and Bacillus subtilis. Even in the much more closely related bacteria Vibrio cholerae and Shewanella oneidensis MR-1, predicting regulation from E. coli BBHs has high error rates. Using gene–regulon correlations, we identify genes whose expression pattern differs between E. coli and S. oneidensis. Using literature searches and sequence analysis, we show that these changes in expression patterns reflect changes in gene regulation, even for evolutionary orthologs. We conclude that the evolution of bacterial regulation should be analyzed with phylogenetic trees, rather than BBHs, and that bacterial regulatory networks evolve more rapidly than previously thought
A novel method for accurate operon predictions in all sequenced prokaryotes
We combine comparative genomic measures and the distance separating adjacent genes to predict operons in 124 completely sequenced prokaryotic genomes. Our method automatically tailors itself to each genome using sequence information alone, and thus can be applied to any prokaryote. For Escherichia coli K12 and Bacillus subtilis, our method is 85 and 83% accurate, respectively, which is similar to the accuracy of methods that use the same features but are trained on experimentally characterized transcripts. In Halobacterium NRC-1 and in Helicobacter pylori, our method correctly infers that genes in operons are separated by shorter distances than they are in E.coli, and its predictions using distance alone are more accurate than distance-only predictions trained on a database of E.coli transcripts. We use microarray data from six phylogenetically diverse prokaryotes to show that combining intergenic distance with comparative genomic measures further improves accuracy and that our method is broadly effective. Finally, we survey operon structure across 124 genomes, and find several surprises: H.pylori has many operons, contrary to previous reports; Bacillus anthracis has an unusual number of pseudogenes within conserved operons; and Synechocystis PCC 6803 has many operons even though it has unusually wide spacings between conserved adjacent genes
- …