142 research outputs found
Recognizing the pseudogenes in bacterial genomes
Pseudogenes are now known to be a regular feature of bacterial genomes and are found in particularly high numbers within the genomes of recently emerged bacterial pathogens. As most pseudogenes are recognized by sequence alignments, we use newly available genomic sequences to identify the pseudogenes in 11 genomes from 4 bacterial genera, each of which contains at least 1 human pathogen. The numbers of pseudogenes range from 27 in Staphylococcus aureus MW2 to 337 in Yersinia pestis CO92 (e.g. 1–8% of the annotated genes in the genome). Most pseudogenes are formed by small frameshifting indels, but because stop codons are A + T-rich, the two low-G + C Gram-positive taxa (Streptococcus and Staphylococcus) have relatively high fractions of pseudogenes generated by nonsense mutations when compared with more G + C-rich genomes. Over half of the pseudogenes are produced from genes whose original functions were annotated as ‘hypothetical’ or ‘unknown’; however, several broadly distributed genes involved in nucleotide processing, repair or replication have become pseudogenes in one of the sequenced Vibrio vulnificus genomes. Although many of our comparisons involved closely related strains with broadly overlapping gene inventories, each genome contains a largely unique set of pseudogenes, suggesting that pseudogenes are formed and eliminated relatively rapidly from most bacterial genomes
Of What Use Is Sex to Bacteria?
Though bacteria are predominantly asexual, the genetic information in their genomes can be expanded and modified through mechanisms that introduce DNA from outside sources. Bacterial sex differs from that of eukaryotes in that it is unidirectional and does not involve gamete fusion or reproduction. The input of DNA during bacterial sex generates diversity in two ways — through the alteration of existing genes by recombination and through the introduction of novel sequences — and each of these processes has been shown to aid in the survival and diversification of lineages
Inferring clocks when lacking rocks: the variable rates of molecular evolution in bacteria
<p>Abstract</p> <p>Background</p> <p>Because bacteria do not have a robust fossil record, attempts to infer the timing of events in their evolutionary history requires comparisons of molecular sequences. This use of molecular clocks is based on the assumptions that substitution rates for homologous genes or sites are fairly constant through time and across taxa. Violation of these conditions can lead to erroneous inferences and result in estimates that are off by orders of magnitude. In this study, we examine the consistency of substitution rates among a set of conserved genes in diverse bacterial lineages, and address the questions regarding the validity of molecular dating.</p> <p>Results</p> <p>By examining the evolution of 16S rRNA gene in obligate endosymbionts, which can be calibrated by the fossil record of their hosts, we found that the rates are consistent within a clade but varied widely across different bacterial lineages. Genome-wide estimates of nonsynonymous and synonymous substitutions suggest that these two measures are highly variable in their rates across bacterial taxa. Genetic drift plays a fundamental role in determining the accumulation of substitutions in 16S rRNA genes and at nonsynonymous sites. Moreover, divergence estimates based on a set of universally conserved protein-coding genes also exhibit low correspondence to those based on 16S rRNA genes.</p> <p>Conclusion</p> <p>Our results document a wide range of substitution rates across genes and bacterial taxa. This high level of variation cautions against the assumption of a universal molecular clock for inferring divergence times in bacteria. However, by applying relative-rate tests to homologous genes, it is possible to derive reliable local clocks that can be used to calibrate bacterial evolution.</p> <p>Reviewers</p> <p>This article was reviewed by Adam Eyre-Walker, Simonetta Gribaldo and Tal Pupko (nominated by Dan Graur).</p
The Extinction Dynamics of Bacterial Pseudogenes
Pseudogenes are usually considered to be completely neutral sequences whose evolution is shaped by random mutations and chance events. It is possible, however, for disrupted genes to generate products that are deleterious due either to the energetic costs of their transcription and translation or to the formation of toxic proteins. We found that after their initial formation, the youngest pseudogenes in Salmonella genomes have a very high likelihood of being removed by deletional processes and are eliminated too rapidly to be governed by a strictly neutral model of stochastic loss. Those few highly degraded pseudogenes that have persisted in Salmonella genomes correspond to genes with low expression levels and low connectivity in gene networks, such that their inactivation and any initial deleterious effects associated with their inactivation are buffered. Although pseudogenes have long been considered the paradigm of neutral evolution, the distribution of pseudogenes among Salmonella strains indicates that removal of many of these apparently functionless regions is attributable to positive selection
A Selective Force Favoring Increased G+C Content in Bacterial Genes
Bacteria display considerable variation in their overall base compositions, which range from 13% to over 75% G+C. This variation in genomic base compositions has long been considered to be a strictly neutral character, due solely to differences in the mutational process; however, recent sequence comparisons indicate that mutational input alone cannot produce the observed base compositions, implying a role for natural selection. Because bacterial genomes have high gene content, forces that operate on the base composition of individual genes could help shape the overall genomic base composition. To explore this possibility, we tested whether genes that encode the same protein but vary only in their base compositions at synonymous sites have effects on bacterial fitness. Escherichia coli strains harboring G+C-rich versions of genes display higher growth rates, indicating that despite a pervasive mutational bias toward A+T, a selective force, independent of adaptive codon use, is driving genes toward higher G+C contents
Recommended from our members
Where Next for Microbiome Research?
The development of high-throughput sequencing technologies has transformed our capacity to investigate the composition and dynamics of the microbial communities that populate diverse habitats. Over the past decade, these advances have yielded an avalanche of metagenomic data. The current stage of “van Leeuwenhoek”–like cataloguing, as well as functional analyses, will likely accelerate as DNA and RNA sequencing, plus protein and metabolic profiling capacities and computational tools, continue to improve. However, it is time to consider: what’s next for microbiome research? The short pieces included here briefly consider the challenges and opportunities awaiting microbiome research
Evolutionary Origins of Genomic Repertoires in Bacteria
Explaining the diversity of gene repertoires has been a major problem in modern evolutionary biology. In eukaryotes, this diversity is believed to result mainly from gene duplication and loss, but in prokaryotes, lateral gene transfer (LGT) can also contribute substantially to genome contents. To determine the histories of gene inventories, we conducted an exhaustive analysis of gene phylogenies for all gene families in a widely sampled group, the γ-Proteobacteria. We show that, although these bacterial genomes display striking differences in gene repertoires, most gene families having representatives in several species have congruent histories. Other than the few vast multigene families, gene duplication has contributed relatively little to the contents of these genomes; instead, LGT, over time, provides most of the diversity in genomic repertoires. Most such acquired genes are lost, but the majority of those that persist in genomes are transmitted strictly vertically. Although our analyses are limited to the γ-Proteobacteria, these results resolve a long-standing paradox—i.e., the ability to make robust phylogenetic inferences in light of substantial LGT
The complete genome sequence and comparative genome analysis of the high pathogenicity Yersinia enterocolitica strain 8081
The human enteropathogen, Yersinia enterocolitica, is a significant link in the range of Yersinia pathologies extending
from mild gastroenteritis to bubonic plague. Comparison at the genomic level is a key step in our understanding of the
genetic basis for this pathogenicity spectrum. Here we report the genome of Y. enterocolitica strain 8081 (serotype 0:8;
biotype 1B) and extensive microarray data relating to the genetic diversity of the Y. enterocolitica species. Our analysis
reveals that the genome of Y. enterocolitica strain 8081 is a patchwork of horizontally acquired genetic loci, including a
plasticity zone of 199 kb containing an extraordinarily high density of virulence genes. Microarray analysis has
provided insights into species-specific Y. enterocolitica gene functions and the intraspecies differences between the
high, low, and nonpathogenic Y. enterocolitica biotypes. Through comparative genome sequence analysis we provide
new information on the evolution of the Yersinia. We identify numerous loci that represent ancestral clusters of genes
potentially important in enteric survival and pathogenesis, which have been lost or are in the process of being lost, in
the other sequenced Yersinia lineages. Our analysis also highlights large metabolic operons in Y. enterocolitica that are
absent in the related enteropathogen, Yersinia pseudotuberculosis, indicating major differences in niche and nutrients
used within the mammalian gut. These include clusters directing, the production of hydrogenases, tetrathionate
respiration, cobalamin synthesis, and propanediol utilisation. Along with ancestral gene clusters, the genome of Y.
enterocolitica has revealed species-specific and enteropathogen-specific loci. This has provided important insights into
the pathology of this bacterium and, more broadly, into the evolution of the genus. Moreover, wider investigations
looking at the patterns of gene loss and gain in the Yersinia have highlighted common themes in the genome evolution
of other human enteropathogens
Sex and virulence in Escherichia coli: an evolutionary perspective
Pathogenic Escherichia coli cause over 160 million cases of dysentery and one million deaths per year, whereas non-pathogenic E. coli constitute part of the normal intestinal flora of healthy mammals and birds. The evolutionary pathways underlying this dichotomy in bacterial lifestyle were investigated by multilocus sequence typing of a global collection of isolates. Specific pathogen types [enterohaemorrhagic E. coli, enteropathogenic E. coli, enteroinvasive E. coli, K1 and Shigella] have arisen independently and repeatedly in several lineages, whereas other lineages contain only few pathogens. Rates of evolution have accelerated in pathogenic lineages, culminating in highly virulent organisms whose genomic contents are altered frequently by increased rates of homologous recombination; thus, the evolution of virulence is linked to bacterial sex. This long-term pattern of evolution was observed in genes distributed throughout the genome, and thereby is the likely result of episodic selection for strains that can escape the host immune response
- …