21 research outputs found
Prokaryote genome fluidity is dependent on effective population size
Many prokaryote species are known to have fluid genomes, with different strains varying markedly in accessory gene content through the combined action of gene loss, gene gain via lateral transfer, as well as gene duplication. However, the evolutionary forces determining genome fluidity are not yet well understood. We here for the first time systematically analyse the degree to which this distinctive genomic feature differs between bacterial species. We find that genome fluidity is positively correlated with synonymous nucleotide diversity of the core genome, a measure of effective population size Ne. No effects of genome size, phylogeny or homologous recombination rate on genome fluidity were found. Our findings are consistent with a scenario where accessory gene content turnover is for a large part dictated by neutral evolution
Robust estimation of microbial diversity in theory and in practice
Quantifying diversity is of central importance for the study of structure,
function and evolution of microbial communities. The estimation of microbial
diversity has received renewed attention with the advent of large-scale
metagenomic studies. Here, we consider what the diversity observed in a sample
tells us about the diversity of the community being sampled. First, we argue
that one cannot reliably estimate the absolute and relative number of microbial
species present in a community without making unsupported assumptions about
species abundance distributions. The reason for this is that sample data do not
contain information about the number of rare species in the tail of species
abundance distributions. We illustrate the difficulty in comparing species
richness estimates by applying Chao's estimator of species richness to a set of
in silico communities: they are ranked incorrectly in the presence of large
numbers of rare species. Next, we extend our analysis to a general family of
diversity metrics ("Hill diversities"), and construct lower and upper estimates
of diversity values consistent with the sample data. The theory generalizes
Chao's estimator, which we retrieve as the lower estimate of species richness.
We show that Shannon and Simpson diversity can be robustly estimated for the in
silico communities. We analyze nine metagenomic data sets from a wide range of
environments, and show that our findings are relevant for empirically-sampled
communities. Hence, we recommend the use of Shannon and Simpson diversity
rather than species richness in efforts to quantify and compare microbial
diversity.Comment: To be published in The ISME Journal. Main text: 16 pages, 5 figures.
Supplement: 16 pages, 4 figure
Genomic fluidity: an integrative view of gene diversity within microbial populations
<p>Abstract</p> <p>Background</p> <p>The dual concepts of pan and core genomes have been widely adopted as means to assess the distribution of gene families within microbial species and genera. The core genome is the set of genes shared by a group of organisms; the pan genome is the set of all genes seen in any of these organisms. A variety of methods have provided drastically different estimates of the sizes of pan and core genomes from sequenced representatives of the same groups of bacteria.</p> <p>Results</p> <p>We use a combination of mathematical, statistical and computational methods to show that current predictions of pan and core genome sizes may have no correspondence to true values. Pan and core genome size estimates are problematic because they depend on the estimation of the occurrence of rare genes and genomes, respectively, which are difficult to estimate precisely because they are rare. Instead, we introduce and evaluate a robust metric - genomic fluidity - to categorize the gene-level similarity among groups of sequenced isolates. Genomic fluidity is a measure of the dissimilarity of genomes evaluated at the gene level.</p> <p>Conclusions</p> <p>The genomic fluidity of a population can be estimated accurately given a small number of sequenced genomes. Further, the genomic fluidity of groups of organisms can be compared robustly despite variation in algorithms used to identify genes and their homologs. As such, we recommend that genomic fluidity be used in place of pan and core genome size estimates when assessing gene diversity within genomes of a species or a group of closely related organisms.</p
Bacterial microevolution and the Pangenome
The comparison of multiple genome sequences sampled from a bacterial population reveals considerable diversity in both the core and the accessory parts of the pangenome. This diversity can be analysed in terms of microevolutionary events that took place since the genomes shared a common ancestor, especially deletion, duplication, and recombination. We review the basic modelling ingredients used implicitly or explicitly when performing such a pangenome analysis. In particular, we describe a basic neutral phylogenetic framework of bacterial pangenome microevolution, which is not incompatible with evaluating the role of natural selection. We survey the different ways in which pangenome data is summarised in order to be included in microevolutionary models, as well as the main methodological approaches that have been proposed to reconstruct pangenome microevolutionary history
A barrier to homologous recombination between sympatric strains of the cooperative soil bacterium Myxococcus xanthus
The bacterium Myxococcus xanthus glides through soil in search of prey microbes, but when food
sources run out, cells cooperatively construct and sporulate within multicellular fruiting bodies.
M. xanthus strains isolated from a 16 × 16-cm-scale patch of soil were previously shown to have
diversified into many distinct compatibility types that are distinguished by the failure of swarming
colonies to merge upon encounter. We sequenced the genomes of 22 isolates from this population
belonging to the two most frequently occurring multilocus sequence type (MLST) clades to trace
patterns of incipient genomic divergence, specifically related to social divergence. Although
homologous recombination occurs frequently within the two MLST clades, we find an almost
complete absence of recombination events between them. As the two clades are very closely related
and live in sympatry, either ecological or genetic barriers must reduce genetic exchange between
them. We find that the rate of change in the accessory genome is greater than the rate of amino-acid
substitution in the core genome. We identify a large genomic tract that consistently differs between
isolates that do not freely merge and therefore is a candidate region for harbouring gene(s)
responsible for self/non-self discrimination
Distinct Campylobacter fetus lineages adapted as livestock pathogens and human pathobionts in the intestinal microbiota
Campylobacter fetus is a venereal pathogen of cattle and sheep, and an opportunistic human pathogen. It is often assumed that C. fetus infection occurs in humans as a zoonosis through food chain transmission. Here we show that mammalian C. fetus consists of distinct evolutionary lineages, primarily associated with either human or bovine hosts. We use whole-genome phylogenetics on 182 strains from 17 countries to provide evidence that C. fetus may have originated in humans around 10,500 years ago and may have "jumped" into cattle during the livestock domestication period. We detect C. fetus genomes in 8% of healthy human fecal metagenomes, where the human-associated lineages are the dominant type (78%). Thus, our work suggests that C. fetus is an unappreciated human intestinal pathobiont likely spread by human to human transmission. This genome-based evolutionary framework will facilitate C. fetus epidemiology research and the development of improved molecular diagnostics and prevention schemes for this neglected pathogen