15,226 research outputs found
Neutral genomic microevolution of a recently emerged pathogen, salmonella enterica serovar agona
Salmonella enterica serovar Agona has caused multiple food-borne outbreaks of gastroenteritis since it was first isolated in
1952. We analyzed the genomes of 73 isolates from global sources, comparing five distinct outbreaks with sporadic
infections as well as food contamination and the environment. Agona consists of three lineages with minimal mutational
diversity: only 846 single nucleotide polymorphisms (SNPs) have accumulated in the non-repetitive, core genome since
Agona evolved in 1932 and subsequently underwent a major population expansion in the 1960s. Homologous
recombination with other serovars of S. enterica imported 42 recombinational tracts (360 kb) in 5/143 nodes within the
genealogy, which resulted in 3,164 additional SNPs. In contrast to this paucity of genetic diversity, Agona is highly diverse
according to pulsed-field gel electrophoresis (PFGE), which is used to assign isolates to outbreaks. PFGE diversity reflects a
highly dynamic accessory genome associated with the gain or loss (indels) of 51 bacteriophages, 10 plasmids, and 6
integrative conjugational elements (ICE/IMEs), but did not correlate uniquely with outbreaks. Unlike the core genome, indels
occurred repeatedly in independent nodes (homoplasies), resulting in inaccurate PFGE genealogies. The accessory genome
contained only few cargo genes relevant to infection, other than antibiotic resistance. Thus, most of the genetic diversity
within this recently emerged pathogen reflects changes in the accessory genome, or is due to recombination, but these
changes seemed to reflect neutral processes rather than Darwinian selection. Each outbreak was caused by an independent
clade, without universal, outbreak-associated genomic features, and none of the variable genes in the pan-genome seemed
to be associated with an ability to cause outbreaks
Recommended from our members
Abundance of conserved CRISPR-Cas9 target sites within the highly polymorphic genomes of Anopheles and Aedes mosquitoes.
A number of recent papers report that standing genetic variation in natural populations includes ubiquitous polymorphisms within target sites for Cas9-based gene drive (CGD) and that these "drive resistant alleles" (DRA) preclude the successful application of CGD for managing these populations. Here we report the results of a survey of 1280 genomes of the mosquitoes Anopheles gambiae, An. coluzzii, and Aedes aegypti in which we determine that ~90% of all protein-encoding CGD target genes in natural populations include at least one target site with no DRAs at a frequency of ≥1.0%. We conclude that the abundance of conserved target sites in mosquito genomes and the inherent flexibility in CGD design obviates the concern that DRAs present in the standing genetic variation of mosquito populations will be detrimental to the deployment of this technology for population modification strategies
Progressive Mauve: Multiple alignment of genomes with gene flux and rearrangement
Multiple genome alignment remains a challenging problem. Effects of
recombination including rearrangement, segmental duplication, gain, and loss
can create a mosaic pattern of homology even among closely related organisms.
We describe a method to align two or more genomes that have undergone
large-scale recombination, particularly genomes that have undergone substantial
amounts of gene gain and loss (gene flux). The method utilizes a novel
alignment objective score, referred to as a sum-of-pairs breakpoint score. We
also apply a probabilistic alignment filtering method to remove erroneous
alignments of unrelated sequences, which are commonly observed in other genome
alignment methods. We describe new metrics for quantifying genome alignment
accuracy which measure the quality of rearrangement breakpoint predictions and
indel predictions. The progressive genome alignment algorithm demonstrates
markedly improved accuracy over previous approaches in situations where genomes
have undergone realistic amounts of genome rearrangement, gene gain, loss, and
duplication. We apply the progressive genome alignment algorithm to a set of 23
completely sequenced genomes from the genera Escherichia, Shigella, and
Salmonella. The 23 enterobacteria have an estimated 2.46Mbp of genomic content
conserved among all taxa and total unique content of 15.2Mbp. We document
substantial population-level variability among these organisms driven by
homologous recombination, gene gain, and gene loss. Free, open-source software
implementing the described genome alignment approach is available from
http://gel.ahabs.wisc.edu/mauve .Comment: Revision dated June 19, 200
Genome-wide signatures of complex introgression and adaptive evolution in the big cats.
The great cats of the genus Panthera comprise a recent radiation whose evolutionary history is poorly understood. Their rapid diversification poses challenges to resolving their phylogeny while offering opportunities to investigate the historical dynamics of adaptive divergence. We report the sequence, de novo assembly, and annotation of the jaguar (Panthera onca) genome, a novel genome sequence for the leopard (Panthera pardus), and comparative analyses encompassing all living Panthera species. Demographic reconstructions indicated that all of these species have experienced variable episodes of population decline during the Pleistocene, ultimately leading to small effective sizes in present-day genomes. We observed pervasive genealogical discordance across Panthera genomes, caused by both incomplete lineage sorting and complex patterns of historical interspecific hybridization. We identified multiple signatures of species-specific positive selection, affecting genes involved in craniofacial and limb development, protein metabolism, hypoxia, reproduction, pigmentation, and sensory perception. There was remarkable concordance in pathways enriched in genomic segments implicated in interspecies introgression and in positive selection, suggesting that these processes were connected. We tested this hypothesis by developing exome capture probes targeting ~19,000 Panthera genes and applying them to 30 wild-caught jaguars. We found at least two genes (DOCK3 and COL4A5, both related to optic nerve development) bearing significant signatures of interspecies introgression and within-species positive selection. These findings indicate that post-speciation admixture has contributed genetic material that facilitated the adaptive evolution of big cat lineages
The genome of the medieval Black Death agent (extended abstract)
The genome of a 650 year old Yersinia pestis bacteria, responsible for the
medieval Black Death, was recently sequenced and assembled into 2,105 contigs
from the main chromosome. According to the point mutation record, the medieval
bacteria could be an ancestor of most Yersinia pestis extant species, which
opens the way to reconstructing the organization of these contigs using a
comparative approach. We show that recent computational paleogenomics methods,
aiming at reconstructing the organization of ancestral genomes from the
comparison of extant genomes, can be used to correct, order and complete the
contig set of the Black Death agent genome, providing a full chromosome
sequence, at the nucleotide scale, of this ancient bacteria. This sequence
suggests that a burst of mobile elements insertions predated the Black Death,
leading to an exceptional genome plasticity and increase in rearrangement rate.Comment: Extended abstract of a talk presented at the conference JOBIM 2013,
https://colloque.inra.fr/jobim2013_eng/. Full paper submitte
Recommended from our members
Double-digest RADseq loci using standard Illumina indexes improve deep and shallow phylogenetic resolution of Lophodermium, a widespread fungal endophyte of pine needles.
The phylogenetic and population genetic structure of symbiotic microorganisms may correlate with important ecological traits that can be difficult to directly measure, such as host preferences or dispersal rates. This study develops and tests a low-cost double-digest restriction site-associated DNA sequencing (ddRADseq) protocol to reveal among- and within-species genetic structure for Lophodermium, a genus of fungal endophytes whose evolutionary analyses have been limited by the scarcity of informative markers. The protocol avoids expensive barcoded adapters and incorporates universal indexes for multiplexing. We tested for reproducibility and functionality by comparing shared loci from sample replicates and assessed the effects of numbers of ambiguous sites and clustering thresholds on coverage depths, number of shared loci among samples, and phylogenetic reconstruction. Errors between technical replicates were minimal. Relaxing the quality-filtering criteria increased the mean coverage depth per locus and the number of loci recovered within a sample, but had little effect on the number of shared loci across samples. Increasing clustering threshold decreased the mean coverage depth per cluster and increased the number of loci recovered within a sample but also decreased the number of shared loci across samples, especially among distantly related species. The combination of low similarity clustering (70%) and relaxed quality-filtering (allowing up to 30 ambiguous sites per read) performed the best in phylogenetic analyses at both recent and deep genetic divergences. Hence, this method generated sufficient number of shared homologous loci to investigate the evolutionary relationships among divergent fungal lineages with small haploid genomes. The greater genetic resolution also revealed new structure within species that correlated with ecological traits, providing valuable insights into their cryptic life histories
De novo human genome assemblies reveal spectrum of alternative haplotypes in diverse populations.
The human reference genome is used extensively in modern biological research. However, a single consensus representation is inadequate to provide a universal reference structure because it is a haplotype among many in the human population. Using 10× Genomics (10×G) "Linked-Read" technology, we perform whole genome sequencing (WGS) and de novo assembly on 17 individuals across five populations. We identify 1842 breakpoint-resolved non-reference unique insertions (NUIs) that, in aggregate, add up to 2.1 Mb of so far undescribed genomic content. Among these, 64% are considered ancestral to humans since they are found in non-human primate genomes. Furthermore, 37% of the NUIs can be found in the human transcriptome and 14% likely arose from Alu-recombination-mediated deletion. Our results underline the need of a set of human reference genomes that includes a comprehensive list of alternative haplotypes to depict the complete spectrum of genetic diversity across populations
- …