68 research outputs found
The Drosophila genome nexus: a population genomic resource of 623 Drosophila melanogaster genomes, including 197 from a single ancestral range population.
Hundreds of wild-derived Drosophila melanogaster genomes have been published, but rigorous comparisons across data sets are precluded by differences in alignment methodology. The most common approach to reference-based genome assembly is a single round of alignment followed by quality filtering and variant detection. We evaluated variations and extensions of this approach and settled on an assembly strategy that utilizes two alignment programs and incorporates both substitutions and short indels to construct an updated reference for a second round of mapping prior to final variant detection. Utilizing this approach, we reassembled published D. melanogaster population genomic data sets and added unpublished genomes from several sub-Saharan populations. Most notably, we present aligned data from phase 3 of the Drosophila Population Genomics Project (DPGP3), which provides 197 genomes from a single ancestral range population of D. melanogaster (from Zambia). The large sample size, high genetic diversity, and potentially simpler demographic history of the DPGP3 sample will make this a highly valuable resource for fundamental population genetic research. The complete set of assemblies described here, termed the Drosophila Genome Nexus, presently comprises 623 consistently aligned genomes and is publicly available in multiple formats with supporting documentation and bioinformatic tools. This resource will greatly facilitate population genomic analysis in this model species by reducing the methodological differences between data sets
Conserved novel ORFs in the mitochondrial genome of the ctenophore Beroe forskalii
To date, five ctenophore species’ mitochondrial genomes have been sequenced, and each contains open reading frames (ORFs) that if translated have no identifiable orthologs. ORFs with no identifiable orthologs are called unidentified reading frames (URFs). If truly protein-coding, ctenophore mitochondrial URFs represent a little understood path in early-diverging metazoan mitochondrial evolution and metabolism. We sequenced and annotated the mitochondrial genomes of three individuals of the beroid ctenophore Beroe forskalii and found that in addition to sharing the same canonical mitochondrial genes as other ctenophores, the B. forskalii mitochondrial genome contains two URFs. These URFs are conserved among the three individuals but not found in other sequenced species. We developed computational tools called pauvre and cuttlery to determine the likelihood that URFs are protein coding. There is evidence that the two URFs are under negative selection, and a novel Bayesian hypothesis test of trinucleotide frequency shows that the URFs are more similar to known coding genes than noncoding intergenic sequence. Protein structure and function prediction of all ctenophore URFs suggests that they all code for transmembrane transport proteins. These findings, along with the presence of URFs in other sequenced ctenophore mitochondrial genomes, suggest that ctenophores may have uncharacterized transmembrane proteins present in their mitochondria
Natural selection shaped the rise and fall of passenger pigeon genomic diversity.
The extinct passenger pigeon was once the most abundant bird in North America, and possibly the world. Although theory predicts that large populations will be more genetically diverse, passenger pigeon genetic diversity was surprisingly low. To investigate this disconnect, we analyzed 41 mitochondrial and 4 nuclear genomes from passenger pigeons and 2 genomes from band-tailed pigeons, which are passenger pigeons' closest living relatives. Passenger pigeons' large population size appears to have allowed for faster adaptive evolution and removal of harmful mutations, driving a huge loss in their neutral genetic diversity. These results demonstrate the effect that selection can have on a vertebrate genome and contradict results that suggested that population instability contributed to this species's surprisingly rapid extinction
Population genomics of sub-Saharan Drosophila melanogaster: African diversity and non-African admixture
(ABRIDGED) We report the genome sequencing of 139 wild-derived strains of D.
melanogaster, representing 22 population samples from the sub-Saharan ancestral
range of this species, along with one European population. Most genomes were
sequenced above 25X depth from haploid embryos. Results indicated a pervasive
influence of non-African admixture in many African populations, motivating the
development and application of a novel admixture detection method. Admixture
proportions varied among populations, with greater admixture in urban
locations. Admixture levels also varied across the genome, with localized peaks
and valleys suggestive of a non-neutral introgression process. Genomes from the
same location differed starkly in ancestry, suggesting that isolation
mechanisms may exist within African populations. After removing putatively
admixed genomic segments, the greatest genetic diversity was observed in
southern Africa (e.g. Zambia), while diversity in other populations was largely
consistent with a geographic expansion from this potentially ancestral region.
The European population showed different levels of diversity reduction on each
chromosome arm, and some African populations displayed chromosome arm-specific
diversity reductions. Inversions in the European sample were associated with
strong elevations in diversity across chromosome arms. Genomic scans were
conducted to identify loci that may represent targets of positive selection. A
disproportionate number of candidate selective sweep regions were located near
genes with varied roles in gene regulation. Outliers for Europe-Africa FST were
found to be enriched in genomic regions of locally elevated cosmopolitan
admixture, possibly reflecting a role for some of these loci in driving the
introgression of non-African alleles into African populations
Recommended from our members
Mixed transmission modes and dynamic genome evolution in an obligate animal–bacterial symbiosis
Reliable transmission of symbionts between host generations facilitates the evolution of beneficial and pathogenic associations. Although transmission mode is typically characterized as either vertical or horizontal, the prevalence of intermediate transmission modes, and their impact on symbiont genome evolution, are understudied. Here, we use population genomics to explore mixed transmission modes of chemosynthetic bacterial symbionts in the bivalve Solemya velum. Despite strong evidence for symbiont inheritance through host oocytes, whole-genome analyses revealed signatures of frequent horizontal transmission, including discordant mitochondrial-symbiont genealogies, widespread recombination and a dynamic symbiont genome structure consistent with evolutionary patterns of horizontally transmitted associations. Population-level analyses thus provide a tractable means of ascertaining the fidelity of vertical versus horizontal transmission. Our data support the strong influence horizontal transmission can have on symbiont genome evolution, and shed light on the dynamic evolutionary pressures shaping symbiotic bacterial genomes
Natural Selection Constrains Neutral Diversity across A Wide Range of Species
The neutral theory of molecular evolution predicts that the amount of neutral polymorphisms within a species will increase proportionally with the census population size (Nc). However, this prediction has not been borne out in practice: while the range of Nc spans many orders of magnitude, levels of genetic diversity within species fall in a comparatively narrow range. Although theoretical arguments have invoked the increased efficacy of natural selection in larger populations to explain this discrepancy, few direct empirical tests of this hypothesis have been conducted. In this work, we provide a direct test of this hypothesis using population genomic data from a wide range of taxonomically diverse species. To do this, we relied on the fact that the impact of natural selection on linked neutral diversity depends on the local recombinational environment. In regions of relatively low recombination, selected variants affect more neutral sites through linkage, and the resulting correlation between recombination and polymorphism allows a quantitative assessment of the magnitude of the impact of selection on linked neutral diversity. By comparing whole genome polymorphism data and genetic maps using a coalescent modeling framework, we estimate the degree to which natural selection reduces linked neutral diversity for 40 species of obligately sexual eukaryotes. We then show that the magnitude of the impact of natural selection is positively correlated with Nc, based on body size and species range as proxies for census population size. These results demonstrate that natural selection removes more variation at linked neutral sites in species with large Nc than those with small Nc and provides direct empirical evidence that natural selection constrains levels of neutral genetic diversity across many species. This implies that natural selection may provide an explanation for this longstanding paradox of population genetics
- …