72 research outputs found

    The Genomics of Speciation in Drosophila: Diversity, Divergence, and Introgression Estimated Using Low-Coverage Genome Sequencing

    Get PDF
    In nature, closely related species may hybridize while still retaining their distinctive identities. Chromosomal regions that experience reduced recombination in hybrids, such as within inversions, have been hypothesized to contribute to the maintenance of species integrity. Here, we examine genomic sequences from closely related fruit fly taxa of the Drosophila pseudoobscura subgroup to reconstruct their evolutionary histories and past patterns of genic exchange. Partial genomic assemblies were generated from two subspecies of Drosophila pseudoobscura (D. ps.) and an outgroup species, D. miranda. These new assemblies were compared to available assemblies of D. ps. pseudoobscura and D. persimilis, two species with overlapping ranges in western North America. Within inverted regions, nucleotide divergence among each pair of the three species is comparable, whereas divergence between D. ps. pseudoobscura and D. persimilis in non-inverted regions is much lower and closer to levels of intraspecific variation. Using molecular markers flanking each of the major chromosomal inversions, we identify strong crossover suppression in F1 hybrids extending over 2 megabase pairs (Mbp) beyond the inversion breakpoints. These regions of crossover suppression also exhibit the high nucleotide divergence associated with inverted regions. Finally, by comparison to a geographically isolated subspecies, D. ps. bogotana, our results suggest that autosomal gene exchange between the North American species, D. ps. pseudoobscura and D. persimilis, occurred since the split of the subspecies, likely within the last 200,000 years. We conclude that chromosomal rearrangements have been vital to the ongoing persistence of these species despite recent hybridization. Our study serves as a proof-of-principle on how whole genome sequencing can be applied to formulate and test hypotheses about species formation in lesser-known non-model systems

    The Genomic Signature of Crop-Wild Introgression in Maize

    Get PDF
    The evolutionary significance of hybridization and subsequent introgression has long been appreciated, but evaluation of the genome-wide effects of these phenomena has only recently become possible. Crop-wild study systems represent ideal opportunities to examine evolution through hybridization. For example, maize and the conspecific wild teosinte Zea mays ssp. mexicana, (hereafter, mexicana) are known to hybridize in the fields of highland Mexico. Despite widespread evidence of gene flow, maize and mexicana maintain distinct morphologies and have done so in sympatry for thousands of years. Neither the genomic extent nor the evolutionary importance of introgression between these taxa is understood. In this study we assessed patterns of genome-wide introgression based on 39,029 single nucleotide polymorphisms genotyped in 189 individuals from nine sympatric maize-mexicana populations and reference allopatric populations. While portions of the maize and mexicana genomes were particularly resistant to introgression (notably near known cross-incompatibility and domestication loci), we detected widespread evidence for introgression in both directions of gene flow. Through further characterization of these regions and preliminary growth chamber experiments, we found evidence suggestive of the incorporation of adaptive mexicana alleles into maize during its expansion to the highlands of central Mexico. In contrast, very little evidence was found for adaptive introgression from maize to mexicana. The methods we have applied here can be replicated widely, and such analyses have the potential to greatly informing our understanding of evolution through introgressive hybridization. Crop species, due to their exceptional genomic resources and frequent histories of spread into sympatry with relatives, should be particularly influential in these studies

    Female Drosophila melanogaster Gene Expression and Mate Choice: The X Chromosome Harbours Candidate Genes Underlying Sexual Isolation

    Get PDF
    Background: The evolution of female choice mechanisms favouring males of their own kind is considered a crucial step during the early stages of speciation. However, although the genomics of mate choice may influence both the likelihood and speed of speciation, the identity and location of genes underlying assortative mating remain largely unknown. Methods and Findings: We used mate choice experiments and gene expression analysis of female Drosophila melanogaster to examine three key components influencing speciation. We show that the 1,498 genes in Zimbabwean female D. melanogaster whose expression levels differ when mating with more (Zimbabwean) versus less (Cosmopolitan strain) preferred males include many with high expression in the central nervous system and ovaries, are disproportionately X-linked and form a number of clusters with low recombination distance. Significant involvement of the brain and ovaries is consistent with the action of a combination of pre- and postcopulatory female choice mechanisms, while sex linkage and clustering of genes lead to high potential evolutionary rate and sheltering against the homogenizing effects of gene exchange between populations. Conclusion: Taken together our results imply favourable genomic conditions for the evolution of reproductive isolation through mate choice in Zimbabwean D. melanogaster and suggest that mate choice may, in general, act as an even more important engine of speciation than previously realized

    Pervasive Adaptive Protein Evolution Apparent in Diversity Patterns around Amino Acid Substitutions in Drosophila simulans

    Get PDF
    In Drosophila, multiple lines of evidence converge in suggesting that beneficial substitutions to the genome may be common. All suffer from confounding factors, however, such that the interpretation of the evidence—in particular, conclusions about the rate and strength of beneficial substitutions—remains tentative. Here, we use genome-wide polymorphism data in D. simulans and sequenced genomes of its close relatives to construct a readily interpretable characterization of the effects of positive selection: the shape of average neutral diversity around amino acid substitutions. As expected under recurrent selective sweeps, we find a trough in diversity levels around amino acid but not around synonymous substitutions, a distinctive pattern that is not expected under alternative models. This characterization is richer than previous approaches, which relied on limited summaries of the data (e.g., the slope of a scatter plot), and relates to underlying selection parameters in a straightforward way, allowing us to make more reliable inferences about the prevalence and strength of adaptation. Specifically, we develop a coalescent-based model for the shape of the entire curve and use it to infer adaptive parameters by maximum likelihood. Our inference suggests that ∼13% of amino acid substitutions cause selective sweeps. Interestingly, it reveals two classes of beneficial fixations: a minority (approximately 3%) that appears to have had large selective effects and accounts for most of the reduction in diversity, and the remaining 10%, which seem to have had very weak selective effects. These estimates therefore help to reconcile the apparent conflict among previously published estimates of the strength of selection. More generally, our findings provide unequivocal evidence for strongly beneficial substitutions in Drosophila and illustrate how the rapidly accumulating genome-wide data can be leveraged to address enduring questions about the genetic basis of adaptation

    The Use of Orthologous Sequences to Predict the Impact of Amino Acid Substitutions on Protein Function

    Get PDF
    Computational predictions of the functional impact of genetic variation play a critical role in human genetics research. For nonsynonymous coding variants, most prediction algorithms make use of patterns of amino acid substitutions observed among homologous proteins at a given site. In particular, substitutions observed in orthologous proteins from other species are often assumed to be tolerated in the human protein as well. We examined this assumption by evaluating a panel of nonsynonymous mutants of a prototypical human enzyme, methylenetetrahydrofolate reductase (MTHFR), in a yeast cell-based functional assay. As expected, substitutions in human MTHFR at sites that are well-conserved across distant orthologs result in an impaired enzyme, while substitutions present in recently diverged sequences (including a 9-site mutant that “resurrects” the human-macaque ancestor) result in a functional enzyme. We also interrogated 30 sites with varying degrees of conservation by creating substitutions in the human enzyme that are accepted in at least one ortholog of MTHFR. Quite surprisingly, most of these substitutions were deleterious to the human enzyme. The results suggest that selective constraints vary between phylogenetic lineages such that inclusion of distant orthologs to infer selective pressures on the human enzyme may be misleading. We propose that homologous proteins are best used to reconstruct ancestral sequences and infer amino acid conservation among only direct lineal ancestors of a particular protein. We show that such an “ancestral site preservation” measure outperforms other prediction methods, not only in our selected set for MTHFR, but also in an exhaustive set of E. coli LacI mutants

    Fine Scale Analysis of Crossover and Non-Crossover and Detection of Recombination Sequence Motifs in the Honeybee (Apis mellifera)

    Get PDF
    BACKGROUND: Meiotic exchanges are non-uniformly distributed across the genome of most studied organisms. This uneven distribution suggests that recombination is initiated by specific signals and/or regulations. Some of these signals were recently identified in humans and mice. However, it is unclear whether or not sequence signals are also involved in chromosomal recombination of insects. METHODOLOGY: We analyzed recombination frequencies in the honeybee, in which genome sequencing provided a large amount of SNPs spread over the entire set of chromosomes. As the genome sequences were obtained from a pool of haploid males, which were the progeny of a single queen, an oocyte method (study of recombination on haploid males that develop from unfertilized eggs and hence are the direct reflect of female gametes haplotypes) was developed to detect recombined pairs of SNP sites. Sequences were further compared between recombinant and non-recombinant fragments to detect recombination-specific motifs. CONCLUSIONS: Recombination events between adjacent SNP sites were detected at an average distance of 92 bp and revealed the existence of high rates of recombination events. This study also shows the presence of conversion without crossover (i. e. non-crossover) events, the number of which largely outnumbers that of crossover events. Furthermore the comparison of sequences that have undergone recombination with sequences that have not, led to the discovery of sequence motifs (CGCA, GCCGC, CCGCA), which may correspond to recombination signals

    Human Population Differentiation Is Strongly Correlated with Local Recombination Rate

    Get PDF
    Allele frequency differences across populations can provide valuable information both for studying population structure and for identifying loci that have been targets of natural selection. Here, we examine the relationship between recombination rate and population differentiation in humans by analyzing two uniformly-ascertained, whole-genome data sets. We find that population differentiation as assessed by inter-continental FST shows negative correlation with recombination rate, with FST reduced by 10% in the tenth of the genome with the highest recombination rate compared with the tenth of the genome with the lowest recombination rate (P≪10−12). This pattern cannot be explained by the mutagenic properties of recombination and instead must reflect the impact of selection in the last 100,000 years since human continental populations split. The correlation between recombination rate and FST has a qualitatively different relationship for FST between African and non-African populations and for FST between European and East Asian populations, suggesting varying levels or types of selection in different epochs of human history

    Haldane's rule in the 21st century

    Get PDF
    Haldane's Rule (HR), which states that 'when in the offspring of two different animal races one sex is absent, rare, or sterile, that sex is the heterozygous (heterogametic) sex', is one of the most general patterns in speciation biology. We review the literature of the past 15 years and find that among the similar to 85 new studies, many consider taxa that traditionally have not been the focus for HR investigations. The new studies increased to nine, the number of 'phylogenetically independent' groups that comply with HR. They continue to support the dominance and faster-male theories as explanations for HR, although due to increased reliance on indirect data (from, for example, differential introgression of cytoplasmic versus chromosomal loci in natural hybrid zones) unambiguous novel results are rare. We further highlight how research on organisms with sex determination systems different from those traditionally considered may lead to more insight in the underlying causes of HR. In particular, haplodiploid organisms provide opportunities for testing specific predictions of the dominance and faster X chromosome theory, and we present new data that show that the faster-male component of HR is supported in hermaphrodites, suggesting that genes involved in male function may evolve faster than those expressed in the female function. Heredity (2011) 107, 95-102; doi:10.1038/hdy.2010.170; published online 12 January 201

    Genome-wide fine-scale recombination rate variation in Drosophila melanogaster

    Get PDF
    Estimating fine-scale recombination maps of Drosophila from population genomic data is a challenging problem, in particular because of the high background recombination rate. In this paper, a new computational method is developed to address this challenge. Through an extensive simulation study, it is demonstrated that the method allows more accurate inference, and exhibits greater robustness to the effects of natural selection and noise, compared to a well-used previous method developed for studying fine-scale recombination rate variation in the human genome. As an application, a genome-wide analysis of genetic variation data is performed for two Drosophila melanogaster populations, one from North America (Raleigh, USA) and the other from Africa (Gikongoro, Rwanda). It is shown that fine-scale recombination rate variation is widespread throughout the D. melanogaster genome, across all chromosomes and in both populations. At the fine-scale, a conservative, systematic search for evidence of recombination hotspots suggests the existence of a handful of putative hotspots each with at least a tenfold increase in intensity over the background rate. A wavelet analysis is carried out to compare the estimated recombination maps in the two populations and to quantify the extent to which recombination rates are conserved. In general, similarity is observed at very broad scales, but substantial differences are seen at fine scales. The average recombination rate of the X chromosome appears to be higher than that of the autosomes in both populations, and this pattern is much more pronounced in the African population than the North American population. The correlation between various genomic features—including recombination rates, diversity, divergence, GC content, gene content, and sequence quality—is examined using the wavelet analysis, and it is shown that the most notable difference between D. melanogaster and humans is in the correlation between recombination and diversity

    Patterns of Sequence Divergence and Evolution of the S1 Orthologous Regions between Asian and African Cultivated Rice Species

    Get PDF
    A strong postzygotic reproductive barrier separates the recently diverged Asian and African cultivated rice species, Oryza sativa and O. glaberrima. Recently a model of genetic incompatibilities between three adjacent loci: S1A, S1 and S1B (called together the S1 regions) interacting epistatically, was postulated to cause the allelic elimination of female gametes in interspecific hybrids. Two candidate factors for the S1 locus (including a putative F-box gene) were proposed, but candidates for S1A and S1B remained undetermined. Here, to better understand the basis of the evolution of regions involved in reproductive isolation, we studied the genic and structural changes accumulated in the S1 regions between orthologous sequences. First, we established an 813 kb genomic sequence in O. glaberrima, covering completely the S1A, S1 and the majority of the S1B regions, and compared it with the orthologous regions of O. sativa. An overall strong structural conservation was observed, with the exception of three isolated regions of disturbed collinearity: (1) a local invasion of transposable elements around a putative F-box gene within S1, (2) the multiple duplication and subsequent divergence of the same F-box gene within S1A, (3) an interspecific chromosomal inversion in S1B, which restricts recombination in our O. sativa×O. glaberrima crosses. Beside these few structural variations, a uniform conservative pattern of coding sequence divergence was found all along the S1 regions. Hence, the S1 regions have undergone no drastic variation in their recent divergence and evolution between O. sativa and O. glaberrima, suggesting that a small accumulation of genic changes, following a Bateson-Dobzhansky-Muller (BDM) model, might be involved in the establishment of the sterility barrier. In this context, genetic incompatibilities involving the duplicated F-box genes as putative candidates, and a possible strengthening step involving the chromosomal inversion might participate to the reproductive barrier between Asian and African rice species
    corecore