14 research outputs found

    The Diversity of REcent and Ancient huMan (DREAM): a new microarray for genetic anthropology and genealogy, forensics, and personalized medicine

    Get PDF
    The human population displays wide variety in demographic history, ancestry, content of DNA derived from hominins or ancient populations, adaptation, traits, copy number variation (CNVs), drug response, and more. These polymorphisms are of broad interest to population geneticists, forensics investigators, and medical professionals. Historically, much of that knowledge was gained from population survey projects. While many commercial arrays exist for genome-wide single-nucleotide polymorphism (SNP) genotyping, their design specifications are limited and they do not allow a full exploration of biodiversity. We thereby aimed to design the Diversity of REcent and Ancient huMan (DREAM) - an all-inclusive microarray that would allow both identification of known associations and exploration of standing questions in genetic anthropology, forensics, and personalized medicine. DREAM includes probes to interrogate ancestry informative markers obtained from over 450 human populations, over 200 ancient genomes, and 10 archaic hominins. DREAM can identify 94% and 61% of all known Y and mitochondrial haplogroups, respectively and was vetted to avoid interrogation of clinically relevant markers. To demonstrate its capabilities, we compared its FST distributions with those of the 1000 Genomes Project and commercial arrays. Although all arrays yielded similarly shaped (inverse J) FST distributions, DREAM's autosomal and X-chromosomal distributions had the highest mean FST, attesting to its ability to discern subpopulations. DREAM performances are further illustrated in biogeographical, identical by descent (IBD), and CNV analyses. In summary, with approximately 800,000 markers spanning nearly 2,000 genes, DREAM is a useful tool for genetic anthropology, forensic, and personalized medicine studies

    Drosophila evolution over space and time (DEST):A new population genomics resource

    Get PDF
    Drosophila melanogaster is a leading model in population genetics and genomics, and a growing number of whole-genome datasets from natural populations of this species have been published over the last years. A major challenge is the integration of disparate datasets, often generated using different sequencing technologies and bioinformatic pipelines, which hampers our ability to address questions about the evolution of this species. Here we address these issues by developing a bioinformatics pipeline that maps pooled sequencing (Pool-Seq) reads from D. melanogaster to a hologenome consisting of fly and symbiont genomes and estimates allele frequencies using either a heuristic (PoolSNP) or a probabilistic variant caller (SNAPE-pooled). We use this pipeline to generate the largest data repository of genomic data available for D. melanogaster to date, encompassing 271 previously published and unpublished population samples from over 100 locations in > 20 countries on four continents. Several of these locations have been sampled at different seasons across multiple years. This dataset, which we call Drosophila Evolution over Space and Time (DEST), is coupled with sampling and environmental meta-data. A web-based genome browser and web portal provide easy access to the SNP dataset. We further provide guidelines on how to use Pool-Seq data for model-based demographic inference. Our aim is to provide this scalable platform as a community resource which can be easily extended via future efforts for an even more extensive cosmopolitan dataset. Our resource will enable population geneticists to analyze spatio-temporal genetic patterns and evolutionary dynamics of D. melanogaster populations in unprecedented detail.DrosEU is funded by a Special Topic Networks (STN) grant from the European Society for Evolutionary Biology (ESEB). MK (M. Kapun) was supported by the Austrian Science Foundation (grant no. FWF P32275); JG by the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (H2020-ERC-2014-CoG-647900) and by the Spanish Ministry of Science and Innovation (BFU-2011-24397); TF by the Swiss National Science Foundation (SNSF grants PP00P3_133641, PP00P3_165836, and 31003A_182262) and a Mercator Fellowship from the German Research Foundation (DFG), held as a EvoPAD Visiting Professor at the Institute for Evolution and Biodiversity, University of Münster; AOB by the National Institutes of Health (R35 GM119686); MK (M. Kankare) by Academy of Finland grant 322980; VL by Danish Natural Science Research Council (FNU) grant 4002-00113B; FS Deutsche Forschungsgemeinschaft (DFG) grant STA1154/4-1, Project 408908608; JP by the Deutsche Forschungsgemeinschaft Projects 274388701 and 347368302; AU by FPI fellowship (BES-2012-052999); ET Israel Science Foundation (ISF) grant 1737/17; MSV, MSR and MJ by a grant from the Ministry of Education, Science and Technological Development of the Republic of Serbia (451-03-68/2020-14/200178); AP, KE and MT by a grant from the Ministry of Education, Science and Technological Development of the Republic of Serbia (451-03-68/2020-14/200007); and TM NSERC grant RGPIN-2018-05551.Peer reviewe

    Corrigendum to: Drosophila Evolution over Space and Time (DEST): a New Population Genomics Resource

    Get PDF
    Drosophila melanogaster is a leading model in population genetics and genomics, and a growing number of whole-genome datasets from natural populations of this species have been published over the last years. A major challenge is the integration of disparate datasets, often generated using different sequencing technologies and bioinformatic pipelines, which hampers our ability to address questions about the evolution of this species. Here we address these issues by developing a bioinformatics pipeline that maps pooled sequencing (Pool-Seq) reads from D. melanogaster to a hologenome consisting of fly and symbiont genomes and estimates allele frequencies using either a heuristic (PoolSNP) or a probabilistic variant caller (SNAPE-pooled). We use this pipeline to generate the largest data repository of genomic data available for D. melanogaster to date, encompassing 271 previously published and unpublished population samples from over 100 locations in > 20 countries on four continents. Several of these locations have been sampled at different seasons across multiple years. This dataset, which we call Drosophila Evolution over Space and Time (DEST), is coupled with sampling and environmental meta-data. A web-based genome browser and web portal provide easy access to the SNP dataset. We further provide guidelines on how to use Pool-Seq data for model-based demographic inference. Our aim is to provide this scalable platform as a community resource which can be easily extended via future efforts for an even more extensive cosmopolitan dataset. Our resource will enable population geneticists to analyze spatio-temporal genetic patterns and evolutionary dynamics of D. melanogaster populations in unprecedented detail.DrosEU is funded by a Special Topic Networks (STN) grant from the European Society for Evolutionary Biology (ESEB). MK (M. Kapun) was supported by the Austrian Science Foundation (grant no. FWF P32275); JG by the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (H2020-ERC-2014-CoG-647900) and by the Spanish Ministry of Science and Innovation (BFU-2011-24397); TF by the Swiss National Science Foundation (SNSF grants PP00P3_133641, PP00P3_165836, and 31003A_182262) and a Mercator Fellowship from the German Research Foundation (DFG), held as a EvoPAD Visiting Professor at the Institute for Evolution and Biodiversity, University of Münster; AOB by the National Institutes of Health (R35 GM119686); MK (M. Kankare) by Academy of Finland grant 322980; VL by Danish Natural Science Research Council (FNU) grant 4002-00113B; FS Deutsche Forschungsgemeinschaft (DFG) grant STA1154/4-1, Project 408908608; JP by the Deutsche Forschungsgemeinschaft Projects 274388701 and 347368302; AU by FPI fellowship (BES-2012-052999); ET Israel Science Foundation (ISF) grant 1737/17; MSV, MSR and MJ by a grant from the Ministry of Education, Science and Technological Development of the Republic of Serbia (451-03-68/2020-14/200178); AP, KE and MT by a grant from the Ministry of Education, Science and Technological Development of the Republic of Serbia (451-03-68/2020-14/200007); and TM NSERC grant RGPIN-2018-05551.Peer reviewe

    Genomic patterns of divergence and gene flow in<i> Drosophila</i> and <i>Goodeidae</i>

    No full text
    Understanding the impact of the presence or absence of geographic barriers on speciation has been a source of enduring interest in evolutionary biology. The degree of geographical isolation and the prevalence of gene flow during divergence may often determine the rate of reproductive isolation and the relative importance of other evolutionary forces. In this thesis, I use comparative genomic approaches to ask how frequent gene flow is, understand the contributions of ancient and recent gene flow to divergence, and finally, examine how patterns of genetic divergence between closely-related species are affected by geography and gene flow in two different systems: Drosophila and Splitfins (Goodeidae). I investigate the evolutionary history of divergence in the virilis group of Drosophila using de novo whole-genome sequences. I show that ancient and recent gene flow is common complicating phylogenetic inference and confounding levels of genetic divergence on the X chromosome across species pairs in the group. Next, I test whether gene flow is common across 96 species pairs in Drosophila. Using model-based inference from whole-genome data, I demonstrate that both allopatric and sympatric species pairs in Drosophila show similar support for models of speciation-with-gene-flow. Additionally, using inferred migration and divergence time estimates for species pairs across Drosophila, I show that evidence for reinforcement is mixed. In the final two chapters, I examined patterns of divergence and gene flow in Goodeidae. Using whole-genome sequences and de novo assemblies, I show that divergence of Goodeidae occurred rapidly, alongside consistent fluctuations in population size and limited ancient gene flow. Finally, I employ a broad comparative genomic approach using 21 genomes across Cyprinodontiformes to identify signatures of molecular convergence and positive selection linked to the evolution of viviparity in Goodeidae and Poecilidae. Altogether, this thesis demonstrates that complex speciation histories with gene flow are the rule rather than the exception.

    Divergence and introgression among the virilis group of Drosophila

    Get PDF
    Speciation with gene flow is now widely regarded as common. However, the frequency of introgression between recently diverged species and the evolutionary consequences of gene flow are still poorly understood. The virilis group of Drosophila contains 12 species that are geographically widespread and show varying levels of prezygotic and postzygotic isolation. Here, we use de novo genome assemblies and whole-genome sequencing data to resolve phylogenetic relationships and describe patterns of introgression and divergence across the group. We suggest that the virilis group consists of three, rather than the traditional two, subgroups. Some genes undergoing rapid sequence divergence across the group were involved in chemical communication and desiccation tolerance, and may be related to the evolution of sexual isolation and adaptation. We found evidence of pervasive phylogenetic discordance caused by ancient introgression events between distant lineages within the group, and more recent gene flow between closely related species. When assessing patterns of genome-wide divergence in species pairs across the group, we found no consistent genomic evidence of a disproportionate role for the X chromosome as has been found in other systems. Our results show how ancient and recent introgressions confuse phylogenetic reconstruction, but may play an important role during early radiation of a group.peerReviewe

    Noncoding regions underpin avian bill shape diversification at macroevolutionary scales

    Get PDF
    Recent progress has been made in identifying genomic regions implicated in trait evolution on a microevolutionary scale in many species, but whether these are relevant over macroevolutionary time remains unclear. Here, we directly address this fundamental question using bird beak shape, a key evolutionary innovation linked to patterns of resource use, divergence, and speciation, as a model trait. We integrate class-wide geometric-morphometric analyses with evolutionary sequence analyses of 10,322 protein-coding genes as well as 229,001 genomic regions spanning 72 species. We identify 1434 protein-coding genes and 39,806 noncoding regions for which molecular rates were significantly related to rates of bill shape evolution. We show that homologs of the identified protein-coding genes as well as genes in close proximity to the identified noncoding regions are involved in craniofacial embryo development in mammals. They are associated with embryonic stem cell pathways, including BMP and Wnt signaling, both of which have repeatedly been implicated in the morphological development of avian beaks. This suggests that identifying genotype-phenotype association on a genome-wide scale over macroevolutionary time is feasible. Although the coding and noncoding gene sets are associated with similar pathways, the actual genes are highly distinct, with significantly reduced overlap between them and bill-related phenotype associations specific to noncoding loci. Evidence for signatures of recent diversifying selection on our identified noncoding loci in Darwin finch populations further suggests that regulatory rather than coding changes are major drivers of morphological diversification over macroevolutionary times.Peer reviewe

    Within-population sperm competition intensity does not predict asymmetry in conpopulation sperm precedence

    Get PDF
    Funding: M.D.G. was supported by an Adapting to the Challenges of a Changing Environment (ACCE) Doctoral Training Partnership grant no. NE/L002450/1, funded by the Natural Environment Research Council (NERC). L.H.Y. was supported by a studentship funded bythe University of St Andrews and MGR.Postcopulatory sexual selection can generate evolutionary arms races between the sexes resulting in the rapid coevolution of reproductive phenotypes. As traits affecting fertilization success diverge between populations, postmating prezygotic (PMPZ) barriers to gene flow may evolve. Conspecific sperm precedence is a form of PMPZ isolation thought to evolve early during speciation yet has mostly been studied between species. Here, we show conpopulation sperm precedence (CpSP) between Drosophila montana populations. Using Pool-seq genomic data we estimate divergence times and ask whether PMPZ isolation evolved in the face of gene flow. We find models incorporating gene flow fit the data best indicating populations experienced considerable gene flow during divergence. We find CpSP is asymmetric and mirrors asymmetry in non-competitive PMPZ isolation, suggesting these phenomena have a shared mechanism. However, we show asymmetry is unrelated to the strength of postcopulatory sexual selection acting within populations. We tested whether overlapping foreign and coevolved ejaculates within the female reproductive tract altered fertilization success but found no effect. Our results show that neither time since divergence nor sperm competitiveness predicts the strength of PMPZ isolation. We suggest that instead cryptic female choice or mutation-order divergence may drive divergence of postcopulatory phenotypes resulting in PMPZ isolation. This article is part of the theme issue 'Fifty years of sperm competition'.Publisher PDFPeer reviewe

    Genomic signatures associated with transitions to viviparity in <i>Cyprinodontiformes</i>

    Get PDF
    Funding: LY was supported by a University of St Andrews studentship. MGR, CMG & YSL are supported by a Leverhulme research grant RPG-2020-181, by a Programa de Apoyo a Proyectos de Investigación e Inovación Tecnológica (PAPIIT) research grant (PAPIIT IN210718) and by a Consejo Nacional de Ciencia y Tecnología (CONACyT) Ciencia de Frontera research grant A1-S-33467. PT and bioinformatics and computational biology analyses were supported by the University of St Andrews Bioinformatics Unit (AMD3BIOINF), funded by Wellcome Trust ISSF award 105621/Z/14/Z. Additional HPC (Crop Diversity) were awarded as part of a BBSRC 18ALERT grant (2).The transition from oviparity to viviparity has occurred independently over 150 times across vertebrates, presenting one of the most compelling cases of phenotypic convergence. However, whether the repeated, independent evolution of viviparity is driven by redeployment of similar genetic mechanisms and whether these leave a common signature in genomic divergence remains largely unknown. Whilst recent investigations into the evolution of viviparity have demonstrated striking similarity among the genes and molecular pathways involved across disparate vertebrate groups, quantitative tests for genome-wide convergent have provided ambivalent answers. Here, we investigate the potential role of molecular convergence during independent transitions to viviparity across an order of ray-finned freshwater fish (Cyprinodontiformes). We assembled de novo genomes and utilized publicly-available genomes of viviparous and oviparous species to test for molecular convergence across both coding and non-coding regions. We found no evidence for an excess of molecular convergence in amino acid substitutions and in rates of sequence divergence, implying independent genetic changes are associated with these transitions. However, both statistical power and biological confounds could constrain our ability to detect significant correlated evolution. We therefore identified candidate genes with potential signatures of molecular convergence in viviparous Cyprinodontiformes lineages. Motif-enrichment and gene ontology analyses suggest transcriptional changes associated with early morphogenesis, brain development and immunity occurred alongside the evolution of viviparity. Overall, however, our findings indicate that independent transitions to viviparity in these fish is not strongly associated with an excess of molecular convergence, but a few genes show convincing evidence of convergent evolution.Publisher PDFPeer reviewe
    corecore