18 research outputs found

    Sweeps in time:Leveraging the joint distribution of branch lengths

    Get PDF
    Current methods of identifying positively selected regions in the genome are limited in two key ways: the underlying models cannot account for the timing of adaptive events and the comparison between models of selective sweeps and sequence data is generally made via simple summaries of genetic diversity. Here, we develop a tractable method of describing the effect of positive selection on the genealogical histories in the surrounding genome, explicitly modeling both the timing and context of an adaptive event. In addition, our framework allows us to go beyond analyzing polymorphism data via the site frequency spectrum or summaries thereof and instead leverage information contained in patterns of linked variants. Tests on both simulations and a human data example, as well as a comparison to SweepFinder2, show that even with very small sample sizes, our analytic framework has higher power to identify old selective sweeps and to correctly infer both the time and strength of selection. Finally, we derived the marginal distribution of genealogical branch lengths at a locus affected by selection acting at a linked site. This provides a much-needed link between our analytic understanding of the effects of sweeps on sequence variation and recent advances in simulation and heuristic inference procedures that allow researchers to examine the sequence of genealogical histories along the genome

    VolcanoFinder:Genomic scans for adaptive introgression

    Get PDF
    Recent research shows that introgression between closely-related species is an important source of adaptive alleles for a wide range of taxa. Typically, detection of adaptive introgression from genomic data relies on comparative analyses that require sequence data from both the recipient and the donor species. However, in many cases, the donor is unknown or the data is not currently available. Here, we introduce a genome-scan method-VolcanoFinder-to detect recent events of adaptive introgression using polymorphism data from the recipient species only. VolcanoFinder detects adaptive introgression sweeps from the pattern of excess intermediate-frequency polymorphism they produce in the flanking region of the genome, a pattern which appears as a volcano-shape in pairwise genetic diversity. Using coalescent theory, we derive analytical predictions for these patterns. Based on these results, we develop a composite-likelihood test to detect signatures of adaptive introgression relative to the genomic background. Simulation results show that VolcanoFinder has high statistical power to detect these signatures, even for older sweeps and for soft sweeps initiated by multiple migrant haplotypes. Finally, we implement VolcanoFinder to detect archaic introgression in European and sub-Saharan African human populations, and uncovered interesting candidates in both populations, such as TSHR in Europeans and TCHH-RPTN in Africans. We discuss their biological implications and provide guidelines for identifying and circumventing artifactual signals during empirical applications of VolcanoFinder

    Demographically explicit scans for barriers to gene flow using gIMble

    Full text link
    Identifying regions of the genome that act as barriers to gene flow between recently diverged taxa has remained challenging given the many evolutionary forces that generate variation in genetic diversity and divergence along the genome, and the stochastic nature of this variation. Progress has been impeded by a conceptual and methodological divide between analyses that infer the demographic history of speciation and genome scans aimed at identifying locally maladaptive alleles i.e. genomic barriers to gene flow. Here we implement genomewide IM blockwise likelihood estimation (), a composite likelihood approach for the quantification of barriers, that bridges this divide. This analytic framework captures background selection and selection against barriers in a model of isolation with migration (IM) as heterogeneity in effective population size (Ne_{e}) and effective migration rate (me_{e}), respectively. Variation in both effective demographic parameters is estimated in sliding windows via pre-computed likelihood grids. includes modules for pre-processing/filtering of genomic data and performing parametric bootstraps using coalescent simulations. To demonstrate the new approach, we analyse data from a well-studied pair of sister species of tropical butterflies with a known history of post-divergence gene flow: Heliconius melpomene and H. cydno. Our analyses uncover both large-effect barrier loci (including well-known wing-pattern genes) and a genome-wide signal of a polygenic barrier architecture

    VolcanoFinder: Genomic scans for adaptive introgression.

    No full text

    Estimating the rates of crossover and gene conversion from individual genomes

    Get PDF
    Recombination can occur either as a result of crossover or gene conversion events. Population genetic methods for inferring the rate of recombination from patterns of linkage disequilibrium generally assume a simple model of recombination that only involves crossover events and ignore gene conversion. However, distinguishing the 2 processes is not only necessary for a complete description of recombination, but also essential for understanding the evolutionary consequences of inversions and other genomic partitions in which crossover (but not gene conversion) is reduced. We present heRho, a simple composite likelihood scheme for coestimating the rate of crossover and gene conversion from individual diploid genomes. The method is based on analytic results for the distance-dependent probability of heterozygous and homozygous states at 2 loci. We apply heRho to simulations and data from the house mouse Mus musculus castaneus, a well-studied model. Our analyses show (1) that the rates of crossover and gene conversion can be accurately coestimated at the level of individual chromosomes and (2) that previous estimates of the population scaled rate of recombination [Formula: see text] under a pure crossover model are likely biased

    Supplementary methods, tables & figures.

    No full text
    The supporting information is organized into three sections: (1) Supplementary methods; gIMble commands used for the Heliconius analysis. (2) Supplementary tables A—C (3) Supplementary Figures A—M. (PDF)</p

    Average heterozygosity (<i>H</i>) across all individuals within each species and divergence (<i>d</i><sub>xy</sub>) between <i>H. melpomene</i> and <i>H. cydno</i>.

    No full text
    Summaries were generated using gIMble info based on intergenic blocks of 64 bases sampled in heterospecific pairs of individuals (top row). Estimates from a similar previous analysis for a single pair of individual genomes and longer blocks of 150 bases [41] (second row) are shown for comparison.</p

    Barriers to gene flow between <i>H. melpomene</i> and <i>H. cydno</i> inferred using gIMble.

    No full text
    The me threshold used to diagnose barriers is relaxed from top (ΔB,0) to bottom (ΔB,0.2). Regions above the “sea-level” of ΔB = 0 fit a history of reduced me better than a model assuming the global estimate . Windows with ΔB > 0 have been coloured to reflect their expected FPR. The ΔB threshold corresponding to a FPR of 0.05 is shown in grey.</p
    corecore