33,581 research outputs found
Traveling waves of selective sweeps
The goal of cancer genome sequencing projects is to determine the genetic
alterations that cause common cancers. Many malignancies arise during the
clonal expansion of a benign tumor which motivates the study of recurrent
selective sweeps in an exponentially growing population. To better understand
this process, Beerenwinkel et al. [PLoS Comput. Biol. 3 (2007) 2239--2246]
consider a Wright--Fisher model in which cells from an exponentially growing
population accumulate advantageous mutations. Simulations show a traveling wave
in which the time of the first -fold mutant, , is approximately linear
in and heuristics are used to obtain formulas for . Here, we consider
the analogous problem for the Moran model and prove that as the mutation rate
, , where the can be computed
explicitly. In addition, we derive a limiting result on a log scale for the
size of the number of cells with mutations at time .Comment: Published in at http://dx.doi.org/10.1214/10-AAP721 the Annals of
Applied Probability (http://www.imstat.org/aap/) by the Institute of
Mathematical Statistics (http://www.imstat.org
Fixation Probability for Competing Selective Sweeps
We consider a biological population in which a beneficial mutation is
undergoing a selective sweep when a second beneficial mutation arises at a
linked locus and we investigate the probability that both mutations will
eventually fix in the population. Previous work has dealt with the case where
the second mutation to arise confers a smaller benefit than the first. In that
case population size plays almost no role. Here we consider the opposite case
and observe that, by contrast, the probability of both mutations fixing can be
heavily dependent on population size. Indeed the key parameter is , the
product of the population size and the recombination rate between the two
selected loci. If is small, the probability that both mutations fix
can be reduced through interference to almost zero while for large the
mutations barely influence one another. The main rigorous result is a method
for calculating the fixation probability of a double mutant in the large
population limit.Comment: 33 pages, 3 figure
Genomewide Spatial Correspondence Between Nonsynonymous Divergence and Neutral Polymorphism Reveals Extensive Adaptation in Drosophila
The effect of recurrent selective sweeps is a spatially heterogeneous reduction in neutral polymorphism throughout the genome. The pattern of reduction depends on the selective advantage and recurrence rate of the sweeps. Because many adaptive substitutions responsible for these sweeps also contribute to nonsynonymous divergence, the spatial distribution of nonsynonymous divergence also reflects the distribution of adaptive substitutions. Thus, the spatial correspondence between neutral polymorphism and nonsynonymous divergence may be especially informative about the process of adaptation. Here we study this correspondence using genomewide polymorphism data from Drosophila simulans and the divergence between D. simulans and D. melanogaster. Focusing on highly recombining portions of the autosomes, at a spatial scale appropriate to the study of selective sweeps, we find that neutral polymorphism is both lower and, as measured by a new statistic QS, less homogeneous where nonsynonymous divergence is higher and that the spatial structure of this correlation is best explained by the action of strong recurrent selective sweeps. We introduce a method to infer, from the spatial correspondence between polymorphism and divergence, the rate and selective strength of adaptation. Our results independently confirm a high rate of adaptive substitution (∼1/3000 generations) and newly suggest that many adaptations are of surprisingly great selective effect (∼1%), reducing the effective population size by ∼15% even in highly recombining regions of the genome
Selection and gene flow shape genomic islands that control floral guides
Genomes of closely-related species or populations often display localized regions of enhanced relative sequence divergence, termed genomic islands. It has been proposed that these islands arise through selective sweeps and/or barriers to gene flow. Here, we genetically dissect a genomic island that controls flower color pattern differences between two subspecies of Antirrhinum majus, A.m.striatum and A.m.pseudomajus, and relate it to clinal variation across a natural hybrid zone. We show that selective sweeps likely raised relative divergence at two tightly-linked MYB-like transcription factors, leading to distinct flower patterns in the two subspecies. The two patterns provide alternate floral guides and create a strong barrier to gene flow where populations come into contact. This barrier affects the selected flower color genes and tightlylinked loci, but does not extend outside of this domain, allowing gene flow to lower relative divergence for the rest of the chromosome. Thus, both selective sweeps and barriers to gene flow play a role in shaping genomic islands: sweeps cause elevation in relative divergence, while heterogeneous gene flow flattens the surrounding "sea," making the island of divergence stand out. By showing how selective sweeps establish alternative adaptive phenotypes that lead to barriers to gene flow, our study sheds light on possible mechanisms leading to reproductive isolation and speciation
Discovery of Ongoing Selective Sweeps within Anopheles Mosquito Populations Using Deep Learning
Identification of partial sweeps, which include both hard and soft sweeps that have not currently reached fixation, provides crucial information about ongoing evolutionary responses. To this end, we introduce partialS/HIC, a deep learning method to discover selective sweeps from population genomic data. partialS/HIC uses a convolutional neural network for image processing, which is trained with a large suite of summary statistics derived from coalescent simulations incorporating population-specific history, to distinguish between completed versus partial sweeps, hard versus soft sweeps, and regions directly affected by selection versus those merely linked to nearby selective sweeps. We perform several simulation experiments under various demographic scenarios to demonstrate partialS/HIC's performance, which exhibits excellent resolution for detecting partial sweeps. We also apply our classifier to whole genomes from eight mosquito populations sampled across sub-Saharan Africa by the Anopheles gambiae 1000 Genomes Consortium, elucidating both continent-wide patterns as well as sweeps unique to specific geographic regions. These populations have experienced intense insecticide exposure over the past two decades, and we observe a strong overrepresentation of sweeps at insecticide resistance loci. Our analysis thus provides a list of candidate adaptive loci that may be relevant to mosquito control efforts. More broadly, our supervised machine learning approach introduces a method to distinguish between completed and partial sweeps, as well as between hard and soft sweeps, under a variety of demographic scenarios. As whole-genome data rapidly accumulate for a greater diversity of organisms, partialS/HIC addresses an increasing demand for useful selection scan tools that can track in-progress evolutionary dynamics.</p
Selective Sweeps in Growing Microbial Colonies
Evolutionary experiments with microbes are a powerful tool to study mutations and natural selection. These experiments, however, are often limited to the well-mixed environments of a test tube or a chemostat. Since spatial organization can significantly affect evolutionary dynamics, the need is growing for evolutionary experiments in spatially structured environments. The surface of a Petri dish provides such an environment, but a more detailed understanding of microbial growth on Petri dishes is necessary to interpret such experiments. We formulate a simple deterministic reaction-diffusion model, which successfully predicts the spatial patterns created by two competing species during colony expansion. We also derive the shape of these patterns analytically without relying on microscopic details of the model. In particular, we find that the relative fitness of two microbial strains can be estimated from the logarithmic spirals created by selective sweeps. The theory is tested with strains of the budding yeast Saccharomyces cerevisiae, for spatial competitions with different initial conditions and for a range of relative fitnesses. The reaction-diffusion model also connects the microscopic parameters like growth rates and diffusion constants with macroscopic spatial patterns and predicts the relationship between fitness in liquid cultures and on Petri dishes, which we confirmed experimentally. Spatial sector patterns therefore provide an alternative fitness assay to the commonly used liquid culture fitness assays.Molecular and Cellular Biolog
Discovery of Ongoing Selective Sweeps within Anopheles Mosquito Populations Using Deep Learning
Publisher Copyright: © 2020 The Author(s) 2020. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.Identification of partial sweeps, which include both hard and soft sweeps that have not currently reached fixation, provides crucial information about ongoing evolutionary responses. To this end, we introduce partialS/HIC, a deep learning method to discover selective sweeps from population genomic data. partialS/HIC uses a convolutional neural network for image processing, which is trained with a large suite of summary statistics derived from coalescent simulations incorporating population-specific history, to distinguish between completed versus partial sweeps, hard versus soft sweeps, and regions directly affected by selection versus those merely linked to nearby selective sweeps. We perform several simulation experiments under various demographic scenarios to demonstrate partialS/HIC's performance, which exhibits excellent resolution for detecting partial sweeps. We also apply our classifier to whole genomes from eight mosquito populations sampled across sub-Saharan Africa by the Anopheles gambiae 1000 Genomes Consortium, elucidating both continent-wide patterns as well as sweeps unique to specific geographic regions. These populations have experienced intense insecticide exposure over the past two decades, and we observe a strong overrepresentation of sweeps at insecticide resistance loci. Our analysis thus provides a list of candidate adaptive loci that may be relevant to mosquito control efforts. More broadly, our supervised machine learning approach introduces a method to distinguish between completed and partial sweeps, as well as between hard and soft sweeps, under a variety of demographic scenarios. As whole-genome data rapidly accumulate for a greater diversity of organisms, partialS/HIC addresses an increasing demand for useful selection scan tools that can track in-progress evolutionary dynamics.publishersversionpublishe
A genomic map of the effects of linked selection in Drosophila
Natural selection at one site shapes patterns of genetic variation at linked
sites. Quantifying the effects of 'linked selection' on levels of genetic
diversity is key to making reliable inference about demography, building a null
model in scans for targets of adaptation, and learning about the dynamics of
natural selection. Here, we introduce the first method that jointly infers
parameters of distinct modes of linked selection, notably background selection
and selective sweeps, from genome-wide diversity data, functional annotations
and genetic maps. The central idea is to calculate the probability that a
neutral site is polymorphic given local annotations, substitution patterns, and
recombination rates. Information is then combined across sites and samples
using composite likelihood in order to estimate genome-wide parameters of
distinct modes of selection. In addition to parameter estimation, this approach
yields a map of the expected neutral diversity levels along the genome. To
illustrate the utility of our approach, we apply it to genome-wide resequencing
data from 125 lines in Drosophila melanogaster and reliably predict diversity
levels at the 1Mb scale. Our results corroborate estimates of a high fraction
of beneficial substitutions in proteins and untranslated regions (UTR). They
allow us to distinguish between the contribution of sweeps and other modes of
selection around amino acid substitutions and to uncover evidence for pervasive
sweeps in untranslated regions (UTRs). Our inference further suggests a
substantial effect of linked selection from non-classic sweeps. More generally,
we demonstrate that linked selection has had a larger effect in reducing
diversity levels and increasing their variance in D. melanogaster than
previously appreciated
- …