2,651 research outputs found

    Inference of population splits and mixtures from genome-wide allele frequency data

    Many aspects of the historical relationships between populations in a species are reflected in genetic data. Inferring these relationships from genetic data, however, remains a challenging task. In this paper, we present a statistical model for inferring the patterns of population splits and mixtures in multiple populations. In this model, the sampled populations in a species are related to their common ancestor through a graph of ancestral populations. Using genome-wide allele frequency data and a Gaussian approximation to genetic drift, we infer the structure of this graph. We applied this method to a set of 55 human populations and a set of 82 dog breeds and wild canids. In both species, we show that a simple bifurcating tree does not fully describe the data; in contrast, we infer many migration events. While some of the migration events that we find have been detected previously, many have not. For example, in the human data we infer that Cambodians trace approximately 16% of their ancestry to a population ancestral to other extant East Asian populations. In the dog data, we infer that both the boxer and basenji trace a considerable fraction of their ancestry (9% and 25%, respectively) to wolves subsequent to domestication, and that East Asian toy breeds (the Shih Tzu and the Pekingese) result from admixture between modern toy breeds and "ancient" Asian breeds. Software implementing the model described here, called TreeMix, is available at http://treemix.googlecode.com. Comment: 28 pages, 6 figures in main text. Attached supplement is 22 pages, 15 figures. This is an updated version of the preprint available at http://precedings.nature.com/documents/6956/version/

    Scaling properties of protein family phylogenies

    One of the classical questions in evolutionary biology is how evolutionary processes are coupled at the gene and species level. With this motivation, we compare the topological properties (mainly the depth scaling, as a characterization of balance) of a large set of protein phylogenies with a set of species phylogenies. The comparative analysis shows that both sets of phylogenies share remarkably similar scaling behavior, suggesting the universality of branching rules and of the evolutionary processes that drive biological diversification from gene to species level. In order to explain such generality, we propose a simple model which allows us to estimate the proportion of evolvability/robustness needed to approximate the scaling behavior observed in the phylogenies, highlighting the relevance of the robustness of a biological system (species or protein) in the scaling properties of the phylogenetic trees. Thus, the rules that govern the incapability of a biological system to diversify are equally relevant both at the gene and at the species level. Comment: Replaced with final published version

    Universal scaling in the branching of the Tree of Life

    Understanding the patterns and processes of diversification of life on the planet is a key challenge of science. The Tree of Life represents such diversification processes through the evolutionary relationships among the different taxa, and can be extended down to intra-specific relationships. Here we examine the topological properties of a large set of interspecific and intraspecific phylogenies and show that the branching patterns follow allometric rules conserved across the different levels in the Tree of Life, all significantly departing from those expected from the standard null models. The finding of non-random universal patterns of phylogenetic differentiation suggests that similar evolutionary forces drive diversification across the broad range of scales, from macro-evolutionary to micro-evolutionary processes, shaping the diversity of life on the planet. Comment: 6 pages + 19 of Supporting Information

    Environmental variables, habitat discontinuity and life history shaping the genetic structure of Pomatoschistus marmoratus

    Coastal lagoons are semi-isolated ecosystems exposed to wide fluctuations of environmental conditions and showing habitat fragmentation. These features may play an important role in separating species into different populations, even at small spatial scales. In this study, we evaluate the concordance between mitochondrial (previously published data) and nuclear data by analyzing the genetic variability of Pomatoschistus marmoratus in five localities, inside and outside the Mar Menor coastal lagoon (SE Spain), using eight microsatellites. High genetic diversity and similar levels of allele richness were observed across all loci and localities, although significant genic and genotypic differentiation was found between populations inside and outside the lagoon. In contrast to the FST values obtained from previous mitochondrial DNA analyses (control region), the microsatellite data exhibited significant differentiation among samples inside the Mar Menor and between lagoonal and marine samples. This pattern was corroborated using Cavalli-Sforza genetic distances. The habitat fragmentation inside the coastal lagoon and among lagoon and marine localities could be acting as a barrier to gene flow and contributing to the observed genetic structure. Our results from generalized additive models point to a significant link between extreme lagoonal environmental conditions (mainly maximum salinity) and P. marmoratus genetic composition. These environmental features could therefore also be acting on the genetic structure of coastal lagoon populations of P. marmoratus, favoring their genetic divergence. The mating strategy of P. marmoratus could also be influencing the results obtained from mitochondrial and nuclear DNA. Therefore, special consideration must be given to the selection of DNA markers depending on the reproductive strategy of the species.

    Genetic Diversity and Linkage Disequilibrium in Chinese Bread Wheat (Triticum aestivum L.) Revealed by SSR Markers

    Two hundred and fifty bread wheat lines, mainly Chinese mini core accessions, were assayed for polymorphism and linkage disequilibrium (LD) based on 512 whole-genome microsatellite loci representing a mean marker density of 5.1 cM. A total of 6,724 alleles ranging from 1 to 49 per locus were identified in all collections. The mean PIC value was 0.650, ranging from 0 to 0.965. Population structure and principal coordinate analysis revealed that landraces and modern varieties were two relatively independent genetic sub-groups. Landraces had a higher allelic diversity than modern varieties with respect to both genomes and chromosomes in terms of total number of alleles and allelic richness. 3,833 (57.0%) and 2,788 (41.5%) rare alleles with frequencies of <5% were found in the landrace and modern variety gene pools, respectively, indicating greater numbers of rare variants, or likely new alleles, in landraces. Analysis of molecular variance (AMOVA) showed that the A genome had the largest genetic differentiation and the D genome the lowest. In contrast to genetic diversity, modern varieties displayed a wider average LD decay across the whole genome for locus pairs with r²>0.05 (P<0.001) than the landraces. Mean LD decay distance for the landraces at the whole genome level was <5 cM, while modern varieties showed a higher LD decay distance of 5–10 cM. LD decay distances were also somewhat different for each of the 21 chromosomes, being higher for most of the chromosomes in modern varieties (<5∼25 cM) compared to landraces (<5∼15 cM), presumably indicating the influences of domestication and breeding. This study facilitates predicting the marker density required to effectively associate genotypes with traits in Chinese wheat genetic resources.
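
The abstract reports PIC (polymorphism information content) values per locus but does not state the formula. A minimal sketch, assuming the commonly used Botstein-style definition PIC = 1 - Σ p_i² - Σ_{i<j} 2 p_i² p_j², is shown below. The function name `pic` and the example frequencies are illustrative and not taken from the study.

```python
from itertools import combinations


def pic(allele_freqs):
    """Polymorphism information content for a single locus.

    Assumes the Botstein-style definition; the abstract does not say
    which variant the authors used. Frequencies are renormalised so
    that raw allele counts can be passed in as well.
    """
    total = sum(allele_freqs)
    if total <= 0:
        raise ValueError("allele frequencies must sum to a positive value")
    p = [f / total for f in allele_freqs]
    # Expected homozygosity: sum of squared allele frequencies.
    homozygosity = sum(x * x for x in p)
    # Correction for matings between identical heterozygotes.
    cross = sum(2 * (a * a) * (b * b) for a, b in combinations(p, 2))
    return 1.0 - homozygosity - cross


if __name__ == "__main__":
    print(pic([1.0]))        # monomorphic locus
    print(pic([0.5, 0.5]))   # two equally frequent alleles
```

A monomorphic locus gives 0 under this definition, which is consistent with the lower end of the reported range.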

    Identifying Selected Regions from Heterozygosity and Divergence Using a Light-Coverage Genomic Dataset from Two Human Populations

    When a selective sweep occurs in the chromosomal region around a target gene in two populations that have recently separated, it produces three dramatic genomic consequences: 1) decreased multi-locus heterozygosity in the region; 2) elevated or diminished genetic divergence (FST) of multiple polymorphic variants adjacent to the selected locus between the divergent populations, due to the alternative fixation of alleles; and 3) a consequent regional increase in the variance of FST (S2FST) for the same clustered variants, due to the increased alternative fixation of alleles in the loci surrounding the selection target. In the first part of our study, to search for potential targets of directional selection, we developed and validated a resampling-based computational approach; we then scanned an array of 31 different-sized moving windows of SNP variants (5–65 SNPs) across the human genome in a set of European and African American population samples with 183,997 SNP loci after correcting for the recombination rate variation. The analysis revealed 180 regions of recent selection with very strong evidence in either population or both. In the second part of our study, we compared the newly discovered putative regions to those sites previously postulated in the literature, using methods based on inspecting patterns of linkage disequilibrium, population divergence and other methodologies. The newly found regions were cross-validated with those found in nine other studies that have searched for selection signals. Our study was replicated especially well in those regions confirmed by three or more studies. These validated regions were independently verified, using a combination of different methods and different databases in other studies, and should include fewer false positives. The main strength of our analysis method compared to others is that it does not require dense genotyping and therefore can be used with data from population-based genome SNP scans from smaller studies of humans or other species.
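
The moving-window idea in this abstract (per-SNP FST, then its mean and variance over a window of adjacent SNPs) can be sketched as follows. This is a simplified illustration, not the authors' pipeline: it assumes a basic two-population FST of the form (H_T - H_S) / H_T with equal weighting of the populations, and it omits the heterozygosity scan, the resampling step and the recombination-rate correction. The names `fst_two_pops` and `window_stats` are hypothetical.

```python
from statistics import pvariance


def fst_two_pops(p1, p2):
    """Simple two-population FST for one biallelic SNP.

    p1, p2 are the frequencies of the same allele in each population.
    Uses (H_T - H_S) / H_T with equal population weights (an assumption).
    """
    p_bar = (p1 + p2) / 2
    h_t = 2 * p_bar * (1 - p_bar)          # total expected heterozygosity
    if h_t == 0:
        return 0.0                          # SNP fixed for the same allele in both
    h_s = (2 * p1 * (1 - p1) + 2 * p2 * (1 - p2)) / 2  # mean within-population
    return (h_t - h_s) / h_t


def window_stats(freqs_a, freqs_b, size):
    """Slide a window of `size` adjacent SNPs; return (start, mean FST, variance of FST)."""
    fst = [fst_two_pops(a, b) for a, b in zip(freqs_a, freqs_b)]
    results = []
    for start in range(len(fst) - size + 1):
        window = fst[start:start + size]
        results.append((start, sum(window) / size, pvariance(window)))
    return results


if __name__ == "__main__":
    pop_a = [0.10, 0.50, 0.95, 0.90, 0.40]   # made-up allele frequencies
    pop_b = [0.12, 0.45, 0.05, 0.10, 0.42]
    for start, mean_fst, var_fst in window_stats(pop_a, pop_b, size=3):
        print(start, round(mean_fst, 3), round(var_fst, 3))
```

Windows with unusually high variance of FST would be candidates for closer inspection under the abstract's third criterion; deciding what counts as unusual is where the resampling approach would come in.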

    Extended Haplotypes in the Growth Hormone Releasing Hormone Receptor Gene (GHRHR) Are Associated with Normal Variation in Height

    Mutations in the gene for growth hormone releasing hormone receptor (GHRHR) cause isolated growth hormone deficiency (IGHD), but this gene has not been found to affect normal variation in height. We performed a whole genome linkage analysis for height in a population from northern Sweden and identified a region on chromosome 7 with a lod-score of 4.7. The GHRHR gene is located in this region, and typing of tagSNPs identified a haplotype that is associated with height (p = 0.00077) in the original study population. Analysis of a sample from an independent population from the most northern part of Sweden also showed an association with height (p = 0.0039) but with another haplotype in the GHRHR gene. Both haplotypes span the 3′ part of the GHRHR gene, including the region in which most of the mutations in IGHD have been located. The effect sizes of these haplotypes are larger than that of any gene previously associated with height, which indicates that GHRHR might be one of the most important genes so far identified affecting normal variation in human height.

    Studying the Underlying Event in Drell-Yan and High Transverse Momentum Jet Production at the Tevatron

    We study the underlying event in proton-antiproton collisions by examining the behavior of charged particles (transverse momentum pT > 0.5 GeV/c, pseudorapidity |\eta| < 1) produced in association with large transverse momentum jets (~2.2 fb-1) or with Drell-Yan lepton-pairs (~2.7 fb-1) in the Z-boson mass region (70 < M(pair) < 110 GeV/c2) as measured by CDF at 1.96 TeV center-of-mass energy. We use the direction of the lepton-pair (in Drell-Yan production) or the leading jet (in high-pT jet production) in each event to define three regions of \eta-\phi space: toward, away, and transverse, where \phi is the azimuthal scattering angle. For Drell-Yan production (excluding the leptons) both the toward and transverse regions are very sensitive to the underlying event. In high-pT jet production the transverse region is very sensitive to the underlying event and is separated into a MAX and MIN transverse region, which helps separate the hard component (initial and final-state radiation) from the beam-beam remnant and multiple parton interaction components of the scattering. The data are corrected to the particle level to remove detector effects and are then compared with several QCD Monte-Carlo models. The goal of this analysis is to provide data that can be used to test and improve the QCD Monte-Carlo models of the underlying event that are used to simulate hadron-hadron collisions. Comment: Submitted to Phys.Rev.

    Forward-Backward Asymmetry in Top Quark Production in ppbar Collisions at sqrt{s}=1.96 TeV

    Reconstructable final state kinematics and charge assignment in the reaction ppbar->ttbar allow tests of discrete strong interaction symmetries at high energy. We define frame dependent forward-backward asymmetries for the outgoing top quark in both the ppbar and ttbar rest frames, correct for experimental distortions, and derive values at the parton-level. Using 1.9/fb of ppbar collisions at sqrt{s}=1.96 TeV recorded with the CDF II detector at the Fermilab Tevatron, we measure forward-backward top quark production asymmetries in the ppbar and ttbar rest frames of A_{FB,pp} = 0.17 +- 0.08 and A_{FB,tt} = 0.24 +- 0.14. Comment: 7 pages, 2 figures, submitted to Phys.Rev.Lett, corrected references and change of text
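
For orientation, a forward-backward asymmetry is commonly formed from event counts as A_FB = (N_F - N_B) / (N_F + N_B). The sketch below computes this raw counting asymmetry with a simple binomial uncertainty. It is an illustration only: the abstract's quoted values are corrected to the parton level, and the background subtraction, acceptance and reconstruction corrections involved are not reproduced here. The function name and the example counts are hypothetical.

```python
import math


def forward_backward_asymmetry(n_forward, n_backward):
    """Raw counting asymmetry and its binomial statistical uncertainty.

    n_forward / n_backward are event counts in the forward and backward
    hemispheres of whichever frame is chosen (the hemisphere definition
    is left to the caller and is an assumption of this sketch).
    """
    n_total = n_forward + n_backward
    if n_total <= 0:
        raise ValueError("need at least one event")
    asym = (n_forward - n_backward) / n_total
    # Binomial error on the asymmetry: sqrt((1 - A^2) / N).
    sigma = math.sqrt((1.0 - asym * asym) / n_total)
    return asym, sigma


if __name__ == "__main__":
    a_fb, err = forward_backward_asymmetry(60, 40)  # made-up counts
    print(f"A_FB = {a_fb:.3f} +- {err:.3f}")
```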

    Measurement of the Forward-Backward Asymmetry in the B -> K(*) mu+ mu- Decay and First Observation of the Bs -> phi mu+ mu- Decay

    We reconstruct the rare decays $B^+ \to K^+\mu^+\mu^-$, $B^0 \to K^{*}(892)^0\mu^+\mu^-$, and $B^0_s \to \phi(1020)\mu^+\mu^-$ in a data sample corresponding to $4.4 {\rm fb^{-1}}$ collected in $p\bar{p}$ collisions at $\sqrt{s}=1.96 {\rm TeV}$ by the CDF II detector at the Fermilab Tevatron Collider. Using $121 \pm 16$ $B^+ \to K^+\mu^+\mu^-$ and $101 \pm 12$ $B^0 \to K^{*0}\mu^+\mu^-$ decays we report the branching ratios. In addition, we report the measurement of the differential branching ratio and the muon forward-backward asymmetry in the $B^+$ and $B^0$ decay modes, and the $K^{*0}$ longitudinal polarization in the $B^0$ decay mode with respect to the squared dimuon mass. These are consistent with the theoretical prediction from the standard model, and most recent determinations from other experiments and of comparable accuracy. We also report the first observation of the $B^0_s \to \phi\mu^+\mu^-$ decay and measure its branching ratio ${\mathcal{B}}(B^0_s \to \phi\mu^+\mu^-) = [1.44 \pm 0.33 \pm 0.46] \times 10^{-6}$ using $27 \pm 6$ signal events. This is currently the most rare $B^0_s$ decay observed. Comment: 7 pages, 2 figures, 3 tables. Submitted to Phys. Rev. Lett.