25 research outputs found

    F<sub>ST</sub> vs. genetic similarity in various population pairs.

    No full text
    <p>Pairwise distances are colored red or blue within populations and black between populations. (A) Even at a relatively low F<sub>ST</sub> of 0.02 all within-population pairs among the Uygur and Adygei samples are genetically more similar than all the between-population pairs. (B) Separation is more ambiguous among Native Americans. Despite a relatively high F<sub>ST</sub> of 0.09, there is substantial overlap between Maya-Maya (red) and Maya-Surui (black) samples. E<sub>ST</sub> values are more consistent with the within- vs.-between population overlap and the dissimilarity fraction (Ο‰).</p

    Scatter plots indicating a positive correlation for <i>SD</i><sub>T</sub> and <i>F</i><sub>ST</sub>.

    No full text
    <p>Each dot represents the two statistics computed for data sampled from our population model with 1000 SNPs and allele frequencies from Beta distributions. The Pearson product-moment correlation coefficient of F<sub>ST</sub> and SD<sub>T</sub> is 0.67 for plot (A) with panmictic populations, 0.38 for plot (B) with slightly varying structure in subpopulations, 0.14 for plot (C) with highly varying structure in subpopulations, and for 0.94 for plot (D) with three panmictic populations.</p

    On the Apportionment of Population Structure

    No full text
    <div><p>Measures of population differentiation, such as F<sub>ST</sub>, are traditionally derived from the partition of diversity within and between populations. However, the emergence of population clusters from multilocus analysis is a function of genetic <i>structure</i> (departures from panmixia) rather than of diversity. If the populations are close to panmixia, slight differences between the mean pairwise distance within and between populations (low F<sub>ST</sub>) can manifest as strong separation between the populations, thus population clusters are often evident even when the vast majority of diversity is partitioned within populations rather than between them. For any given F<sub>ST</sub> value, clusters can be tighter (more panmictic) or looser (more stratified), and in this respect higher F<sub>ST</sub> does not always imply stronger differentiation. In this study we propose a measure for the partition of structure, denoted E<sub>ST</sub>, which is more consistent with results from clustering schemes. Crucially, our measure is based on a statistic of the data that is a good measure of internal structure, mimicking the information extracted by unsupervised clustering or dimensionality reduction schemes. To assess the utility of our metric, we ranked various human (HGDP) population pairs based on F<sub>ST</sub> and E<sub>ST</sub> and found substantial differences in ranking order. E<sub>ST</sub> ranking seems more consistent with population clustering and classification and possibly with geographic distance between populations. Thus, E<sub>ST</sub> may at times outperform F<sub>ST</sub> in identifying evolutionary significant differentiation.</p></div

    A simulation of <i>SD</i><sub>T</sub> and <i>SD</i><sub>S</sub> under a two panmictic population model demonstrating the divergent behavior of these two statistics with an increasing number of SNP loci.

    No full text
    <p>SNP frequencies are modeled on <i>Beta</i> distributions (as in [<a href="http://www.plosone.org/article/info:doi/10.1371/journal.pone.0160413#pone.0160413.ref005" target="_blank">5</a>]). (A) with <i>F</i><sub>ST</sub> = 0.10. (B) with <i>F</i><sub>ST</sub> = 0.03.</p

    E<sub>ST</sub> as a function of SNP sample size.

    No full text
    <p>E<sub>ST</sub> was estimated in various population pairs with gradually increasing SNP sample size from 10 to 660,755. As expected, E<sub>ST</sub> initially rises rather steeply, but tends to plateau before reaching the 660,755 SNP point. This suggests that we are approaching the maximal resolving power of genetic markers in this dataset, and adding markers beyond this point should not have a significant effect on -cluster separation and E<sub>ST</sub>.</p

    F<sub>ST</sub> and E<sub>ST</sub> vs. Clustering with increasing SNP count.

    No full text
    <p>Multidimensional scaling (MDS) plots with Russian (n = 25) and Chinese (n = 34) samples with increasing SNP count from top to bottom (10, 100, 1000, 10,000, and 660,755 SNPs). Two clusters gradually emerge as SNP count increases, along with an increase in E<sub>ST</sub>, while F<sub>ST</sub> remains relatively constant.</p

    Positive correlation between F<sub>ST</sub> and E<sub>ST</sub>.

    No full text
    <p>R = 0.61. Note that E<sub>ST</sub> has a much broader range, spanning nearly the entire 0–1 interval while F<sub>ST</sub> only goes as high as 0.2 in these HGDP populations.</p

    Pairwise F<sub>ST</sub> (above diagonal) and E<sub>ST</sub> (below diagonal) in 5 New World and 5 Old World HGDP populations.

    No full text
    <p>Pairwise F<sub>ST</sub> (above diagonal) and E<sub>ST</sub> (below diagonal) in 5 New World and 5 Old World HGDP populations.</p

    Zooming into a Russian (n = 25) and Chinese (n = 34) Neighbor Joining tree of individual similarities.

    No full text
    <p>(A) The length of the red branch compared to the overall tree length is a rough proxy to F<sub>ST</sub>. (B) 10x magnification highlights the structure within and between populations. The blue branches inversely associate with E<sub>ST</sub> values. (C) 100x magnification reveals fine substructure within Russian samples. Individual branches (black) were removed in B and C for clarity.</p
    corecore