499 research outputs found

    The Genetic Structure and History of Africans and African Americans.

    Get PDF
    Africa is the source of all modern humans, but characterization of genetic variation and of relationships among populations across the continent has been enigmatic. We studied 121 African populations, four African American populations, and 60 non-African populations for patterns of variation at 1327 nuclear microsatellite and insertion/deletion markers. We identified 14 ancestral population clusters in Africa that correlate with self-described ethnicity and shared cultural and/or linguistic properties. We observed high levels of mixed ancestry in most populations, reflecting historical migration events across the continent. Our data also provide evidence for shared ancestry among geographically diverse hunter-gatherer populations (Khoesan speakers and Pygmies). The ancestry of African Americans is predominantly from Niger-Kordofanian (approximately 71%), European (approximately 13%), and other African (approximately 8%) populations, although admixture levels varied considerably among individuals. This study helps tease apart the complex evolutionary history of Africans and African Americans, aiding both anthropological and genetic epidemiologic studies

    Statistical properties of genealogical trees

    Get PDF
    We analyse the statistical properties of genealogical trees in a neutral model of a closed population with sexual reproduction and non-overlapping generations. By reconstructing the genealogy of an individual from the population evolution, we measure the distribution of ancestors appearing more than once in a given tree. After a transient time, the probability of repetition follows, up to a rescaling, a stationary distribution which we calculate both numerically and analytically. This distribution exhibits a universal shape with a non-trivial power law which can be understood by an exact, though simple, renormalization calculation. Some real data on human genealogy illustrate the problem, which is relevant to the study of the real degree of diversity in closed interbreeding communities.Comment: Accepted for publication in Phys. Rev. Let

    Identifying Selected Regions from Heterozygosity and Divergence Using a Light-Coverage Genomic Dataset from Two Human Populations

    Get PDF
    When a selective sweep occurs in the chromosomal region around a target gene in two populations that have recently separated, it produces three dramatic genomic consequences: 1) decreased multi-locus heterozygosity in the region; 2) elevated or diminished genetic divergence (FST) of multiple polymorphic variants adjacent to the selected locus between the divergent populations, due to the alternative fixation of alleles; and 3) a consequent regional increase in the variance of FST (S2FST) for the same clustered variants, due to the increased alternative fixation of alleles in the loci surrounding the selection target. In the first part of our study, to search for potential targets of directional selection, we developed and validated a resampling-based computational approach; we then scanned an array of 31 different-sized moving windows of SNP variants (5–65 SNPs) across the human genome in a set of European and African American population samples with 183,997 SNP loci after correcting for the recombination rate variation. The analysis revealed 180 regions of recent selection with very strong evidence in either population or both. In the second part of our study, we compared the newly discovered putative regions to those sites previously postulated in the literature, using methods based on inspecting patterns of linkage disequilibrium, population divergence and other methodologies. The newly found regions were cross-validated with those found in nine other studies that have searched for selection signals. Our study was replicated especially well in those regions confirmed by three or more studies. These validated regions were independently verified, using a combination of different methods and different databases in other studies, and should include fewer false positives. The main strength of our analysis method compared to others is that it does not require dense genotyping and therefore can be used with data from population-based genome SNP scans from smaller studies of humans or other species

    No Evidence for Strong Recent Positive Selection Favoring the 7 Repeat Allele of VNTR in the DRD4 Gene

    Get PDF
    The human dopamine receptor D4 (DRD4) gene contains a 48-bp variable number of tandem repeat (VNTR) in exon 3, encoding the third intracellular loop of this dopamine receptor. The DRD4 7R allele, which seems to have a single origin, is commonly observed in various human populations and the nucleotide diversity of the DRD4 7R haplotype at the DRD4 locus is reduced compared to the most common DRD4 4R haplotype. Based on these observations, previous studies have hypothesized that positive selection has acted on the DRD4 7R allele. However, the degrees of linkage disequilibrium (LD) of the DRD4 7R allele with single nucleotide polymorphisms (SNPs) outside the DRD4 locus have not been evaluated. In this study, to re-examine the possibility of recent positive selection favoring the DRD4 7R allele, we genotyped HapMap subjects for DRD4 VNTR, and conducted several neutrality tests including long range haplotype test and iHS test based on the extended haplotype homozygosity. Our results indicated that LD of the DRD4 7R allele was not extended compared to SNP alleles with the similar frequency. Thus, we conclude that the DRD4 7R allele has not been subjected to strong recent positive selection

    Genetics and geography of leukocyte telomere length in sub-Saharan Africans

    Get PDF
    Leukocyte telomere length (LTL) might be causal in cardiovascular disease and major cancers. To elucidate the roles of genetics and geography in LTL variability across humans, we compared LTL measured in 1295 sub-Saharan Africans (SSAs) with 559 African-Americans (AAms) and 2464 European-Americans (EAms). LTL differed significantly across SSAs (P = 0.003), with the San from Botswana (with the oldest genomic ancestry) having the longest LTL and populations from Ethiopia having the shortest LTL. SSAs had significantly longer LTL than AAms [P = 6.5(e-16)] whose LTL was significantly longer than EAms [P = 2.5(e-7)]. Genetic variation in SSAs explained 52% of LTL variance versus 27% in AAms and 34% in EAms. Adjustment for genetic variation removed the LTL differences among SSAs. LTL genetic variation among SSAs, with the longest LTL in the San, supports the hypothesis that longer LTL was ancestral in humans. Identifying factors driving LTL variation in Africa may have important ramifications for LTL-associated diseases

    No Evidence for Strong Recent Positive Selection Favoring the 7 Repeat Allele of VNTR in the DRD4 Gene

    Get PDF
    The human dopamine receptor D4 (DRD4) gene contains a 48-bp variable number of tandem repeat (VNTR) in exon 3, encoding the third intracellular loop of this dopamine receptor. The DRD4 7R allele, which seems to have a single origin, is commonly observed in various human populations and the nucleotide diversity of the DRD4 7R haplotype at the DRD4 locus is reduced compared to the most common DRD4 4R haplotype. Based on these observations, previous studies have hypothesized that positive selection has acted on the DRD4 7R allele. However, the degrees of linkage disequilibrium (LD) of the DRD4 7R allele with single nucleotide polymorphisms (SNPs) outside the DRD4 locus have not been evaluated. In this study, to re-examine the possibility of recent positive selection favoring the DRD4 7R allele, we genotyped HapMap subjects for DRD4 VNTR, and conducted several neutrality tests including long range haplotype test and iHS test based on the extended haplotype homozygosity. Our results indicated that LD of the DRD4 7R allele was not extended compared to SNP alleles with the similar frequency. Thus, we conclude that the DRD4 7R allele has not been subjected to strong recent positive selection

    Cryptic Distant Relatives Are Common in Both Isolated and Cosmopolitan Genetic Samples

    Get PDF
    Although a few hundred single nucleotide polymorphisms (SNPs) suffice to infer close familial relationships, high density genome-wide SNP data make possible the inference of more distant relationships such as 2nd to 9th cousinships. In order to characterize the relationship between genetic similarity and degree of kinship given a timeframe of 100–300 years, we analyzed the sharing of DNA inferred to be identical by descent (IBD) in a subset of individuals from the 23andMe customer database (n = 22,757) and from the Human Genome Diversity Panel (HGDP-CEPH, n = 952). With data from 121 populations, we show that the average amount of DNA shared IBD in most ethnolinguistically-defined populations, for example Native American groups, Finns and Ashkenazi Jews, differs from continentally-defined populations by several orders of magnitude. Via extensive pedigree-based simulations, we determined bounds for predicted degrees of relationship given the amount of genomic IBD sharing in both endogamous and ‘unrelated’ population samples. Using these bounds as a guide, we detected tens of thousands of 2nd to 9th degree cousin pairs within a heterogenous set of 5,000 Europeans. The ubiquity of distant relatives, detected via IBD segments, in both ethnolinguistic populations and in large ‘unrelated’ populations samples has important implications for genetic genealogy, forensics and genotype/phenotype mapping studies

    Genome-Wide Analysis of the World's Sheep Breeds Reveals High Levels of Historic Mixture and Strong Recent Selection

    Get PDF
    Genomic structure in a global collection of domesticated sheep reveals a history of artificial selection for horn loss and traits relating to pigmentation, reproduction, and body size
    corecore