275 research outputs found

    Genetic variation in South Indian castes: evidence from Y-chromosome, mitochondrial, and autosomal polymorphisms

    Background: Major population movements, social structure, and caste endogamy have influenced the genetic structure of Indian populations. An understanding of these influences is increasingly important as gene mapping and case-control studies are initiated in South Indian populations. Results: We report new data on 155 individuals from four Tamil caste populations of South India and perform comparative analyses with caste populations from the neighboring state of Andhra Pradesh. Genetic differentiation among Tamil castes is low (R = 0.96% for 45 autosomal short tandem repeat (STR) markers), reflecting a largely common origin. Nonetheless, caste- and continent-specific patterns are evident. For 32 lineage-defining Y-chromosome SNPs, Tamil castes show higher affinity to Europeans than to eastern Asians, and genetic distance estimates to the Europeans are ordered by caste rank. For 32 lineage-defining mitochondrial SNPs and hypervariable sequence (HVS) 1, Tamil castes have higher affinity to eastern Asians than to Europeans. For 45 autosomal STRs, upper and middle rank castes show higher affinity to Europeans than do lower rank castes from either Tamil Nadu or Andhra Pradesh. Local between-caste variation (Tamil Nadu R = 0.96%, Andhra Pradesh R = 0.77%) exceeds the estimate of variation between these geographically separated groups (R = 0.12%). Low, but statistically significant, correlations between caste rank distance and genetic distance are demonstrated for Tamil castes using Y-chromosome, mtDNA, and autosomal data. Conclusion: Genetic data from Y-chromosome, mtDNA, and autosomal STRs are in accord with historical accounts of northwest to southeast population movements in India. The influence of ancient and historical population movements and caste social structure can be detected and replicated in South Indian caste populations from two different geographic regions

    Empirical Distributions of F-ST from Large-Scale Human Polymorphism Data

    Studies of the apportionment of human genetic variation have long established that most human variation is within population groups and that the additional variation between population groups is small but greatest when comparing different continental populations. These studies often used Wright’s FST, which apportions the standardized variance in allele frequencies within and between population groups. Because local adaptations increase population differentiation, high FST may be found at closely linked loci under selection and used to identify genes undergoing directional or heterotic selection. We re-examined these processes using HapMap data. We analyzed 3 million SNPs on 602 samples from eight worldwide populations and a consensus subset of 1 million SNPs found in all populations. We identified four major features of the data. First, a hierarchical FST analysis showed that only a small fraction (12%) of the total genetic variation is distributed between continental populations, and an even smaller fraction (1%) is found between populations within continents. Second, the global FST distribution closely follows an exponential distribution. Third, although the overall FST distribution is similarly shaped (inverse J), the FST distributions vary markedly by allele frequency when the SNPs are divided into non-overlapping groups by allele frequency range. Because the mean allele frequency is a crude indicator of allele age, these distributions mark the time-dependent change in genetic differentiation. Finally, the change in mean FST of these groups is linear in allele frequency. These results suggest that investigating the extremes of the FST distribution for each allele frequency group is more efficient for detecting selection. Consequently, we demonstrate that such extreme SNPs are more clustered along the chromosomes than expected from linkage disequilibrium for each allele frequency group. These genomic regions are therefore likely candidates for natural selection.

    Cubic exact solutions for the estimation of pairwise haplotype frequencies: implications for linkage disequilibrium analyses and a web tool 'CubeX'

    Background: The frequency of a haplotype comprising one allele at each of two loci can be expressed as a cubic equation (the 'Hill equation'), the solution of which gives that frequency. Most haplotype and linkage disequilibrium analysis programs use iteration-based algorithms which substitute an estimate of haplotype frequency into the equation, producing a new estimate which is repeatedly fed back into the equation until the values converge to a maximum likelihood estimate (expectation-maximisation). Results: We present a program, "CubeX", which calculates the biologically possible exact solution(s) and provides estimated haplotype frequencies, D', r² and χ² values for each. CubeX provides a "complete" analysis of haplotype frequencies and linkage disequilibrium for a pair of biallelic markers under situations where sampling variation and genotyping errors distort sample Hardy-Weinberg equilibrium, potentially causing more than one biologically possible solution. We also present an analysis of simulations and real data using the algebraically exact solution, which indicates that under perfect sample Hardy-Weinberg equilibrium there is only one biologically possible solution, but that under other conditions there may be more. Conclusion: Our analyses demonstrate that lower allele frequencies, lower sample numbers, population stratification and a possible |D'| value of 1 are particularly susceptible to distortion of sample Hardy-Weinberg equilibrium, which has significant implications for calculation of linkage disequilibrium in small sample sizes (e.g. HapMap) and rarer alleles (e.g. paucimorphisms, q < 0.05) that may have particular disease relevance and require improved approaches for meaningful evaluation.
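    The iterative approach that the abstract above attributes to most programs can be sketched as follows. This is a minimal illustration of expectation-maximisation for two biallelic loci, not the CubeX cubic solution and not any particular program's implementation. The function names, the 3x3 genotype-count layout, and the starting value of 0.5 are assumptions made for this example.

```python
def em_haplotype_freqs(counts, tol=1e-12, max_iter=1000):
    """Estimate two-locus haplotype frequencies by EM (illustrative sketch).

    counts[i][j] is the number of individuals carrying i copies of allele A
    at locus 1 and j copies of allele B at locus 2 (hypothetical layout).
    """
    total = 2 * sum(sum(row) for row in counts)  # number of haplotypes
    # Haplotype counts from genotypes whose phase is unambiguous.
    known = {
        "AB": 2 * counts[2][2] + counts[2][1] + counts[1][2],
        "Ab": 2 * counts[2][0] + counts[2][1] + counts[1][0],
        "aB": 2 * counts[0][2] + counts[0][1] + counts[1][2],
        "ab": 2 * counts[0][0] + counts[0][1] + counts[1][0],
    }
    dh = counts[1][1]  # double heterozygotes: AB/ab or Ab/aB
    x = 0.5            # assumed starting share of double hets that are AB/ab

    def freqs_for(share):
        return {
            "AB": (known["AB"] + dh * share) / total,
            "ab": (known["ab"] + dh * share) / total,
            "Ab": (known["Ab"] + dh * (1 - share)) / total,
            "aB": (known["aB"] + dh * (1 - share)) / total,
        }

    for _ in range(max_iter):
        f = freqs_for(x)
        cis = f["AB"] * f["ab"]
        trans = f["Ab"] * f["aB"]
        if cis + trans == 0:  # no information to update the share
            break
        new_x = cis / (cis + trans)  # E-step: expected share of AB/ab
        if abs(new_x - x) < tol:
            x = new_x
            break
        x = new_x  # the M-step is the recomputation in freqs_for
    return freqs_for(x)


def ld_stats(f):
    """Return D and r^2 from haplotype frequencies (r^2 is None if undefined)."""
    p_a = f["AB"] + f["Ab"]
    p_b = f["AB"] + f["aB"]
    d = f["AB"] - p_a * p_b
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    return d, (d * d / denom if denom > 0 else None)
```

    Because the iteration converges to one stationary point from a given start, it can miss the additional biologically possible solutions that the abstract says an exact cubic solution can expose.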

    The genetic study of three population microisolates in South Tyrol (MICROS): study design and epidemiological perspectives

    Background: There is increasing evidence of the important role that small, isolated populations could play in finding genes involved in the etiology of diseases. For historical and political reasons, South Tyrol, the northernmost Italian region, includes several small villages which remained isolated over the centuries. Methods: The MICROS study is a population-based survey of three small, isolated villages, characterized by: old settlement; small number of founders; high endogamy rates; slow/null population expansion. During stage 1 (2002/03), genealogical data, screening questionnaires, clinical measurements, blood and urine samples, and DNA were collected for 1175 adult volunteers. Stage 2, concerning trait diagnoses, linkage analysis and association studies, is ongoing. The selection of the traits is being driven by expert clinicians. Preliminary descriptive statistics were obtained. Power simulations for finding linkage on a quantitative trait locus (QTL) were undertaken. Results: Starting from participants, genealogies were reconstructed for 50,037 subjects, going back to the early 1600s. Within the last five generations, subjects were clustered in one pedigree of 7049 subjects plus 178 smaller pedigrees (3 to 85 subjects each). A significant probability of familial clustering was found for many traits, especially among the cardiovascular, neurological and respiratory traits. Simulations showed that the MICROS pedigree has substantial power to detect a LOD score ≥ 3 when the QTL-specific heritability is ≥ 20%. Conclusion: The MICROS study is an extensive, ongoing, two-stage survey aimed at characterizing the genetic epidemiology of Mendelian and complex diseases. Our approach, involving different scientific disciplines, is an advantageous strategy to define and to study population isolates. The isolation of the Alpine populations, together with the extensive data collected so far, makes the MICROS study a powerful resource for the study of diseases in many fields of medicine. Recent successes and simulation studies give us confidence that our pedigrees can be valuable both in finding new candidate loci and in confirming existing candidate genes.

    Ancestry of the Iban Is Predominantly Southeast Asian: Genetic Evidence from Autosomal, Mitochondrial, and Y Chromosomes

    Humans reached present-day Island Southeast Asia (ISEA) in one of the first major human migrations out of Africa. Population movements in the millennia following this initial settlement are thought to have greatly influenced the genetic makeup of current inhabitants, yet the extent attributed to different events is not clear. Recent studies suggest that south-to-north gene flow largely influenced present-day patterns of genetic variation in Southeast Asian populations and that late Pleistocene and early Holocene migrations from Southeast Asia are responsible for a substantial proportion of ISEA ancestry. Archaeological and linguistic evidence suggests that the ancestors of present-day inhabitants came mainly from north-to-south migrations from Taiwan and throughout ISEA approximately 4,000 years ago. We report a large-scale genetic analysis of human variation in the Iban population from the Malaysian state of Sarawak in northwestern Borneo, located in the center of ISEA. Genome-wide single-nucleotide polymorphism (SNP) markers analyzed here suggest that the Iban exhibit greatest genetic similarity to Indonesian and mainland Southeast Asian populations. The most common non-recombining Y (NRY) and mitochondrial (mt) DNA haplogroups present in the Iban are associated with populations of Southeast Asia. We conclude that migrations from Southeast Asia made a large contribution to Iban ancestry, although evidence of potential gene flow from Taiwan is also seen in uniparentally inherited marker data

    Adrenergic Alpha-1 Pathway Is Associated with Hypertension among Nigerians in a Pathway-focused Analysis

    The pathway-focused association approach offers a hypothesis-driven alternative to the agnostic genome-wide association study. Here we apply the pathway-focused approach to an association study of hypertension, systolic blood pressure (SBP), and diastolic blood pressure (DBP) in 1614 Nigerians with genome-wide data. Testing of 28 pathways with biological relevance to hypertension, selected a priori, containing a total of 101 unique genes and 4,349 unique single-nucleotide polymorphisms (SNPs), showed an association for the adrenergic alpha 1 (ADRA1) receptor pathway with hypertension (p<0.0009) and diastolic blood pressure (p<0.0007). Within the ADRA1 pathway, the genes PNMT (hypertension P(gene)<0.004, DBP P(gene)<0.004, and SBP P(gene)<0.009) and ADRA1B (hypertension P(gene)<0.005, DBP P(gene)<0.02, and SBP P(gene)<0.02) displayed the strongest associations. Neither ADRA1B nor PNMT could be the sole mediator of the observed pathway association, as the ADRA1 pathway remained significant after removing ADRA1B, and other pathways involving PNMT did not reach pathway significance. We conclude that multiple variants in several genes in the ADRA1 pathway led to associations with hypertension and DBP. SNPs in ADRA1B and PNMT have not previously been linked to hypertension in a genome-wide association study, but both genes have shown associations with hypertension through linkage or model organism studies. The identification of moderately significant (10^-2 > p > 10^-5) SNPs offers a novel method for detecting the "missing heritability" of hypertension. These findings warrant further studies in similar and other populations to assess the generalizability of our results, and illustrate the potential of the pathway-focused approach to investigate genetic variation in hypertension.

    A multi-ethnic study of a PNPLA3 gene variant and its association with disease severity in non-alcoholic fatty liver disease

    The adiponutrin (PNPLA3) rs738409 polymorphism has been found to be associated with susceptibility to non-alcoholic fatty liver disease (NAFLD) in various cohorts. We further investigated the association of this polymorphism with non-alcoholic steatohepatitis (NASH) severity and with histological features of NAFLD. A total of 144 biopsy-proven NAFLD patients and 198 controls were genotyped for the PNPLA3 gene polymorphism (rs738409 C>G). The biopsy specimens were histologically graded by a qualified pathologist. We observed an association of the G allele with susceptibility to NAFLD in the pooled subjects (OR 2.34, 95% CI 1.69–3.24, p < 0.0001) and, following stratification, in each of the three ethnic subgroups, namely Chinese, Indian and Malay (OR 1.94, 95% CI 1.12–3.37, p = 0.018; OR 3.51, 95% CI 1.69–7.26, p = 0.001 and OR 2.05, 95% CI 1.25–3.35, p = 0.005, respectively). The G allele is associated with susceptibility to NASH (OR 2.64, 95% CI 1.85–3.75, p < 0.0001), with NASH severity (OR 1.85, 95% CI 1.05–3.26, p = 0.035) and with the presence of fibrosis (OR 1.95, 95% CI 1.17–3.26, p = 0.013), but not with simple steatosis or with other histological parameters. Although the serum triglyceride level is significantly higher in NAFLD patients than in controls, the G allele is associated with a decreased level of triglycerides (p = 0.029) in the NAFLD patients. Overall, the rs738409 G allele is associated with severity of NASH and occurrence of fibrosis in patients with NAFLD.