86 research outputs found

    A comprehensive SNP and indel imputability database

    Get PDF
    Motivation: Genotype imputation has become an indispensible step in genome-wide association studies (GWAS). Imputation accuracy, directly influencing downstream analysis, has shown to be improved using re-sequencing-based reference panels; however, this comes at the cost of high computational burden due to the huge number of potentially imputable markers (tens of millions) discovered through sequencing a large number of individuals. Therefore, there is an increasing need for access to imputation quality information without actually conducting imputation. To facilitate this process, we have established a publicly available SNP and indel imputability database, aiming to provide direct access to imputation accuracy information for markers identified by the 1000 Genomes Project across four major populations and covering multiple GWAS genotyping platforms

    Genome-Wide Association Study of Anthropometric Traits and Evidence of Interactions With Age and Study Year in Filipino Women

    Get PDF
    Increased values of multiple adiposity-related anthropometric traits are important risk factors for many common complex diseases. We performed a genome-wide association (GWA) study for four quantitative traits related to body size and adiposity (body mass index [BMI], weight, waist circumference, and height) in a cohort of 1,792 adult Filipino women from the Cebu Longitudinal Health and Nutrition Survey. This is the first GWA study of anthropometric traits in Filipinos, a population experiencing a rapid transition into a more obesogenic environment. In addition to identifying suggestive evidence of additional SNP association signals (P < 10−5), we replicated (P < 0.05, same direction of additive effect) associations previously reported in European populations of both BMI and weight with MC4R and FTO, of BMI with BDNF, and of height with EFEMP1, ZBTB38, and NPPC, but none with waist circumference. We also replicated loci reported in Japanese or Korean populations as associated with BMI (OTOL1) and height (HIST1H1PS2, C14orf145, GPC5). A difference in local linkage disequilibrium between European and Asian populations suggests a narrowed association region for BDNF, while still including a proposed functional non-synonymous amino acid substitution variant (rs6265, Val66Met). Finally, we observed significant evidence (P < 0.0042) for age-by-genotype interactions influencing BMI for rs17782313 (MC4R) and rs9939609 (FTO), and for a study year-by-genotype interaction for rs4923461 (BDNF). Our results show that several genetic risk factors are associated with anthropometric traits in Filipinos and provide further insight into the effects of BDNF, FTO, and MC4R on BMI

    Genetic association with lipids in Filipinos: waist circumference modifies an APOA5 effect on triglyceride levels

    Get PDF
    Blood levels of lipoprotein cholesterol and triglycerides (TGs) are highly heritable and are major risk factors for cardiovascular disease (CVD). Approximately 100 lipid-associated loci have been identified in populations of European ancestry. We performed a genome-wide association study of lipid traits in 1,782 Filipino women from the Cebu Longitudinal Health and Nutrition Survey, and tested for evidence of interactions with waist circumference. We conducted additional association and interaction analyses in 1,719 of their young adult offspring. Genome-wide significant associations (P < 5 × 10−8) were detected at APOE for low density lipoprotein cholesterol and total cholesterol, and at APOA5 for TGs. Suggestive associations (P < 10−6) were detected at GCKR for TGs, and at CETP and TOM1 for high density lipoprotein cholesterol. Our data also supported the existence of allelic heterogeneity at APOA5, CETP, LIPC, and APOE. The secondary signal (Gly185Cys) at APOA5 exhibited a single nucleotide polymorphism (SNP)-by-waist circumference interaction affecting TGs (Pinteraction = 1.6 × 10−4), manifested by stronger SNP effects as waist circumference increased. These findings provide the first evidence that central obesity may accentuate the effect of the TG-increasing allele of the APOA5 signal, emphasizing that CVD risk could be reduced by central obesity control

    Genome-wide association study for adiponectin levels in Filipino women identifies CDH13 and a novel uncommon haplotype at KNG1–ADIPOQ

    Get PDF
    Adiponectin is an adipocyte-secreted protein involved in a variety of metabolic processes, including glucose regulation and fatty acid catabolism. We conducted a genome-wide association study to investigate the genetic loci associated with plasma adiponectin in 1776 unrelated Filipino women from the Cebu Longitudinal Health and Nutrition Survey (CLHNS). Our strongest signal for adiponectin mapped to the gene CDH13 (rs3865188, P ≤ 7.2 × 10−16), which encodes a receptor for high-molecular-weight forms of adiponectin. Strong association was also detected near the ADIPOQ gene (rs864265, P = 3.8 × 10−9) and at a novel signal 100 kb upstream near KNG1 (rs11924390, P = 7.6 × 10−7). All three signals were also observed in 1774 young adult CLHNS offspring and in combined analysis including all 3550 mothers and offspring samples (all P ≤ 1.6 × 10−9). An uncommon haplotype of rs11924390 and rs864265 (haplotype frequency = 0.050) was strongly associated with lower adiponectin compared with the most common C–G haplotype in both CLHNS mothers (P = 1.8 × 10−25) and offspring (P = 8.7 × 10−32). Comprehensive imputation of 2653 SNPs in a 2 Mb region using as reference combined CHB, JPT and CEU haplotypes from the 1000 Genomes Project revealed no variants that perfectly tagged this haplotype. Our findings provide the first genome-wide significant evidence of association with plasma adiponectin at the CDH13 locus and identify a novel uncommon KNG1–ADIPOQ haplotype strongly associated with adiponectin levels in Filipinos

    Population-specific coding variant underlies genome-wide association with adiponectin level

    Get PDF
    Adiponectin is a protein hormone that can affect major metabolic processes including glucose regulation and fat metabolism. Our previous genome-wide association (GWA) study of circulating plasma adiponectin levels in Filipino women from the Cebu Longitudinal Health and Nutrition Survey (CLHNS) detected a 100 kb two-SNP haplotype at KNG1–ADIPOQ associated with reduced adiponectin (frequency = 0.050, P = 1.8 × 10−25). Subsequent genotyping of CLHNS young adult offspring detected an uncommon variant [minor allele frequency (MAF) = 0.025] located ∼800 kb from ADIPOQ that showed strong association with lower adiponectin levels (P = 2.7 × 10−15, n = 1695) and tagged a subset of KNG1–ADIPOQ haplotype carriers with even lower adiponectin levels. Sequencing of the ADIPOQ-coding region detected variant R221S (MAF = 0.015, P = 2.9 × 10−69), which explained 17.1% of the variance in adiponectin levels and largely accounted for the initial GWA signal in Filipinos. R221S was not present in 12 514 Europeans with previously sequenced exons. To explore the mechanism of this substitution, we re-measured adiponectin level in 20 R221S offspring carriers and 20 non-carriers using two alternative antibodies and determined that the presence of R221S resulted in artificially low quantification of adiponectin level using the original immunoassay. These data provide an example of an uncommon variant responsible for a GWA signal and demonstrate that genetic associations with phenotypes measured by antibody-based quantification methods can be affected by uncommon coding SNPs residing in the antibody target region

    Across-cohort QC analyses of GWAS summary statistics from complex traits.

    Get PDF
    Genome-wide association studies (GWASs) have been successful in discovering SNP trait associations for many quantitative traits and common diseases. Typically, the effect sizes of SNP alleles are very small and this requires large genome-wide association meta-analyses (GWAMAs) to maximize statistical power. A trend towards ever-larger GWAMA is likely to continue, yet dealing with summary statistics from hundreds of cohorts increases logistical and quality control problems, including unknown sample overlap, and these can lead to both false positive and false negative findings. In this study, we propose four metrics and visualization tools for GWAMA, using summary statistics from cohort-level GWASs. We propose methods to examine the concordance between demographic information, and summary statistics and methods to investigate sample overlap. (I) We use the population genetics Fst statistic to verify the genetic origin of each cohort and their geographic location, and demonstrate using GWAMA data from the GIANT Consortium that geographic locations of cohorts can be recovered and outlier cohorts can be detected. (II) We conduct principal component analysis based on reported allele frequencies, and are able to recover the ancestral information for each cohort. (III) We propose a new statistic that uses the reported allelic effect sizes and their standard errors to identify significant sample overlap or heterogeneity between pairs of cohorts. (IV) To quantify unknown sample overlap across all pairs of cohorts, we propose a method that uses randomly generated genetic predictors that does not require the sharing of individual-level genotype data and does not breach individual privacy

    Quality control and conduct of genome-wide association meta-analyses

    Get PDF
    Rigorous organization and quality control (QC) are necessary to facilitate successful genome-wide association meta-analyses (GWAMAs) of statistics aggregated across multiple genome-wide association studies. This protocol provides guidelines for [1] organizational aspects of GWAMAs, and for [2] QC at the study file level, the meta-level across studies, and the meta-analysis output level. Real–world examples highlight issues experienced and solutions developed by the GIANT Consortium that has conducted meta-analyses including data from 125 studies comprising more than 330,000 individuals. We provide a general protocol for conducting GWAMAs and carrying out QC to minimize errors and to guarantee maximum use of the data. We also include details for use of a powerful and flexible software package called EasyQC. For consortia of comparable size to the GIANT consortium, the present protocol takes a minimum of about 10 months to complete
    corecore