    A population-specific reference panel empowers genetic studies of Anabaptist populations

    Genotype imputation is a powerful strategy for achieving the large sample sizes required for identification of variants underlying complex phenotypes, but imputation of rare variants remains problematic. Genetically isolated populations offer one solution; however, population-specific reference panels are needed to ensure optimal imputation accuracy and allele frequency estimation. Here we report the Anabaptist Genome Reference Panel (AGRP), the first whole-genome catalogue of variants and phased haplotypes in people of Amish and Mennonite ancestry. Based on high-depth whole-genome sequence (WGS) from 265 individuals, the AGRP contains >12 M high-confidence single nucleotide variants and short indels, of which ~12.5% are novel. These Anabaptist-specific variants were more deleterious than variants with comparable frequencies observed in the 1000 Genomes panel. About 43,000 variants showed enriched allele frequencies in AGRP, consistent with drift. When combined with the 1000 Genomes Project reference panel, the AGRP substantially improved imputation, especially for rarer variants. The AGRP is freely available to researchers through an imputation server.

    Multi-phenotype analyses of hemostatic traits with cardiovascular events reveal novel genetic associations

    Background: Multi-phenotype analysis of genetically correlated phenotypes can increase the statistical power to detect loci associated with multiple traits, leading to the discovery of novel loci. This is the first study to date to comprehensively analyze the shared genetic effects within different hemostatic traits, and between these and their associated disease outcomes. Objectives: To discover novel genetic associations by combining summary data of correlated hemostatic traits and disease events. Methods: Summary statistics from genome-wide association studies (GWAS) of seven hemostatic traits (factor VII [FVII], factor VIII [FVIII], von Willebrand factor [VWF], factor XI [FXI], fibrinogen, tissue plasminogen activator [tPA], plasminogen activator inhibitor 1 [PAI-1]) and three major cardiovascular (CV) events (venous thromboembolism [VTE], coronary artery disease [CAD], ischemic stroke [IS]) were combined in 27 multi-trait combinations using metaUSAT. Genetic correlations between phenotypes were calculated using Linkage Disequilibrium Score Regression (LDSC). Newly associated loci were investigated for colocalization. We considered a significance threshold of 1.85 × 10⁻⁹, obtained after applying Bonferroni correction for the number of multi-trait combinations performed (n = 27). Results: Across the 27 multi-trait analyses, we found 4 novel pleiotropic loci (XXYLT1, KNG1, SUGP1/MAU2, TBL2/MLXIPL) that were not significant in the original individual datasets, were not described in previous GWAS for the individual traits, and that presented a common associated variant between the studied phenotypes. Conclusions: The discovery of four novel loci contributes to the understanding of the relationship between hemostasis and CV events and elucidates common genetic factors between these traits.
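The significance threshold quoted in this abstract can be checked with a few lines of arithmetic. The sketch below assumes the starting point is the conventional genome-wide level of 5 × 10⁻⁸, which the abstract does not state explicitly; the function and constant names are illustrative only.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Divide a significance level by the number of tests performed."""
    if n_tests <= 0:
        raise ValueError("n_tests must be a positive integer")
    return alpha / n_tests


# Assumption: conventional genome-wide significance level (not stated in the abstract).
GENOME_WIDE_ALPHA = 5e-8
# Number of multi-trait combinations reported in the abstract.
N_COMBINATIONS = 27

threshold = bonferroni_threshold(GENOME_WIDE_ALPHA, N_COMBINATIONS)
print(f"{threshold:.2e}")
```

Under that assumption, the result rounds to the 1.85 × 10⁻⁹ reported above.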

    Guidance for the utility of linear models in meta-analysis of genetic association studies of binary phenotypes

    Linear mixed models are increasingly used for the analysis of genome-wide association studies (GWAS) of binary phenotypes because they can efficiently and robustly account for population stratification and relatedness through inclusion of random effects for a genetic relationship matrix. However, the utility of linear (mixed) models in the context of meta-analysis of GWAS of binary phenotypes has not been previously explored. In this investigation, we present simulations to compare the performance of linear and logistic regression models under alternative weighting schemes in a fixed-effects meta-analysis framework, considering designs that incorporate variable case-control imbalance, confounding factors and population stratification. Our results demonstrate that linear models can be used for meta-analysis of GWAS of binary phenotypes, without loss of power, even in the presence of extreme case-control imbalance, provided that one of the following schemes is used: (i) effective sample size weighting of Z-scores or (ii) inverse-variance weighting of allelic effect sizes after conversion onto the log-odds scale. Our conclusions thus provide essential recommendations for the development of robust protocols for meta-analysis of binary phenotypes with linear models.
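The two weighting schemes named in this abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' protocol: the effective sample size formula 4 / (1/cases + 1/controls) and the conversion of a linear-model effect to the log-odds scale by dividing by k(1 − k), where k is the case fraction, are commonly used conventions that the abstract itself does not spell out. All function names are hypothetical.

```python
import math


def effective_sample_size(n_cases: int, n_controls: int) -> float:
    # Assumed convention: N_eff = 4 / (1/N_cases + 1/N_controls).
    return 4.0 / (1.0 / n_cases + 1.0 / n_controls)


def z_score_meta(z_scores, n_eff):
    # Scheme (i): combine per-study Z-scores with weights sqrt(N_eff).
    weights = [math.sqrt(n) for n in n_eff]
    numerator = sum(w * z for w, z in zip(weights, z_scores))
    denominator = math.sqrt(sum(w * w for w in weights))
    return numerator / denominator


def linear_to_log_odds(beta, se, case_fraction):
    # Assumed approximation: rescale a linear-model effect and its standard
    # error by 1 / (k * (1 - k)), with k the fraction of cases in the study.
    scale = case_fraction * (1.0 - case_fraction)
    return beta / scale, se / scale


def inverse_variance_meta(betas, ses):
    # Scheme (ii): fixed-effects inverse-variance weighting of effect sizes.
    weights = [1.0 / (se * se) for se in ses]
    total = sum(weights)
    beta = sum(w * b for w, b in zip(weights, betas)) / total
    return beta, math.sqrt(1.0 / total)
```

A typical use would convert each study's linear-model estimate with `linear_to_log_odds` before passing the results to `inverse_variance_meta`, or compute `effective_sample_size` per study and pass the Z-scores to `z_score_meta`.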

    Environment dominates over host genetics in shaping human gut microbiota

    Human gut microbiome composition is shaped by multiple factors but the relative contribution of host genetics remains elusive. Here we examine genotype and microbiome data from 1,046 healthy individuals with several distinct ancestral origins who share a relatively common environment, and demonstrate that the gut microbiome is not significantly associated with genetic ancestry, and that host genetics have a minor role in determining microbiome composition. We show that, by contrast, there are significant similarities in the compositions of the microbiomes of genetically unrelated individuals who share a household, and that over 20% of the inter-person microbiome variability is associated with factors related to diet, drugs and anthropometric measurements. We further demonstrate that microbiome data significantly improve the prediction accuracy for many human traits, such as glucose and obesity measures, compared to models that use only host genetic and environmental data. These results suggest that microbiome alterations aimed at improving clinical outcomes may be carried out across diverse genetic backgrounds.