17 research outputs found

    Genetic analyses identify widespread sex-differential participation bias

    Get PDF
    Genetic analyses identify widespread sex-differential participation bias in population-based studies and show how this bias can lead to incorrect inferences. These findings highlight new challenges for association studies as sample sizes continue to grow. Genetic association results are often interpreted with the assumption that study participation does not affect downstream analyses. Understanding the genetic basis of participation bias is challenging since it requires the genotypes of unseen individuals. Here we demonstrate that it is possible to estimate comparative biases by performing a genome-wide association study contrasting one subgroup versus another. For example, we showed that sex exhibits artifactual autosomal heritability in the presence of sex-differential participation bias. By performing a genome-wide association study of sex in approximately 3.3 million males and females, we identified over 158 autosomal loci spuriously associated with sex and highlighted complex traits underpinning differences in study participation between the sexes. For example, the body mass index-increasing allele at FTO was observed at higher frequency in males compared to females (odds ratio = 1.02, P = 4.4 x 10(-)(36)). Finally, we demonstrated how these biases can potentially lead to incorrect inferences in downstream analyses and propose a conceptual framework for addressing such biases. Our findings highlight a new challenge that genetic studies may face as sample sizes continue to grow.Peer reviewe

    Stroke genetics informs drug discovery and risk prediction across ancestries

    Get PDF
    Daniel Strbian työryhmän jäsenenä Correction; Early Access DOI: 10.1038/s41586-022-05492-5 Early Access: NOV 2022Previous genome-wide association studies (GWASs) of stroke - the second leading cause of death worldwide - were conducted predominantly in populations of European ancestry(1,2). Here, in cross-ancestry GWAS meta-analyses of 110,182 patients who have had a stroke (five ancestries, 33% non-European) and 1,503,898 control individuals, we identify association signals for stroke and its subtypes at 89 (61 new) independent loci: 60 in primary inverse-variance-weighted analyses and 29 in secondary meta-regression and multitrait analyses. On the basis of internal cross-ancestry validation and an independent follow-up in 89,084 additional cases of stroke (30% non-European) and 1,013,843 control individuals, 87% of the primary stroke risk loci and 60% of the secondary stroke risk loci were replicated (P < 0.05). Effect sizes were highly correlated across ancestries. Cross-ancestry fine-mapping, in silico mutagenesis analysis(3), and transcriptome-wide and proteome-wide association analyses revealed putative causal genes (such as SH3PXD2A and FURIN) and variants (such as at GRK5 and NOS3). Using a three-pronged approach(4), we provide genetic evidence for putative drug effects, highlighting F11, KLKB1, PROC, GP1BA, LAMC2 and VCAM1 as possible targets, with drugs already under investigation for stroke for F11 and PROC. A polygenic score integrating cross-ancestry and ancestry-specific stroke GWASs with vascular-risk factor GWASs (integrative polygenic scores) strongly predicted ischaemic stroke in populations of European, East Asian and African ancestry(5). Stroke genetic risk scores were predictive of ischaemic stroke independent of clinical risk factors in 52,600 clinical-trial participants with cardiometabolic disease. Our results provide insights to inform biology, reveal potential drug targets and derive genetic risk prediction tools across ancestries.Peer reviewe

    Global Biobank Meta-analysis Initiative : Powering genetic discovery across human disease

    Get PDF
    Funding Information: The work of the contributing biobanks was supported by numerous grants from governmental and charitable bodies. Biobank-specific acknowledgments and more detailed acknowledgments are included in Data S2. Initiative management, S.B.C. J.C. N.J.C. M.J.D. E.E.K. A.R.M. B.M.N. Y.O. A.V.P. D.A.v.H. R.G.W. C.J.W. W.Z. and S.Z.; individual biobank analysis, A.B. Y.B. B.M.B. C.D.B. S.C. T.-T.C. K.C. S.M.D. M.D. G.H.d.B. Y.D. N.J.D. M.-J.F. Y.-C.A.F. S.F. V.L.F. L.G.F. E.R.G. T.R.G. D.H.G. C.R.G. G.G.-A. S.E.G. L.A.G. C.H. J.B.H. W.E.H. H.H. K.H. N.I. A.I. R.J. M. Kurki, J.K. N.K. E.E.K. J.T.K. M. Kanai, T.L. K.L. M.H.L. S.L. K.L. Y.-F.L. V.L.F. R.J.F.L. E.A.L.-M. A.R.-M. S.M.-G. R.M. R.E.M. H.C.M. A.R.M. Y.M. H.M. S.E.M. I.Y.M. B.M. S.M. K.N. S.N. M.A.N.-A. K.N. Y.O. P.P. A.L.-P. A.P. B.P. S.P. M.H.P. D.J.R. N.R. M.D.R. A.R. C.S. S.S. S.S.S. J.A.S. P.S. I.S. T.T. R.T. K.T. J.U. D.A.v.H. B.V. M.V. Y.V. J.M.V. R.G.W. Y.W. S.J.W. B.N.W. K.-H.H.W. M.Z. X.Z. and S.Z.; individual biobank management, N.A. A.A.T. K.M.A.-D. P.A. K.C.B. M. Boehnke, M. Boezen, C.D.B. A.C. Z.C. C.-Y.C. J.C. N.J.C. S.M.D. S.F. Y.-C.A.F. S.F. E.F. T.G. C.R.G. C.J.G. Y.G. H.H. K.A.H. K.H. S.I.I. N.M.J. N.K. E.E.K. J.T.K. C.L. M.H.L. M.T.M.L. L.L. K.L. Y.-F.L. R.J.F.L. J.L. S.M. Y.M. K.M. I.Y.M. Y.O. C.M.O. A.V.P. B.P. D.J.P. D.J.R. M.D.R. S.S. J.W.S. H.S. K.S. T.T. U.T. R.C.T. D.A.v.H. M.V. R.G.W. D.C.W. C.W. J.W. M.Z. X.Z. and S.Z.; study design and interpretation of results, A.B. M. Boehnke, M. Boezen, B.M.B. T.-T.C. C.-Y.C. M.J.D. G.D.S. N.J.D. S.F. M.-J.F. H.K.F. E.R.G. A.G. T.G. J.B.H. J.H. K.H. R.J. M.K. E.E.K. T.K. C.M.L. V.L.F. E.A.L.-M. A.R.M. S.N. B.M.N. C.M.O. J.J.P. B.P. N.R. H.R. J.A.S. I.S. K.T. D.A.v.H. R.G.W. Y.W. D.C.W. S.J.W. C.J.W. B.N.W. J.W. K.-H.H.W. M.Z. H.Z. J.Z. W.Z. X.Z. and S.Z.; drafted and edited the paper, A.B. M. Boehnke, M. Boezen, M.J.D. G.H.d.B. N.J.D. T.R.G. J.B.H. N.I. N.M.J. M.K. V.L.F. S.M. A.R.M. H.M. S.N. B.M.N. C.M.O. B.P. H.R. C.S. J.A.S. J.W.S. K.T. Y.W. D.C.W. C.J.W. K.-H.H.W. H.Z. J.Z. W.Z. and S.Z.; primary meta-analysis and quality control, M.J.D. H.K.F. M. Kanai, J.K. J.T.K. M. Kurki, M.M. B.M.N. C.J.W. K.-H.H.W. and W.Z.; drug discovery: S.N. T.K. K.-H.H.W. W.Z. and Y.O.; fine mapping, M. Kanai, W.Z. M.J.D. and H.K.F.; polygenic risk score, Y.W. S.N. E.A.L.-M. S.K. K.T. K.L. M. Kanai, W.Z. K.W. M.-J.F. L.B. P.A. P.D. V.L.F. R.M. Y.M. B.B. S.S. J.U. E.R.G. N.J.C. I.S. Y.O. A.R.M. and J.B.H.; proteome-wide Mendelian randomization, H.Z. H.R. A.B. G.H. G.D.S. B.M.B. W.Z. B.M.N. T.R.G. and J.Z.; transcriptome-wide association study, A.B. J.B.H. W.Z. J.Z. M. Kanai, B.P. E.R.G. and N.J.C.; asthma, K.T. W.Z. Y.W. M. Kanai, S.N. Y.O. B.M.N. M.J.D. and A.R.M.; heart failure, K.-H.H.W. N.J.D. B.N.W. I.S. S.E.G. J.B.H. N.J.C. M.P. R.J.F.L. M.J.D. B.M.N. W.Z. W.E.H. and C.J.W.; idiopathic pulmonary fibrosis, J.J.P. W.Z. M.J.D. J.T.K. N.J.C. and J.B.H.; primary open-angle glaucoma, V.L.F. A.B. W.Z. Y.W. K.L. M. Kanai, E.A.L.-M. P.S. R.T. X.Z. S.N. S.S. Y.O. N.I. S.M. H.S. I.S. C.W. A.R.M. E.R.G. N.M.J. N.J.C. and J.B.H.; stroke, I.S. K.-H.H.W. W.H. B.N.W. W.Z. J.E.H. A.P. B.B. A.H.S. M.E.G. R.G.W. K.H. C.K. S.Z. M.J.D. B.M.N. and C.J.W.; venous thromboembolism, B.N.W. I.S. K.-H.H.W. B.B. V.L.F. K.T. M.D. B.N. W.Z. J.A.S. and C.J.W. All authors reviewed the manuscript. M.J.D. is a founder of Maze Therapeutics. B.M.N. is a member of the scientific advisory board at Deep Genomics and a consultant for Camp4 Therapeutics, Takeda Pharmaceutical, and Biogen. The spouse of C.J.W. works at Regeneron Pharmaceuticals. C.-Y.C. is employed by Biogen. C.R.G. owns stock in 23andMe, Inc. T.R.G. has received research funding from various pharmaceutical companies to support the application of Mendelian randomization to drug target prioritization. E.E.K. has received speaker fees from Regeneron, Illumina, and 23andMe and is a member of the advisory board for Galateo Bio. R.E.M. has received speaker fees from Illumina and is a scientific advisor to the Epigenetic Clock Development Foundation. G.D.S. has received research funding from various pharmaceutical companies to support the application of Mendelian randomization to drug target prioritization. K.S. and U.T. are employed by deCODE Genetics/Amgen, Inc. J.Z. has received research funding from various pharmaceutical companies to support the application of Mendelian randomization to drug target prioritization. S.M. is a co-founder of and holds stock in Seonix Bio. Publisher Copyright: © 2022Biobanks facilitate genome-wide association studies (GWASs), which have mapped genomic loci across a range of human diseases and traits. However, most biobanks are primarily composed of individuals of European ancestry. We introduce the Global Biobank Meta-analysis Initiative (GBMI)—a collaborative network of 23 biobanks from 4 continents representing more than 2.2 million consented individuals with genetic data linked to electronic health records. GBMI meta-analyzes summary statistics from GWASs generated using harmonized genotypes and phenotypes from member biobanks for 14 exemplar diseases and endpoints. This strategy validates that GWASs conducted in diverse biobanks can be integrated despite heterogeneity in case definitions, recruitment strategies, and baseline characteristics. This collaborative effort improves GWAS power for diseases, benefits understudied diseases, and improves risk prediction while also enabling the nomination of disease genes and drug candidates by incorporating gene and protein expression data and providing insight into the underlying biology of human diseases and traits.Peer reviewe

    Stroke genetics informs drug discovery and risk prediction across ancestries

    Get PDF
    © 2022. The Author(s).Previous genome-wide association studies (GWASs) of stroke - the second leading cause of death worldwide - were conducted predominantly in populations of European ancestry1,2. Here, in cross-ancestry GWAS meta-analyses of 110,182 patients who have had a stroke (five ancestries, 33% non-European) and 1,503,898 control individuals, we identify association signals for stroke and its subtypes at 89 (61 new) independent loci: 60 in primary inverse-variance-weighted analyses and 29 in secondary meta-regression and multitrait analyses. On the basis of internal cross-ancestry validation and an independent follow-up in 89,084 additional cases of stroke (30% non-European) and 1,013,843 control individuals, 87% of the primary stroke risk loci and 60% of the secondary stroke risk loci were replicated (P < 0.05). Effect sizes were highly correlated across ancestries. Cross-ancestry fine-mapping, in silico mutagenesis analysis3, and transcriptome-wide and proteome-wide association analyses revealed putative causal genes (such as SH3PXD2A and FURIN) and variants (such as at GRK5 and NOS3). Using a three-pronged approach4, we provide genetic evidence for putative drug effects, highlighting F11, KLKB1, PROC, GP1BA, LAMC2 and VCAM1 as possible targets, with drugs already under investigation for stroke for F11 and PROC. A polygenic score integrating cross-ancestry and ancestry-specific stroke GWASs with vascular-risk factor GWASs (integrative polygenic scores) strongly predicted ischaemic stroke in populations of European, East Asian and African ancestry5. Stroke genetic risk scores were predictive of ischaemic stroke independent of clinical risk factors in 52,600 clinical-trial participants with cardiometabolic disease. Our results provide insights to inform biology, reveal potential drug targets and derive genetic risk prediction tools across ancestries.Peer reviewe

    The genetic basis of endometriosis and comorbidity with other pain and inflammatory conditions

    Get PDF
    Endometriosis is a common condition associated with debilitating pelvic pain and infertility. A genome-wide association study meta-analysis, including 60,674 cases and 701,926 controls of European and East Asian descent, identified 42 genome-wide significant loci comprising 49 distinct association signals. Effect sizes were largest for stage 3/4 disease, driven by ovarian endometriosis. Identified signals explained up to 5.01% of disease variance and regulated expression or methylation of genes in endometrium and blood, many of which were associated with pain perception/maintenance (SRP14/BMF, GDAP1, MLLT10, BSN and NGF). We observed significant genetic correlations between endometriosis and 11 pain conditions, including migraine, back and multisite chronic pain (MCP), as well as inflammatory conditions, including asthma and osteoarthritis. Multitrait genetic analyses identified substantial sharing of variants associated with endometriosis and MCP/migraine. Targeted investigations of genetically regulated mechanisms shared between endometriosis and other pain conditions are needed to aid the development of new treatments and facilitate early symptomatic intervention

    Integration of Biomarker Polygenic Risk Score Improves Prediction of Coronary Heart Disease

    Get PDF
    There are several established biomarkers for coronary heart disease (CHD), including blood pressure, cholesterol, and lipoproteins. It is of high interest to determine how a combined polygenic risk score (PRS) of CHD-associated biomarkers (BioPRS) can further improve genetic prediction of CHD. We developed CHDBioPRS, combining BioPRS with PRS of CHD in the UK Biobank and tested it on FinnGen. We found that BioPRS was clearly predictive of CHD and that CHDBioPRS improved the standard CHD PRS. The largest effect was observed with early onset cases in FinnGen, with HRs above 2 per standard deviation of CHDBioPRS.Peer reviewe

    Genome-wide meta-analysis identifies 127 open-angle glaucoma loci with consistent effect across ancestries

    Get PDF
    Primary open-angle glaucoma (POAG), is a heritable common cause of blindness world-wide. To identify risk loci, we conduct a large multi-ethnic meta-analysis of genome-wide association studies on a total of 34,179 cases and 349,321 controls, identifying 44 previously unreported risk loci and confirming 83 loci that were previously known. The majority of loci have broadly consistent effects across European, Asian and African ancestries. Cross-ancestry data improve fine-mapping of causal variants for several loci. Integration of multiple lines of genetic evidence support the functional relevance of the identified POAG risk loci and highlight potential contributions of several genes to POAG pathogenesis, including SVEP1, RERE, VCAM1, ZNF638, CLIC5, SLC2A12, YAP1, MXRA5, and SMAD6. Several drug compounds targeting POAG risk genes may be potential glaucoma therapeutic candidates. Primary open-angle glaucoma (POAG) is highly heritable, yet not well understood from a genetic perspective. Here, the authors perform a meta-analysis of genome-wide association studies in 34,179 POAG cases, identifying 44 previously unreported risk loci and mapping effects across multiple ethnicities.Peer reviewe

    Global Biobank analyses provide lessons for developing polygenic risk scores across diverse cohorts

    No full text
    Polygenic risk scores (PRSs) have been widely explored in precision medicine. However, few studies have thoroughly investigated their best practices in global populations across different diseases. We here utilized data from Global Biobank Meta-analysis Initiative (GBMI) to explore methodological considerations and PRS performance in 9 different biobanks for 14 disease endpoints. Specifically, we constructed PRSs using pruning and thresholding (P + T) and PRS-continuous shrinkage (CS). For both methods, using a European-based linkage disequilibrium (LD) reference panel resulted in comparable or higher prediction accuracy compared with several other non-European-based panels. PRS-CS overall outperformed the classic P + T method, especially for endpoints with higher SNP-based heritability. Notably, prediction accuracy is heterogeneous across endpoints, biobanks, and ancestries, especially for asthma, which has known variation in disease prevalence across populations. Overall, we provide lessons for PRS construction, evaluation, and interpretation using GBMI resources and highlight the importance of best practices for PRS in the biobank-scale genomics era.Peer reviewe
    corecore