40 research outputs found
Determinants of penetrance and variable expressivity in monogenic metabolic conditions across 77,184 exomes
Hundreds of thousands of genetic variants have been reported to cause severe monogenic diseases, but the probability that a variant carrier develops the disease (termed penetrance) is unknown for virtually all of them. Additionally, the clinical utility of common polygenetic variation remains uncertain. Using exome sequencing from 77,184 adult individuals (38,618 multi-ancestral individuals from a type 2 diabetes case-control study and 38,566 participants from the UK Biobank, for whom genotype array data were also available), we apply clinical standard-of-care gene variant curation for eight monogenic metabolic conditions. Rare variants causing monogenic diabetes and dyslipidemias display effect sizes significantly larger than the top 1% of the corresponding polygenic scores. Nevertheless, penetrance estimates for monogenic variant carriers average 60% or lower for most conditions. We assess epidemiologic and genetic factors contributing to risk prediction in monogenic variant carriers, demonstrating that inclusion of polygenic variation significantly improves biomarker estimation for two monogenic dyslipidemias. Penetrance of variants in monogenic disease and clinical utility of common polygenic variation has not been well explored on a large-scale. Here, the authors use exome sequencing data from 77,184 individuals to generate penetrance estimates and assess the utility of polygenic variation in risk prediction of monogenic variants.Peer reviewe
Testing the role of predicted gene knockouts in human anthropometric trait variation
National Heart, Lung, and Blood Institute (NHLBI)
S.L. is funded by a Canadian Institutes of Health Research
Banting doctoral scholarship. G.L. is funded by Genome Canada
and Génome Québec; the Canada Research Chairs program; and
the Montreal Heart Institute Foundation. C.M.L. is supported by
Wellcome Trust (grant numbers 086596/Z/08/Z, 086596/Z/08/A);
and the Li Ka Shing Foundation. N.S. is funded by National Institutes
of Health (grant numbers HL088456, HL111089, HL116747).
The Mount Sinai BioMe Biobank Program is supported by the Andrea
and Charles Bronfman Philanthropies. GO ESP is supported
by NHLBI (RC2 HL-103010 to HeartGO, RC2 HL-102923 to LungGO,
RC2 HL-102924 to WHISP). The ESP exome sequencing was
performed through NHLBI (RC2 HL-102925 to BroadGO, RC2 HL-
102926 to SeattleGO). EGCUT work was supported through the
Estonian Genome Center of University of Tartu by the Targeted
Financing from the Estonian Ministry of Science and Education
(grant number SF0180142s08); the Development Fund of the University
of Tartu (grant number SP1GVARENG); the European Regional
Development Fund to the Centre of Excellence in
Genomics (EXCEGEN) [grant number 3.2.0304.11-0312]; and
through FP7 (grant number 313010). EGCUT were further supported
by the US National Institute of Health (grant number
R01DK075787). A.K.M. was supported by an American Diabetes
Association Mentor-Based Postdoctoral Fellowship (#7-12-MN-
02). The BioVU dataset used in the analyses described were obtained
from Vanderbilt University Medical Centers BioVU which
is supported by institutional funding and by the Vanderbilt CTSA
grant ULTR000445 from NCATS/NIH. Genome-wide genotyping
was funded by NIH grants RC2GM092618 from NIGMS/OD and
U01HG004603 from NHGRI/NIGMS. Funding to pay the Open Access
publication charges for this article was provided by a block
grant from Research Councils UK to the University of Cambridge
Testing the role of predicted gene knockouts in human anthropometric trait variation
National Heart, Lung, and Blood Institute (NHLBI)
S.L. is funded by a Canadian Institutes of Health Research
Banting doctoral scholarship. G.L. is funded by Genome Canada
and Génome Québec; the Canada Research Chairs program; and
the Montreal Heart Institute Foundation. C.M.L. is supported by
Wellcome Trust (grant numbers 086596/Z/08/Z, 086596/Z/08/A);
and the Li Ka Shing Foundation. N.S. is funded by National Institutes
of Health (grant numbers HL088456, HL111089, HL116747).
The Mount Sinai BioMe Biobank Program is supported by the Andrea
and Charles Bronfman Philanthropies. GO ESP is supported
by NHLBI (RC2 HL-103010 to HeartGO, RC2 HL-102923 to LungGO,
RC2 HL-102924 to WHISP). The ESP exome sequencing was
performed through NHLBI (RC2 HL-102925 to BroadGO, RC2 HL-
102926 to SeattleGO). EGCUT work was supported through the
Estonian Genome Center of University of Tartu by the Targeted
Financing from the Estonian Ministry of Science and Education
(grant number SF0180142s08); the Development Fund of the University
of Tartu (grant number SP1GVARENG); the European Regional
Development Fund to the Centre of Excellence in
Genomics (EXCEGEN) [grant number 3.2.0304.11-0312]; and
through FP7 (grant number 313010). EGCUT were further supported
by the US National Institute of Health (grant number
R01DK075787). A.K.M. was supported by an American Diabetes
Association Mentor-Based Postdoctoral Fellowship (#7-12-MN-
02). The BioVU dataset used in the analyses described were obtained
from Vanderbilt University Medical Centers BioVU which
is supported by institutional funding and by the Vanderbilt CTSA
grant ULTR000445 from NCATS/NIH. Genome-wide genotyping
was funded by NIH grants RC2GM092618 from NIGMS/OD and
U01HG004603 from NHGRI/NIGMS. Funding to pay the Open Access
publication charges for this article was provided by a block
grant from Research Councils UK to the University of Cambridge
Rare coding variants in 35 genes associate with circulating lipid levels-A multi-ancestry analysis of 170,000 exomes
Large-scale gene sequencing studies for complex traits have the potential to identify causal genes with therapeutic implications. We performed gene-based association testing of blood lipid levels with rare (minor allele frequency 1%) predicted damaging coding variation by using sequence data from 170,000 individuals from multiple ancestries: 97,493 European, 30,025 South Asian, 16,507 African, 16,440 Hispanic/Latino, 10,420 East Asian, and 1,182 Samoan. We identified 35 genes associated with circulating lipid levels; some of these genes have not been previously associated with lipid levels when using rare coding variation from population-based samples. We prioritize 32 genes in array-based genome-wide association study (GWAS) loci based on aggregations of rare coding variants; three (EVI5, SH2B3, and PLIN1) had no prior association of rare coding variants with lipid levels. Most of our associated genes showed evidence of association among multiple ancestries. Finally, we observed an enrichment of gene-based associations for low-density lipoprotein cholesterol drug target genes and for genes closest to GWAS index single-nucleotide polymorphisms (SNPs). Our results demonstrate that gene-based associations can be beneficial for drug target development and provide evidence that the gene closest to the array-based GWAS index SNP is often the functional gene for blood lipid levels.Peer reviewe
Testing the role of predicted gene knockouts in human anthropometric trait variation
National Heart, Lung, and Blood Institute (NHLBI)
S.L. is funded by a Canadian Institutes of Health Research
Banting doctoral scholarship. G.L. is funded by Genome Canada
and Génome Québec; the Canada Research Chairs program; and
the Montreal Heart Institute Foundation. C.M.L. is supported by
Wellcome Trust (grant numbers 086596/Z/08/Z, 086596/Z/08/A);
and the Li Ka Shing Foundation. N.S. is funded by National Institutes
of Health (grant numbers HL088456, HL111089, HL116747).
The Mount Sinai BioMe Biobank Program is supported by the Andrea
and Charles Bronfman Philanthropies. GO ESP is supported
by NHLBI (RC2 HL-103010 to HeartGO, RC2 HL-102923 to LungGO,
RC2 HL-102924 to WHISP). The ESP exome sequencing was
performed through NHLBI (RC2 HL-102925 to BroadGO, RC2 HL-
102926 to SeattleGO). EGCUT work was supported through the
Estonian Genome Center of University of Tartu by the Targeted
Financing from the Estonian Ministry of Science and Education
(grant number SF0180142s08); the Development Fund of the University
of Tartu (grant number SP1GVARENG); the European Regional
Development Fund to the Centre of Excellence in
Genomics (EXCEGEN) [grant number 3.2.0304.11-0312]; and
through FP7 (grant number 313010). EGCUT were further supported
by the US National Institute of Health (grant number
R01DK075787). A.K.M. was supported by an American Diabetes
Association Mentor-Based Postdoctoral Fellowship (#7-12-MN-
02). The BioVU dataset used in the analyses described were obtained
from Vanderbilt University Medical Centers BioVU which
is supported by institutional funding and by the Vanderbilt CTSA
grant ULTR000445 from NCATS/NIH. Genome-wide genotyping
was funded by NIH grants RC2GM092618 from NIGMS/OD and
U01HG004603 from NHGRI/NIGMS. Funding to pay the Open Access
publication charges for this article was provided by a block
grant from Research Councils UK to the University of Cambridge
Recommended from our members
Genome-wide trans-ancestry meta-analysis provides insight into the genetic architecture of type 2 diabetes susceptibility.
To further understanding of the genetic basis of type 2 diabetes (T2D) susceptibility, we aggregated published meta-analyses of genome-wide association studies (GWAS), including 26,488 cases and 83,964 controls of European, east Asian, south Asian and Mexican and Mexican American ancestry. We observed a significant excess in the directional consistency of T2D risk alleles across ancestry groups, even at SNPs demonstrating only weak evidence of association. By following up the strongest signals of association from the trans-ethnic meta-analysis in an additional 21,491 cases and 55,647 controls of European ancestry, we identified seven new T2D susceptibility loci. Furthermore, we observed considerable improvements in the fine-mapping resolution of common variant association signals at several T2D susceptibility loci. These observations highlight the benefits of trans-ethnic GWAS for the discovery and characterization of complex trait loci and emphasize an exciting opportunity to extend insight into the genetic architecture and pathogenesis of human diseases across populations of diverse ancestry
Quantifying the Impact of Rare and Ultra-rare Coding Variation across the Phenotypic Spectrum
There is a limited understanding about the impact of rare protein-truncating variants across multiple phenotypes. We explore the impact of this class of variants on 13 quantitative traits and 10 diseases using whole-exome sequencing data from 100,296 individuals. Protein-truncating variants in genes intolerant to this class of mutations increased risk of autism, schizophrenia, bipolar disorder, intellectual disability, and ADHD. In individuals without these disorders, there was an association with shorter height, lower education, increased hospitalization, and reduced age at enrollment. Gene sets implicated from GWASs did not show a significant protein-truncating variants burden beyond what was captured by established Mendelian genes. In conclusion, we provide a thorough investigation of the impact of rare deleterious coding variants on complex traits, suggesting widespread pleiotropic risk.Peer reviewe
Identification and functional characterization of G6PC2 coding variants influencing glycemic traits define an effector transcript at the G6PC2-ABCB11 locus.
Genome wide association studies (GWAS) for fasting glucose (FG) and insulin (FI) have identified common variant signals which explain 4.8% and 1.2% of trait variance, respectively. It is hypothesized that low-frequency and rare variants could contribute substantially to unexplained genetic variance. To test this, we analyzed exome-array data from up to 33,231 non-diabetic individuals of European ancestry. We found exome-wide significant (P<5×10-7) evidence for two loci not previously highlighted by common variant GWAS: GLP1R (p.Ala316Thr, minor allele frequency (MAF)=1.5%) influencing FG levels, and URB2 (p.Glu594Val, MAF = 0.1%) influencing FI levels. Coding variant associations can highlight potential effector genes at (non-coding) GWAS signals. At the G6PC2/ABCB11 locus, we identified multiple coding variants in G6PC2 (p.Val219Leu, p.His177Tyr, and p.Tyr207Ser) influencing FG levels, conditionally independent of each other and the non-coding GWAS signal. In vitro assays demonstrate that these associated coding alleles result in reduced protein abundance via proteasomal degradation, establishing G6PC2 as an effector gene at this locus. Reconciliation of single-variant associations and functional effects was only possible when haplotype phase was considered. In contrast to earlier reports suggesting that, paradoxically, glucose-raising alleles at this locus are protective against type 2 diabetes (T2D), the p.Val219Leu G6PC2 variant displayed a modest but directionally consistent association with T2D risk. Coding variant associations for glycemic traits in GWAS signals highlight PCSK1, RREB1, and ZHX3 as likely effector transcripts. These coding variant association signals do not have a major impact on the trait variance explained, but they do provide valuable biological insights
Identification and Functional Characterization of G6PC2 Coding Variants Influencing Glycemic Traits Define an Effector Transcript at the G6PC2-ABCB11 Locus
Genome wide association studies (GWAS) for fasting glucose (FG) and insulin (FI) have identified common variant signals which explain 4.8% and 1.2% of trait variance, respectively. It is hypothesized that low-frequency and rare variants could contribute substantially to unexplained genetic variance. To test this, we analyzed exome-array data from up to 33,231 non-diabetic individuals of European ancestry. We found exome-wide significant (P<5×10-7) evidence for two loci not previously highlighted by common variant GWAS: GLP1R (p.Ala316Thr, minor allele frequency (MAF)=1.5%) influencing FG levels, and URB2 (p.Glu594Val, MAF = 0.1%) influencing FI levels. Coding variant associations can highlight potential effector genes at (non-coding) GWAS signals. At the G6PC2/ABCB11 locus, we identified multiple coding variants in G6PC2 (p.Val219Leu, p.His177Tyr, and p.Tyr207Ser) influencing FG levels, conditionally independent of each other and the non-coding GWAS signal. In vitro assays demonstrate that these associated coding alleles result in reduced protein abundance via proteasomal degradation, establishing G6PC2 as an effector gene at this locus. Reconciliation of single-variant associations and functional effects was only possible when haplotype phase was considered. In contrast to earlier reports suggesting that, paradoxically, glucose-raising alleles at this locus are protective against type 2 diabetes (T2D), the p.Val219Leu G6PC2 variant displayed a modest but directionally consistent association with T2D risk. Coding variant associations for glycemic traits in GWAS signals highlight PCSK1, RREB1, and ZHX3 as likely effector transcripts. These coding variant association signals do not have a major impact on the trait variance explained, but they do provide valuable biological insights
