25 research outputs found
Quantifying the Impact of Rare and Ultra-rare Coding Variation across the Phenotypic Spectrum
There is a limited understanding about the impact of rare protein-truncating variants across multiple phenotypes. We explore the impact of this class of variants on 13 quantitative traits and 10 diseases using whole-exome sequencing data from 100,296 individuals. Protein-truncating variants in genes intolerant to this class of mutations increased risk of autism, schizophrenia, bipolar disorder, intellectual disability, and ADHD. In individuals without these disorders, there was an association with shorter height, lower education, increased hospitalization, and reduced age at enrollment. Gene sets implicated from GWASs did not show a significant protein-truncating variants burden beyond what was captured by established Mendelian genes. In conclusion, we provide a thorough investigation of the impact of rare deleterious coding variants on complex traits, suggesting widespread pleiotropic risk.Peer reviewe
Testing the role of predicted gene knockouts in human anthropometric trait variation
National Heart, Lung, and Blood Institute (NHLBI)
S.L. is funded by a Canadian Institutes of Health Research
Banting doctoral scholarship. G.L. is funded by Genome Canada
and Génome Québec; the Canada Research Chairs program; and
the Montreal Heart Institute Foundation. C.M.L. is supported by
Wellcome Trust (grant numbers 086596/Z/08/Z, 086596/Z/08/A);
and the Li Ka Shing Foundation. N.S. is funded by National Institutes
of Health (grant numbers HL088456, HL111089, HL116747).
The Mount Sinai BioMe Biobank Program is supported by the Andrea
and Charles Bronfman Philanthropies. GO ESP is supported
by NHLBI (RC2 HL-103010 to HeartGO, RC2 HL-102923 to LungGO,
RC2 HL-102924 to WHISP). The ESP exome sequencing was
performed through NHLBI (RC2 HL-102925 to BroadGO, RC2 HL-
102926 to SeattleGO). EGCUT work was supported through the
Estonian Genome Center of University of Tartu by the Targeted
Financing from the Estonian Ministry of Science and Education
(grant number SF0180142s08); the Development Fund of the University
of Tartu (grant number SP1GVARENG); the European Regional
Development Fund to the Centre of Excellence in
Genomics (EXCEGEN) [grant number 3.2.0304.11-0312]; and
through FP7 (grant number 313010). EGCUT were further supported
by the US National Institute of Health (grant number
R01DK075787). A.K.M. was supported by an American Diabetes
Association Mentor-Based Postdoctoral Fellowship (#7-12-MN-
02). The BioVU dataset used in the analyses described were obtained
from Vanderbilt University Medical Centers BioVU which
is supported by institutional funding and by the Vanderbilt CTSA
grant ULTR000445 from NCATS/NIH. Genome-wide genotyping
was funded by NIH grants RC2GM092618 from NIGMS/OD and
U01HG004603 from NHGRI/NIGMS. Funding to pay the Open Access
publication charges for this article was provided by a block
grant from Research Councils UK to the University of Cambridge
Testing the role of predicted gene knockouts in human anthropometric trait variation
National Heart, Lung, and Blood Institute (NHLBI)
S.L. is funded by a Canadian Institutes of Health Research
Banting doctoral scholarship. G.L. is funded by Genome Canada
and Génome Québec; the Canada Research Chairs program; and
the Montreal Heart Institute Foundation. C.M.L. is supported by
Wellcome Trust (grant numbers 086596/Z/08/Z, 086596/Z/08/A);
and the Li Ka Shing Foundation. N.S. is funded by National Institutes
of Health (grant numbers HL088456, HL111089, HL116747).
The Mount Sinai BioMe Biobank Program is supported by the Andrea
and Charles Bronfman Philanthropies. GO ESP is supported
by NHLBI (RC2 HL-103010 to HeartGO, RC2 HL-102923 to LungGO,
RC2 HL-102924 to WHISP). The ESP exome sequencing was
performed through NHLBI (RC2 HL-102925 to BroadGO, RC2 HL-
102926 to SeattleGO). EGCUT work was supported through the
Estonian Genome Center of University of Tartu by the Targeted
Financing from the Estonian Ministry of Science and Education
(grant number SF0180142s08); the Development Fund of the University
of Tartu (grant number SP1GVARENG); the European Regional
Development Fund to the Centre of Excellence in
Genomics (EXCEGEN) [grant number 3.2.0304.11-0312]; and
through FP7 (grant number 313010). EGCUT were further supported
by the US National Institute of Health (grant number
R01DK075787). A.K.M. was supported by an American Diabetes
Association Mentor-Based Postdoctoral Fellowship (#7-12-MN-
02). The BioVU dataset used in the analyses described were obtained
from Vanderbilt University Medical Centers BioVU which
is supported by institutional funding and by the Vanderbilt CTSA
grant ULTR000445 from NCATS/NIH. Genome-wide genotyping
was funded by NIH grants RC2GM092618 from NIGMS/OD and
U01HG004603 from NHGRI/NIGMS. Funding to pay the Open Access
publication charges for this article was provided by a block
grant from Research Councils UK to the University of Cambridge
Testing the role of predicted gene knockouts in human anthropometric trait variation
National Heart, Lung, and Blood Institute (NHLBI)
S.L. is funded by a Canadian Institutes of Health Research
Banting doctoral scholarship. G.L. is funded by Genome Canada
and Génome Québec; the Canada Research Chairs program; and
the Montreal Heart Institute Foundation. C.M.L. is supported by
Wellcome Trust (grant numbers 086596/Z/08/Z, 086596/Z/08/A);
and the Li Ka Shing Foundation. N.S. is funded by National Institutes
of Health (grant numbers HL088456, HL111089, HL116747).
The Mount Sinai BioMe Biobank Program is supported by the Andrea
and Charles Bronfman Philanthropies. GO ESP is supported
by NHLBI (RC2 HL-103010 to HeartGO, RC2 HL-102923 to LungGO,
RC2 HL-102924 to WHISP). The ESP exome sequencing was
performed through NHLBI (RC2 HL-102925 to BroadGO, RC2 HL-
102926 to SeattleGO). EGCUT work was supported through the
Estonian Genome Center of University of Tartu by the Targeted
Financing from the Estonian Ministry of Science and Education
(grant number SF0180142s08); the Development Fund of the University
of Tartu (grant number SP1GVARENG); the European Regional
Development Fund to the Centre of Excellence in
Genomics (EXCEGEN) [grant number 3.2.0304.11-0312]; and
through FP7 (grant number 313010). EGCUT were further supported
by the US National Institute of Health (grant number
R01DK075787). A.K.M. was supported by an American Diabetes
Association Mentor-Based Postdoctoral Fellowship (#7-12-MN-
02). The BioVU dataset used in the analyses described were obtained
from Vanderbilt University Medical Centers BioVU which
is supported by institutional funding and by the Vanderbilt CTSA
grant ULTR000445 from NCATS/NIH. Genome-wide genotyping
was funded by NIH grants RC2GM092618 from NIGMS/OD and
U01HG004603 from NHGRI/NIGMS. Funding to pay the Open Access
publication charges for this article was provided by a block
grant from Research Councils UK to the University of Cambridge
Whole-Genome Sequencing Coupled to Imputation Discovers Genetic Signals for Anthropometric Traits
Deep sequence-based imputation can enhance the discovery power of genome-wide association studies by assessing previously unexplored variation across the common-and low-frequency spectra. We applied a hybrid whole-genome sequencing (WGS) and deep imputation approach to examine the broader allelic architecture of 12 anthropometric traits associated with height, body mass, and fat distribution in up to 267,616 individuals. We report 106 genome-wide significant signals that have not been previously identified, including 9 low-frequency variants pointing to functional candidates. Of the 106 signals, 6 are in genomic regions that have not been implicated with related traits before, 28 are independent signals at previously reported regions, and 72 represent previously reported signals for a different anthropometric trait. 71% of signals reside within genes and fine mapping resolves 23 signals to one or two likely causal variants. We confirm genetic overlap between human monogenic and polygenic anthropometric traits and find signal enrichment in cis expression QTLs in relevant tissues. Our results highlight the potential of WGS strategies to enhance biologically relevant discoveries across the frequency spectrum.Peer reviewe
Identification and Functional Characterization of G6PC2 Coding Variants Influencing Glycemic Traits Define an Effector Transcript at the G6PC2-ABCB11 Locus
Genome wide association studies (GWAS) for fasting glucose (FG) and insulin (FI) have identified common variant signals which explain 4.8% and 1.2% of trait variance, respectively. It is hypothesized that low-frequency and rare variants could contribute substantially to unexplained genetic variance. To test this, we analyzed exome-array data from up to 33,231 non-diabetic individuals of European ancestry. We found exome-wide significant (P<5×10-7) evidence for two loci not previously highlighted by common variant GWAS: GLP1R (p.Ala316Thr, minor allele frequency (MAF)=1.5%) influencing FG levels, and URB2 (p.Glu594Val, MAF = 0.1%) influencing FI levels. Coding variant associations can highlight potential effector genes at (non-coding) GWAS signals. At the G6PC2/ABCB11 locus, we identified multiple coding variants in G6PC2 (p.Val219Leu, p.His177Tyr, and p.Tyr207Ser) influencing FG levels, conditionally independent of each other and the non-coding GWAS signal. In vitro assays demonstrate that these associated coding alleles result in reduced protein abundance via proteasomal degradation, establishing G6PC2 as an effector gene at this locus. Reconciliation of single-variant associations and functional effects was only possible when haplotype phase was considered. In contrast to earlier reports suggesting that, paradoxically, glucose-raising alleles at this locus are protective against type 2 diabetes (T2D), the p.Val219Leu G6PC2 variant displayed a modest but directionally consistent association with T2D risk. Coding variant associations for glycemic traits in GWAS signals highlight PCSK1, RREB1, and ZHX3 as likely effector transcripts. These coding variant association signals do not have a major impact on the trait variance explained, but they do provide valuable biological insights
Identification and functional characterization of G6PC2 coding variants influencing glycemic traits define an effector transcript at the G6PC2-ABCB11 locus.
Genome wide association studies (GWAS) for fasting glucose (FG) and insulin (FI) have identified common variant signals which explain 4.8% and 1.2% of trait variance, respectively. It is hypothesized that low-frequency and rare variants could contribute substantially to unexplained genetic variance. To test this, we analyzed exome-array data from up to 33,231 non-diabetic individuals of European ancestry. We found exome-wide significant (P<5Ă10-7) evidence for two loci not previously highlighted by common variant GWAS: GLP1R (p.Ala316Thr, minor allele frequency (MAF)=1.5%) influencing FG levels, and URB2 (p.Glu594Val, MAF = 0.1%) influencing FI levels. Coding variant associations can highlight potential effector genes at (non-coding) GWAS signals. At the G6PC2/ABCB11 locus, we identified multiple coding variants in G6PC2 (p.Val219Leu, p.His177Tyr, and p.Tyr207Ser) influencing FG levels, conditionally independent of each other and the non-coding GWAS signal. In vitro assays demonstrate that these associated coding alleles result in reduced protein abundance via proteasomal degradation, establishing G6PC2 as an effector gene at this locus. Reconciliation of single-variant associations and functional effects was only possible when haplotype phase was considered. In contrast to earlier reports suggesting that, paradoxically, glucose-raising alleles at this locus are protective against type 2 diabetes (T2D), the p.Val219Leu G6PC2 variant displayed a modest but directionally consistent association with T2D risk. Coding variant associations for glycemic traits in GWAS signals highlight PCSK1, RREB1, and ZHX3 as likely effector transcripts. These coding variant association signals do not have a major impact on the trait variance explained, but they do provide valuable biological insights
Evaluating the calibration and power of three gene-based association tests of rare variants for the X chromosome.
Although genome-wide association studies (GWAS) have identified thousands of trait-associated genetic variants, there are relatively few findings on the X chromosome. For analysis of low-frequency variants (minor allele frequency <5%), investigators can use region- or gene-based tests where multiple variants are analyzed jointly to increase power. To date, there are no gene-based tests designed for association testing of low-frequency variants on the X chromosome. Here we propose three gene-based tests for the X chromosome: burden, sequence kernel association test (SKAT), and optimal unified SKAT (SKAT-O). Using simulated case-control and quantitative trait (QT) data, we evaluate the calibration and power of these tests as a function of (1) male:female sample size ratio; and (2) coding of haploid male genotypes for variants under X-inactivation. For case-control studies, all three tests are reasonably well-calibrated for all scenarios we evaluated. As expected, power for gene-based tests depends on the underlying genetic architecture of the genomic region analyzed. Studies with more (haploid) males are generally less powerful due to decreased number of chromosomes. Power generally is slightly greater when the coding scheme for male genotypes matches the true underlying model, but the power loss for misspecifying the (generally unknown) model is small. For QT studies, type I error and power results largely mirror those for binary traits. We demonstrate the use of these three gene-based tests for X-chromosome association analysis in simulated data and sequencing data from the Genetics of Type 2 Diabetes (GoT2D) study