28 research outputs found
Quantifying the Impact of Rare and Ultra-rare Coding Variation across the Phenotypic Spectrum
There is a limited understanding about the impact of rare protein-truncating variants across multiple phenotypes. We explore the impact of this class of variants on 13 quantitative traits and 10 diseases using whole-exome sequencing data from 100,296 individuals. Protein-truncating variants in genes intolerant to this class of mutations increased risk of autism, schizophrenia, bipolar disorder, intellectual disability, and ADHD. In individuals without these disorders, there was an association with shorter height, lower education, increased hospitalization, and reduced age at enrollment. Gene sets implicated from GWASs did not show a significant protein-truncating variants burden beyond what was captured by established Mendelian genes. In conclusion, we provide a thorough investigation of the impact of rare deleterious coding variants on complex traits, suggesting widespread pleiotropic risk.Peer reviewe
Testing the role of predicted gene knockouts in human anthropometric trait variation
National Heart, Lung, and Blood Institute (NHLBI)
S.L. is funded by a Canadian Institutes of Health Research
Banting doctoral scholarship. G.L. is funded by Genome Canada
and Génome Québec; the Canada Research Chairs program; and
the Montreal Heart Institute Foundation. C.M.L. is supported by
Wellcome Trust (grant numbers 086596/Z/08/Z, 086596/Z/08/A);
and the Li Ka Shing Foundation. N.S. is funded by National Institutes
of Health (grant numbers HL088456, HL111089, HL116747).
The Mount Sinai BioMe Biobank Program is supported by the Andrea
and Charles Bronfman Philanthropies. GO ESP is supported
by NHLBI (RC2 HL-103010 to HeartGO, RC2 HL-102923 to LungGO,
RC2 HL-102924 to WHISP). The ESP exome sequencing was
performed through NHLBI (RC2 HL-102925 to BroadGO, RC2 HL-
102926 to SeattleGO). EGCUT work was supported through the
Estonian Genome Center of University of Tartu by the Targeted
Financing from the Estonian Ministry of Science and Education
(grant number SF0180142s08); the Development Fund of the University
of Tartu (grant number SP1GVARENG); the European Regional
Development Fund to the Centre of Excellence in
Genomics (EXCEGEN) [grant number 3.2.0304.11-0312]; and
through FP7 (grant number 313010). EGCUT were further supported
by the US National Institute of Health (grant number
R01DK075787). A.K.M. was supported by an American Diabetes
Association Mentor-Based Postdoctoral Fellowship (#7-12-MN-
02). The BioVU dataset used in the analyses described were obtained
from Vanderbilt University Medical Centers BioVU which
is supported by institutional funding and by the Vanderbilt CTSA
grant ULTR000445 from NCATS/NIH. Genome-wide genotyping
was funded by NIH grants RC2GM092618 from NIGMS/OD and
U01HG004603 from NHGRI/NIGMS. Funding to pay the Open Access
publication charges for this article was provided by a block
grant from Research Councils UK to the University of Cambridge
Testing the role of predicted gene knockouts in human anthropometric trait variation
National Heart, Lung, and Blood Institute (NHLBI)
S.L. is funded by a Canadian Institutes of Health Research
Banting doctoral scholarship. G.L. is funded by Genome Canada
and Génome Québec; the Canada Research Chairs program; and
the Montreal Heart Institute Foundation. C.M.L. is supported by
Wellcome Trust (grant numbers 086596/Z/08/Z, 086596/Z/08/A);
and the Li Ka Shing Foundation. N.S. is funded by National Institutes
of Health (grant numbers HL088456, HL111089, HL116747).
The Mount Sinai BioMe Biobank Program is supported by the Andrea
and Charles Bronfman Philanthropies. GO ESP is supported
by NHLBI (RC2 HL-103010 to HeartGO, RC2 HL-102923 to LungGO,
RC2 HL-102924 to WHISP). The ESP exome sequencing was
performed through NHLBI (RC2 HL-102925 to BroadGO, RC2 HL-
102926 to SeattleGO). EGCUT work was supported through the
Estonian Genome Center of University of Tartu by the Targeted
Financing from the Estonian Ministry of Science and Education
(grant number SF0180142s08); the Development Fund of the University
of Tartu (grant number SP1GVARENG); the European Regional
Development Fund to the Centre of Excellence in
Genomics (EXCEGEN) [grant number 3.2.0304.11-0312]; and
through FP7 (grant number 313010). EGCUT were further supported
by the US National Institute of Health (grant number
R01DK075787). A.K.M. was supported by an American Diabetes
Association Mentor-Based Postdoctoral Fellowship (#7-12-MN-
02). The BioVU dataset used in the analyses described were obtained
from Vanderbilt University Medical Centers BioVU which
is supported by institutional funding and by the Vanderbilt CTSA
grant ULTR000445 from NCATS/NIH. Genome-wide genotyping
was funded by NIH grants RC2GM092618 from NIGMS/OD and
U01HG004603 from NHGRI/NIGMS. Funding to pay the Open Access
publication charges for this article was provided by a block
grant from Research Councils UK to the University of Cambridge
Testing the role of predicted gene knockouts in human anthropometric trait variation
National Heart, Lung, and Blood Institute (NHLBI)
S.L. is funded by a Canadian Institutes of Health Research
Banting doctoral scholarship. G.L. is funded by Genome Canada
and Génome Québec; the Canada Research Chairs program; and
the Montreal Heart Institute Foundation. C.M.L. is supported by
Wellcome Trust (grant numbers 086596/Z/08/Z, 086596/Z/08/A);
and the Li Ka Shing Foundation. N.S. is funded by National Institutes
of Health (grant numbers HL088456, HL111089, HL116747).
The Mount Sinai BioMe Biobank Program is supported by the Andrea
and Charles Bronfman Philanthropies. GO ESP is supported
by NHLBI (RC2 HL-103010 to HeartGO, RC2 HL-102923 to LungGO,
RC2 HL-102924 to WHISP). The ESP exome sequencing was
performed through NHLBI (RC2 HL-102925 to BroadGO, RC2 HL-
102926 to SeattleGO). EGCUT work was supported through the
Estonian Genome Center of University of Tartu by the Targeted
Financing from the Estonian Ministry of Science and Education
(grant number SF0180142s08); the Development Fund of the University
of Tartu (grant number SP1GVARENG); the European Regional
Development Fund to the Centre of Excellence in
Genomics (EXCEGEN) [grant number 3.2.0304.11-0312]; and
through FP7 (grant number 313010). EGCUT were further supported
by the US National Institute of Health (grant number
R01DK075787). A.K.M. was supported by an American Diabetes
Association Mentor-Based Postdoctoral Fellowship (#7-12-MN-
02). The BioVU dataset used in the analyses described were obtained
from Vanderbilt University Medical Centers BioVU which
is supported by institutional funding and by the Vanderbilt CTSA
grant ULTR000445 from NCATS/NIH. Genome-wide genotyping
was funded by NIH grants RC2GM092618 from NIGMS/OD and
U01HG004603 from NHGRI/NIGMS. Funding to pay the Open Access
publication charges for this article was provided by a block
grant from Research Councils UK to the University of Cambridge
Identification and functional characterization of G6PC2 coding variants influencing glycemic traits define an effector transcript at the G6PC2-ABCB11 locus.
Genome wide association studies (GWAS) for fasting glucose (FG) and insulin (FI) have identified common variant signals which explain 4.8% and 1.2% of trait variance, respectively. It is hypothesized that low-frequency and rare variants could contribute substantially to unexplained genetic variance. To test this, we analyzed exome-array data from up to 33,231 non-diabetic individuals of European ancestry. We found exome-wide significant (P<5×10-7) evidence for two loci not previously highlighted by common variant GWAS: GLP1R (p.Ala316Thr, minor allele frequency (MAF)=1.5%) influencing FG levels, and URB2 (p.Glu594Val, MAF = 0.1%) influencing FI levels. Coding variant associations can highlight potential effector genes at (non-coding) GWAS signals. At the G6PC2/ABCB11 locus, we identified multiple coding variants in G6PC2 (p.Val219Leu, p.His177Tyr, and p.Tyr207Ser) influencing FG levels, conditionally independent of each other and the non-coding GWAS signal. In vitro assays demonstrate that these associated coding alleles result in reduced protein abundance via proteasomal degradation, establishing G6PC2 as an effector gene at this locus. Reconciliation of single-variant associations and functional effects was only possible when haplotype phase was considered. In contrast to earlier reports suggesting that, paradoxically, glucose-raising alleles at this locus are protective against type 2 diabetes (T2D), the p.Val219Leu G6PC2 variant displayed a modest but directionally consistent association with T2D risk. Coding variant associations for glycemic traits in GWAS signals highlight PCSK1, RREB1, and ZHX3 as likely effector transcripts. These coding variant association signals do not have a major impact on the trait variance explained, but they do provide valuable biological insights
Identification and Functional Characterization of G6PC2 Coding Variants Influencing Glycemic Traits Define an Effector Transcript at the G6PC2-ABCB11 Locus
Genome wide association studies (GWAS) for fasting glucose (FG) and insulin (FI) have identified common variant signals which explain 4.8% and 1.2% of trait variance, respectively. It is hypothesized that low-frequency and rare variants could contribute substantially to unexplained genetic variance. To test this, we analyzed exome-array data from up to 33,231 non-diabetic individuals of European ancestry. We found exome-wide significant (P<5×10-7) evidence for two loci not previously highlighted by common variant GWAS: GLP1R (p.Ala316Thr, minor allele frequency (MAF)=1.5%) influencing FG levels, and URB2 (p.Glu594Val, MAF = 0.1%) influencing FI levels. Coding variant associations can highlight potential effector genes at (non-coding) GWAS signals. At the G6PC2/ABCB11 locus, we identified multiple coding variants in G6PC2 (p.Val219Leu, p.His177Tyr, and p.Tyr207Ser) influencing FG levels, conditionally independent of each other and the non-coding GWAS signal. In vitro assays demonstrate that these associated coding alleles result in reduced protein abundance via proteasomal degradation, establishing G6PC2 as an effector gene at this locus. Reconciliation of single-variant associations and functional effects was only possible when haplotype phase was considered. In contrast to earlier reports suggesting that, paradoxically, glucose-raising alleles at this locus are protective against type 2 diabetes (T2D), the p.Val219Leu G6PC2 variant displayed a modest but directionally consistent association with T2D risk. Coding variant associations for glycemic traits in GWAS signals highlight PCSK1, RREB1, and ZHX3 as likely effector transcripts. These coding variant association signals do not have a major impact on the trait variance explained, but they do provide valuable biological insights
Evaluating the calibration and power of three gene-based association tests of rare variants for the X chromosome.
Although genome-wide association studies (GWAS) have identified thousands of trait-associated genetic variants, there are relatively few findings on the X chromosome. For analysis of low-frequency variants (minor allele frequency <5%), investigators can use region- or gene-based tests where multiple variants are analyzed jointly to increase power. To date, there are no gene-based tests designed for association testing of low-frequency variants on the X chromosome. Here we propose three gene-based tests for the X chromosome: burden, sequence kernel association test (SKAT), and optimal unified SKAT (SKAT-O). Using simulated case-control and quantitative trait (QT) data, we evaluate the calibration and power of these tests as a function of (1) male:female sample size ratio; and (2) coding of haploid male genotypes for variants under X-inactivation. For case-control studies, all three tests are reasonably well-calibrated for all scenarios we evaluated. As expected, power for gene-based tests depends on the underlying genetic architecture of the genomic region analyzed. Studies with more (haploid) males are generally less powerful due to decreased number of chromosomes. Power generally is slightly greater when the coding scheme for male genotypes matches the true underlying model, but the power loss for misspecifying the (generally unknown) model is small. For QT studies, type I error and power results largely mirror those for binary traits. We demonstrate the use of these three gene-based tests for X-chromosome association analysis in simulated data and sequencing data from the Genetics of Type 2 Diabetes (GoT2D) study
Evaluating empirical bounds on complex disease genetic architecture.
The genetic architecture of human diseases governs the success of genetic mapping and the future of personalized medicine. Although numerous studies have queried the genetic basis of common disease, contradictory hypotheses have been advocated about features of genetic architecture (for example, the contribution of rare versus common variants). We developed an integrated simulation framework, calibrated to empirical data, to enable the systematic evaluation of such hypotheses. For type 2 diabetes (T2D), two simple parameters--(i) the target size for causal mutation and (ii) the coupling between selection and phenotypic effect--define a broad space of architectures. Whereas extreme models are excluded by the combination of epidemiology, linkage and genome-wide association studies, many models remain consistent, including those where rare variants explain either little (<25%) or most (>80%) of T2D heritability. Ongoing sequencing and genotyping studies will further constrain the space of possible architectures, but very large samples (for example, >250,000 unselected individuals) will be required to localize most of the heritability underlying T2D and other traits characterized by these models
