27 research outputs found

    The effect of rare variants on inflation of the test statistics in case-control analyses.

    BACKGROUND: The detection of bias due to cryptic population structure is an important step in the evaluation of findings of genetic association studies. The standard method of measuring this bias in a genetic association study is to compare the observed median association test statistic to the expected median test statistic. This ratio is inflated in the presence of cryptic population structure. However, inflation may also be caused by the properties of the association test itself, particularly in the analysis of rare variants. We compared the properties of the three most commonly used association tests (the likelihood ratio test, the Wald test and the score test) when testing rare variants for association using simulated data. RESULTS: We found evidence of inflation in the median test statistics of the likelihood ratio and score tests for tests of variants with fewer than 20 heterozygotes across the sample, regardless of the total sample size. The test statistics for the Wald test were under-inflated at the median for variants below the same minor allele frequency. CONCLUSIONS: In a genetic association study, if a substantial proportion of the genetic variants tested have rare minor allele frequencies, the properties of the association test may mask the presence or absence of bias due to population structure. The use of either the likelihood ratio test or the score test is likely to lead to inflation in the median test statistic in the absence of population structure. In contrast, the use of the Wald test is likely to result in under-inflation of the median test statistic, which may mask the presence of population structure. This work was supported by a grant from Cancer Research UK (C490/A16561). AP is funded by a Medical Research Council studentship. This is the final published version. It first appeared at http://dx.doi.org/10.1186%2Fs12859-015-0496-1
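The inflation measure described in this abstract (observed median test statistic divided by the expected median) can be sketched as follows. This is a minimal illustration only, assuming the association statistics follow a chi-square distribution with 1 degree of freedom under the null; the function name `genomic_inflation` is hypothetical and does not come from the paper.

```python
from statistics import NormalDist, median


def genomic_inflation(chi2_stats):
    """Ratio of the observed median statistic to the expected null median.

    Assumption: each statistic is a 1-df chi-square under the null, so the
    expected median equals the square of the 75th percentile of a standard
    normal (about 0.455).
    """
    expected_median = NormalDist().inv_cdf(0.75) ** 2
    return median(chi2_stats) / expected_median
```

A ratio near 1 is what one would expect with no inflation; the abstract's point is that, for rare variants, the choice of test alone may push this ratio above or below 1.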

    Rare coding variants and X-linked loci associated with age at menarche.

    More than 100 loci have been identified for age at menarche by genome-wide association studies; however, collectively these explain only ∼3% of the trait variance. Here we test two overlooked sources of variation in 192,974 European ancestry women: low-frequency protein-coding variants and X-chromosome variants. Five missense/nonsense variants (in ALMS1/LAMB2/TNRC6A/TACR3/PRKAG1) are associated with age at menarche (minor allele frequencies 0.08-4.6%; effect sizes 0.08-1.25 years per allele; P<5 × 10(-8)). In addition, we identify common X-chromosome loci at IGSF1 (rs762080, P=9.4 × 10(-13)) and FAAH2 (rs5914101, P=4.9 × 10(-10)). Highlighted genes implicate cellular energy homeostasis, post-transcriptional gene silencing and fatty-acid amide signalling. A frequently reported mutation in TACR3 for idiopathic hypogonadotropic hypogonadism (p.W275X) is associated with 1.25-year-later menarche (P=2.8 × 10(-11)), illustrating the utility of population studies to estimate the penetrance of reportedly pathogenic mutations. Collectively, these novel variants explain ∼0.5% variance, indicating that these overlooked sources of variation do not substantially explain the 'missing heritability' of this complex trait. UK sponsors (see article for overseas ones): This work made use of data and samples generated by the 1958 Birth Cohort (NCDS). Access to these resources was enabled via the 58READIE Project funded by Wellcome Trust and Medical Research Council (grant numbers WT095219MA and G1001799). A full list of the financial, institutional and personal contributions to the development of the 1958 Birth Cohort Biomedical resource is available at http://www2.le.ac.uk/projects/birthcohort. Genotyping was undertaken as part of the Wellcome Trust Case-Control Consortium (WTCCC) under Wellcome Trust award 076113, and a full list of the investigators who contributed to the generation of the data is available at www.wtccc.org.uk ...
The Fenland Study is funded by the Wellcome Trust and the Medical Research Council, as well as by the Support for Science Funding programme and CamStrad. ... SIBS - CRUK ref: C1287/A8459. SEARCH - CRUK ref: A490/A10124. EMBRACE is supported by Cancer Research UK Grants C1287/A10118, C1287/A16563 and C1287/A17523. Genotyping was supported by Cancer Research UK grants C12292/A11174D and C8197/A16565. Gareth Evans and Fiona Lalloo are supported by an NIHR grant to the Biomedical Research Centre, Manchester. The Investigators at The Institute of Cancer Research and The Royal Marsden NHS Foundation Trust are supported by an NIHR grant to the Biomedical Research Centre at The Institute of Cancer Research and The Royal Marsden NHS Foundation Trust. Ros Eeles and Elizabeth Bancroft are supported by Cancer Research UK Grant C5047/A8385. ... Generation Scotland - Scottish Executive Health Department, Chief Scientist Office, grant number CZD/16/6. Exome array genotyping for GS:SFHS was funded by the Medical Research Council UK. 23andMe - This work was supported in part by NIH Award 2R44HG006981-02 from the National Human Genome Research Institute. This is the final version of the article. It first appeared from NPG via http://dx.doi.org/10.1038/ncomms875

    Rare and low-frequency coding variants alter human adult height

    Height is a highly heritable, classic polygenic trait with ~700 common associated variants identified so far through genome-wide association studies. Here, we report 83 height-associated coding variants with lower minor allele frequencies (range of 0.1-4.8%) and effects of up to 2 cm/allele (e.g. in IHH, STC2, AR and CRISPLD2), >10 times the average effect of common variants. In functional follow-up studies, rare height-increasing alleles of STC2 (+1-2 cm/allele) compromised proteolytic inhibition of PAPP-A and increased cleavage of IGFBP-4 in vitro, resulting in higher bioavailability of insulin-like growth factors. These 83 height-associated variants overlap genes mutated in monogenic growth disorders and highlight new biological candidates (e.g. ADAMTS3, IL11RA, NOX4) and pathways (e.g. proteoglycan/glycosaminoglycan synthesis) involved in growth. Our results demonstrate that sufficiently large sample sizes can uncover rare and low-frequency variants of moderate to large effect associated with polygenic human phenotypes, and that these variants implicate relevant genes and pathways.

    Exome genotyping arrays to identify rare and low frequency variants associated with epithelial ovarian cancer risk

    Rare and low frequency variants are not well covered in most germline genotyping arrays and are understudied in relation to epithelial ovarian cancer (EOC) risk. To address this gap, we used genotyping arrays targeting rarer protein-coding variation in 8,165 EOC cases and 11,619 controls from the international Ovarian Cancer Association Consortium (OCAC). Pooled association analyses were conducted at the variant and gene level for 98,543 variants directly genotyped through two exome genotyping projects. Only common variants that represent or are in strong linkage disequilibrium (LD) with previously-identified signals at established loci reached traditional thresholds for exome-wide significance (P < 5.0 × 10(-7)). One of the most significant signals (P(all histologies) = 1.01 × 10(-13); P(serous) = 3.54 × 10(-14)) occurred at 3q25.31 for rs62273959, a missense variant mapping to the LEKR1 gene that is in LD (r² = 0.90) with a previously identified 'best hit' (rs7651446) mapping to an intron of TIPARP. Suggestive associations (5.0 × 10(-5) > P ≥ 5.0 × 10(-7)) were detected for rare and low-frequency variants at 16 novel loci. Four rare missense variants were identified (ACTBL2 rs73757391 (5q11.2), BTD rs200337373 (3p25.1), KRT13 rs150321809 (17q21.2) and MC2R rs104894658 (18p11.21)), but only MC2R rs104894668 had a large effect size (OR = 9.66). Genes most strongly associated with EOC risk included ACTBL2 (P(AML) = 3.23 × 10(-5); P(SKAT-o) = 9.23 × 10(-4)) and KRT13 (P(AML) = 1.67 × 10(-4); P(SKAT-o) = 1.07 × 10(-5)), reaffirming variant-level analysis. In summary, this large study identified several rare and low-frequency variants and genes that may contribute to EOC susceptibility, albeit with possible small effects. Future studies that integrate epidemiology, sequencing, and functional assays are needed to further unravel the unexplained heritability and biology of this disease.

    Common germline polymorphisms associated with breast cancer-specific survival

    Introduction: Previous studies have identified common germline variants nominally associated with breast cancer survival. These associations have not been widely replicated in further studies. The purpose of this study was to evaluate the association of previously reported SNPs with breast cancer-specific survival using data from a pooled analysis of eight breast cancer survival genome-wide association studies (GWAS) from the Breast Cancer Association Consortium. Methods: A literature review was conducted of all previously published associations between common germline variants and three survival outcomes: breast cancer-specific survival, overall survival and disease-free survival. All associations that reached the nominal significance level of P <0.05 were included. Single nucleotide polymorphisms that had been previously reported as nominally associated with at least one survival outcome were evaluated in the pooled analysis of over 37,000 breast cancer cases for association with breast cancer-specific survival. Previous associations were evaluated using a one-sided test based on the reported direction of effect. Results: Fifty-six variants from 45 previous publications were evaluated in the meta-analysis. Fifty-four of these were evaluated in the full set of 37,954 breast cancer cases with 2,900 events, and the two additional variants were evaluated in a reduced sample size of 30,000 samples in order to ensure independence from the previously published studies. Five variants reached nominal significance (P <0.05) in the pooled GWAS data compared to 2.8 expected under the null hypothesis. Seven additional variants were associated (P <0.05) with ER-positive disease. Conclusions: Although no variants reached genome-wide significance (P <5 × 10(-8)), these results suggest that there is some evidence of association between candidate common germline variants and breast cancer prognosis. Larger studies from multinational collaborations are necessary to increase the power to detect associations between common variants and prognosis at more stringent significance levels.
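The figure of 2.8 variants expected under the null hypothesis follows from 56 tests at a significance level of 0.05 (56 × 0.05 = 2.8). The sketch below reproduces that arithmetic and, as an illustration only, computes a binomial upper-tail probability for seeing at least 5 nominally significant results. It assumes the 56 tests are independent, which the abstract does not state; the helper `binom_tail` and the variable names are hypothetical.

```python
from math import comb


def binom_tail(n, p, k):
    """P(X >= k) for X ~ Binomial(n, p), assuming independent tests."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


n_variants = 56   # variants evaluated (from the abstract)
alpha = 0.05      # nominal significance level (from the abstract)
observed = 5      # variants reaching nominal significance (from the abstract)

# Expected number of nominally significant variants if none is truly associated.
expected = n_variants * alpha

# Chance of observing at least `observed` hits under the independence assumption.
tail_probability = binom_tail(n_variants, alpha, observed)
```

Correlation between variants or between the contributing studies would change the tail probability, so it is best read as a rough guide and not as a formal test.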
