479 research outputs found

    Replication of LDL SWAs hits in PROSPER/PHASE as validation for future (pharmaco)genetic analyses

    Get PDF
    <p><b>Background:</b> The PHArmacogenetic study of Statins in the Elderly at risk (PHASE) is a genome wide association study in the PROspective Study of Pravastatin in the Elderly at risk for vascular disease (PROSPER) that investigates the genetic variation responsible for the individual variation in drug response to pravastatin. Statins lower LDL-cholesterol in general by 30%, however not in all subjects. Moreover, clinical response is highly variable and adverse effects occur in a minority of patients. In this report we first describe the rationale of the PROSPER/PHASE project and second show that the PROSPER/PHASE study can be used to study pharmacogenetics in the elderly.</p> <p><b>Methods:</b> The genome wide association study (GWAS) was conducted using the Illumina 660K-Quad beadchips following manufacturer's instructions. After a stringent quality control 557,192 SNPs in 5,244 subjects were available for analysis. To maximize the availability of genetic data and coverage of the genome, imputation up to 2.5 million autosomal CEPH HapMap SNPs was performed with MACH imputation software. The GWAS for LDL-cholesterol is assessed with an additive linear regression model in PROBABEL software, adjusted for age, sex, and country of origin to account for population stratification.</p> <p><b>Results:</b> Forty-two SNPs reached the GWAS significant threshold of p = 5.0e-08 in 5 genomic loci (APOE/APOC1; LDLR; FADS2/FEN1; HMGCR; PSRC1/CELSR5). The top SNP (rs445925, chromosome 19) with a p-value of p = 2.8e-30 is located within the APOC1 gene and near the APOE gene. The second top SNP (rs6511720, chromosome 19) with a p-value of p = 5.22e-15 is located within the LDLR gene. All 5 genomic loci were previously associated with LDL-cholesterol levels, no novel loci were identified. Replication in WOSCOPS and CARE confirmed our results.</p> <p><b>Conclusion:</b> With the GWAS in the PROSPER/PHASE study we confirm the previously found genetic associations with LDL-cholesterol levels. With this proof-of-principle study we show that the PROSPER/PHASE study can be used to investigate genetic associations in a similar way to population based studies. The next step of the PROSPER/PHASE study is to identify the genetic variation responsible for the variation in LDL-cholesterol lowering in response to statin treatment in collaboration with other large trials.</p&gt

    Using LASSO regression to detect predictive aggregate effects in genetic studies

    Get PDF
    We use least absolute shrinkage and selection operator (LASSO) regression to select genetic markers and phenotypic features that are most informative with respect to a trait of interest. We compare several strategies for applying LASSO methods in risk prediction models, using the Genetic Analysis Workshop 17 exome simulation data consisting of 697 individuals with information on genotypic and phenotypic features (smoking, age, sex) in 5-fold cross-validated fashion. The cross-validated averages of the area under the receiver operating curve range from 0.45 to 0.63 for different strategies using only genotypic markers. The same values are improved to 0.69–0.87 when both genotypic and phenotypic information are used. The ability of the LASSO method to find true causal markers is limited, but the method was able to discover several common variants (e.g., FLT1) under certain conditions

    Multi-locus Test Conditional on Confirmed Effects Leads to Increased Power in Genome-wide Association Studies

    Get PDF
    Complex diseases or phenotypes may involve multiple genetic variants and interactions between genetic, environmental and other factors. Current genome-wide association studies (GWAS) mostly used single-locus analysis and had identified genetic effects with multiple confirmations. Such confirmed single-nucleotide polymorphism (SNP) effects were likely to be true genetic effects and ignoring this information in testing new effects of the same phenotype results in decreased statistical power due to increased residual variance that has a component of the omitted effects. In this study, a multi-locus association test (MLT) was proposed for GWAS analysis conditional on SNPs with confirmed effects to improve statistical power. Analytical formulae for statistical power were derived and were verified by simulation for MLT accounting for confirmed SNPs and for single-locus test (SLT) without accounting for confirmed SNPs. Statistical power of the two methods was compared by case studies with simulated and the Framingham Heart Study (FHS) GWAS data. Results showed that the MLT method had increased statistical power over SLT. In the GWAS case study on four cholesterol phenotypes and serum metabolites, the MLT method improved statistical power by 5% to 38% depending on the number and effect sizes of the conditional SNPs. For the analysis of HDL cholesterol (HDL-C) and total cholesterol (TC) of the FHS data, the MLT method conditional on confirmed SNPs from GWAS catalog and NCBI had considerably more significant results than SLT

    Novel genetic analysis for case-control genome-wide association studies: quantification of power and genomic prediction accuracy

    Get PDF
    Genome-wide association studies (GWAS) are routinely conducted for both quantitative and binary (disease) traits. We present two analytical tools for use in the experimental design of GWAS. Firstly, we present power calculations quantifying power in a unified framework for a range of scenarios. In this context we consider the utility of quantitative scores (e.g. endophenotypes) that may be available on cases only or both cases and controls. Secondly, we consider, the accuracy of prediction of genetic risk from genome-wide SNPs and derive an expression for genomic prediction accuracy using a liability threshold model for disease traits in a case-control design. The expected values based on our derived equations for both power and prediction accuracy agree well with observed estimates from simulations

    Genome-wide association study of Alzheimer's disease

    Get PDF
    In addition to apolipoprotein E (APOE), recent large genome-wide association studies (GWASs) have identified nine other genes/loci (CR1, BIN1, CLU, PICALM, MS4A4/MS4A6E, CD2AP, CD33, EPHA1 and ABCA7) for late-onset Alzheimer's disease (LOAD). However, the genetic effect attributable to known loci is about 50%, indicating that additional risk genes for LOAD remain to be identified. In this study, we have used a new GWAS data set from the University of Pittsburgh (1291 cases and 938 controls) to examine in detail the recently implicated nine new regions with Alzheimer's disease (AD) risk, and also performed a meta-analysis utilizing the top 1% GWAS single-nucleotide polymorphisms (SNPs) with P<0.01 along with four independent data sets (2727 cases and 3336 controls) for these SNPs in an effort to identify new AD loci. The new GWAS data were generated on the Illumina Omni1-Quad chip and imputed at ∼2.5 million markers. As expected, several markers in the APOE regions showed genome-wide significant associations in the Pittsburg sample. While we observed nominal significant associations (P<0.05) either within or adjacent to five genes (PICALM, BIN1, ABCA7, MS4A4/MS4A6E and EPHA1), significant signals were observed 69–180 kb outside of the remaining four genes (CD33, CLU, CD2AP and CR1). Meta-analysis on the top 1% SNPs revealed a suggestive novel association in the PPP1R3B gene (top SNP rs3848140 with P=3.05E–07). The association of this SNP with AD risk was consistent in all five samples with a meta-analysis odds ratio of 2.43. This is a potential candidate gene for AD as this is expressed in the brain and is involved in lipid metabolism. These findings need to be confirmed in additional samples

    Genome-wide association studies and genetic architecture of common human diseases

    Get PDF
    Genome-wide association scans provide the first successful method to identify genetic variation contributing to risk for common complex disease. Progress in identifying genes associated with melanoma show complex relationships between genes for pigmentation and the development of melanoma. Novel risk loci account for only a small fraction of the genetic variation contributing to this and many other diseases. Large meta-analyses find additional variants, but there is current debate about the contribution of common polymorphisms, rare polymorphisms or mutations to disease risk

    Genome-Wide Association Analysis of Incident Coronary Heart Disease (CHD) in African Americans: A Short Report

    Get PDF
    African Americans have the highest rate of mortality due to coronary heart disease (CHD). Although multiple loci have been identified influencing CHD risk in European-Americans using a genome-wide association (GWAS) approach, no GWAS of incident CHD has been reported for African Americans. We performed a GWAS for incident CHD events collected during 19 years of follow-up in 2,905 African Americans from the Atherosclerosis Risk in Communities (ARIC) study. We identified a genome-wide significant SNP (rs1859023, MAF = 31%) located at 7q21 near the PFTK1 gene (HR = 0.57, 95% CI 0.46 to 0.69, p = 1.86×10−08), which replicated in an independent sample of over 8,000 African American women from the Women's Health Initiative (WHI) (HR = 0.81, 95% CI 0.70 to 0.93, p = 0.005). PFTK1 encodes a serine/threonine-protein kinase, PFTAIRE-1, that acts as a cyclin-dependent kinase regulating cell cycle progression and cell proliferation. This is the first finding of incident CHD locus identified by GWAS in African Americans

    Proinflammatory genotype is associated with the frailty phenotype in the English Longitudinal Study of Ageing

    Get PDF
    Background: Frailty is a state of increased vulnerability to poor resolution of homeostasis after a stressor event, which increases the risk of adverse outcomes including falls, disability and death. The underlying pathophysiological pathways of frailty are not known but the hypothalamic–pituitary–adrenal axis and heightened chronic systemic inflammation appear to be major contributors. Methods: We used the English Longitudinal Study of Ageing dataset of 3160 individuals over the age of 50 and assessed their frailty status according to the Fried-criteria. We selected single nucleotide polymorphisms in genes involved in the steroid hormone or inflammatory pathways and performed linear association analysis using age and sex as covariates. To support the biological plausibility of any genetic associations, we selected biomarker levels for further analyses to act as potential endophenotypes of our chosen genetic loci. Results: The strongest association with frailty was observed in the Tumor Necrosis Factor (TNF) (rs1800629, P = 0.001198, β = 0.0894) and the Protein Tyrosine Phosphatase, Receptor type, J (PTPRJ) (rs1566729, P = 0.001372, β = 0.09397) genes. Rs1800629 was significantly associated with decreased levels of high-density lipoprotein (HDL) (P = 0.00949) and cholesterol levels (P = 0.00315), whereas rs1566729 was associated with increased levels of HDL (P = 0.01943). After correcting for multiple testing none of the associations remained significant. Conclusions: We provide potential evidence for the involvement of a multifunctional proinflammatory cytokine gene (TNF) in the frailty phenotype. The implication of this gene is further supported by association with the endophenotype biomarker results

    MultiPhen: Joint Model of Multiple Phenotypes Can Increase Discovery in GWAS

    Get PDF
    The genome-wide association study (GWAS) approach has discovered hundreds of genetic variants associated with diseases and quantitative traits. However, despite clinical overlap and statistical correlation between many phenotypes, GWAS are generally performed one-phenotype-at-a-time. Here we compare the performance of modelling multiple phenotypes jointly with that of the standard univariate approach. We introduce a new method and software, MultiPhen, that models multiple phenotypes simultaneously in a fast and interpretable way. By performing ordinal regression, MultiPhen tests the linear combination of phenotypes most associated with the genotypes at each SNP, and thus potentially captures effects hidden to single phenotype GWAS. We demonstrate via simulation that this approach provides a dramatic increase in power in many scenarios. There is a boost in power for variants that affect multiple phenotypes and for those that affect only one phenotype. While other multivariate methods have similar power gains, we describe several benefits of MultiPhen over these. In particular, we demonstrate that other multivariate methods that assume the genotypes are normally distributed, such as canonical correlation analysis (CCA) and MANOVA, can have highly inflated type-1 error rates when testing case-control or non-normal continuous phenotypes, while MultiPhen produces no such inflation. To test the performance of MultiPhen on real data we applied it to lipid traits in the Northern Finland Birth Cohort 1966 (NFBC1966). In these data MultiPhen discovers 21% more independent SNPs with known associations than the standard univariate GWAS approach, while applying MultiPhen in addition to the standard approach provides 37% increased discovery. The most associated linear combinations of the lipids estimated by MultiPhen at the leading SNPs accurately reflect the Friedewald Formula, suggesting that MultiPhen could be used to refine the definition of existing phenotypes or uncover novel heritable phenotypes
    corecore