39 research outputs found

    Patient stratification by genetic risk in Alzheimer\u27s disease is only effective in the presence of phenotypic heterogeneity

    Get PDF
    Case-only designs in longitudinal cohorts are a valuable resource for identifying disease-relevant genes, pathways, and novel targets influencing disease progression. This is particularly relevant in Alzheimer\u27s disease (AD), where longitudinal cohorts measure disease progression, defined by rate of cognitive decline. Few of the identified drug targets for AD have been clinically tractable, and phenotypic heterogeneity is an obstacle to both clinical research and basic science. In four cohorts (n = 7241), we performed genome-wide association studies (GWAS) and Mendelian randomization (MR) to discover novel targets associated with progression and assess causal relationships. We tested opportunities for patient stratification by deriving polygenic risk scores (PRS) for AD risk and severity and tested the value of these scores in predicting progression. Genome-wide association studies identified no loci associated with progression at genome-wide significance (α = 5×10-8); MR analyses provided no significant evidence of an association between cognitive decline in AD patients and protein levels in brain, cerebrospinal fluid (CSF), and plasma. Polygenic risk scores for AD risk did not reliably stratify fast from slow progressors; however, a deeper investigation found that APOE ε4 status predicts amyloid-β and tau positive versus negative patients (odds ratio for an additional APOE ε4 allele = 5.78 [95% confidence interval: 3.76-8.89], P\u3c0.001) when restricting to a subset of patients with available CSF biomarker data. These results provided no evidence for large-effect, common-variant loci involved in the rate of memory decline, suggesting that patient stratification based on common genetic risk factors for progression may have limited utility. Where clinically relevant biomarkers suggest diagnostic heterogeneity, there is evidence that a priori identified genetic risk factors may have value in patient stratification. Mendelian randomization was less tractable due to the lack of large-effect loci, and future analyses with increased samples sizes are needed to replicate and validate our results

    A machine learning approach to brain epigenetic analysis reveals kinases associated with Alzheimers disease.

    Get PDF
    Alzheimers disease (AD) is influenced by both genetic and environmental factors; thus, brain epigenomic alterations may provide insights into AD pathogenesis. Multiple array-based Epigenome-Wide Association Studies (EWASs) have identified robust brain methylation changes in AD; however, array-based assays only test about 2% of all CpG sites in the genome. Here, we develop EWASplus, a computational method that uses a supervised machine learning strategy to extend EWAS coverage to the entire genome. Application to six AD-related traits predicts hundreds of new significant brain CpGs associated with AD, some of which are further validated experimentally. EWASplus also performs well on data collected from independent cohorts and different brain regions. Genes found near top EWASplus loci are enriched for kinases and for genes with evidence for physical interactions with known AD genes. In this work, we show that EWASplus implicates additional epigenetic loci for AD that are not found using array-based AD EWASs

    Proteomic signatures improve risk prediction for common and rare diseases

    Get PDF
    For many diseases there are delays in diagnosis due to a lack of objective biomarkers for disease onset. Here, in 41,931 individuals from the United Kingdom Biobank Pharma Proteomics Project, we integrated measurements of ~3,000 plasma proteins with clinical information to derive sparse prediction models for the 10-year incidence of 218 common and rare diseases (81–6,038 cases). We then compared prediction models developed using proteomic data with models developed using either basic clinical information alone or clinical information combined with data from 37 clinical assays. The predictive performance of sparse models including as few as 5 to 20 proteins was superior to the performance of models developed using basic clinical information for 67 pathologically diverse diseases (median delta C-index = 0.07; range = 0.02–0.31). Sparse protein models further outperformed models developed using basic information combined with clinical assay data for 52 diseases, including multiple myeloma, non-Hodgkin lymphoma, motor neuron disease, pulmonary fibrosis and dilated cardiomyopathy. For multiple myeloma, single-cell RNA sequencing from bone marrow in newly diagnosed patients showed that four of the five predictor proteins were expressed specifically in plasma cells, consistent with the strong predictive power of these proteins. External replication of sparse protein models in the EPIC-Norfolk study showed good generalizability for prediction of the six diseases tested. These findings show that sparse plasma protein signatures, including both disease-specific proteins and protein predictors shared across several diseases, offer clinically useful prediction of common and rare diseases

    Whole-genome sequencing of 490,640 UK Biobank participants

    Get PDF
    Whole-genome sequencing provides an unbiased and complete view of the human genome and enables the discovery of genetic variation without the technical limitations of other genotyping technologies. Here we report on whole-genome sequencing of 490,640 UK Biobank participants, building on previous genotyping effort1. This advance deepens our understanding of how genetics associates with disease biology and further enhances the value of this open resource for the study of human biology and health. Coupling this dataset with rich phenotypic data, we surveyed within- and cross-ancestry genomic associations and identified novel genetic and clinical insights. Although most associations with disease traits were primarily observed in individuals of European ancestries, strong or novel signals were also identified in individuals of African and Asian ancestries. With the improved ability to accurately genotype structural variants and exonic variation in both coding and UTR sequences, we strengthened and revealed novel insights relative to whole-exome sequencing2,3 analyses. This dataset, representing a large collection of whole-genome sequencing data that is available to the UK Biobank research community, will enable advances of our understanding of the human genome, facilitate the discovery of diagnostics and therapeutics with higher efficacy and improved safety profile, and enable precision medicine strategies with the potential to improve global health

    Genetic control of the human brain proteome

    No full text

    An integrated -omics analysis of the epigenetic landscape of gene expression in human blood cells

    No full text
    Abstract Background Gene expression can be influenced by DNA methylation 1) distally, at regulatory elements such as enhancers, as well as 2) proximally, at promoters. Our current understanding of the influence of distal DNA methylation changes on gene expression patterns is incomplete. Here, we characterize genome-wide methylation and expression patterns for ~ 13 k genes to explore how DNA methylation interacts with gene expression, throughout the genome. Results We used a linear mixed model framework to assess the correlation of DNA methylation at ~ 400 k CpGs with gene expression changes at ~ 13 k transcripts in two independent datasets from human blood cells. Among CpGs at which methylation significantly associates with transcription (eCpGs), > 50% are distal (> 50 kb) or trans (different chromosome) to the correlated gene. Many eCpG-transcript pairs are consistent between studies and ~ 90% of neighboring eCpGs associate with the same gene, within studies. We find that enhancers (P < 5e-18) and microRNA genes (P = 9e-3) are overrepresented among trans eCpGs, and insulators and long intergenic non-coding RNAs are enriched among cis and distal eCpGs. Intragenic-eCpG-transcript correlations are negative in 60–70% of occurrences and are enriched for annotated gene promoters and enhancers (P < 0.002), highlighting the importance of intragenic regulation. Gene Ontology analysis indicates that trans eCpGs are enriched for transcription factor genes and chromatin modifiers, suggesting that some trans eCpGs represent the influence of gene networks and higher-order transcriptional control. Conclusions This work sheds new light on the interplay between epigenetic changes and gene expression, and provides useful data for mining biologically-relevant results from epigenome-wide association studies
    corecore