48 research outputs found
Lifestyle and Genetic Factors Modify Parent-of-Origin Effects on the Human Methylome
BACKGROUND: parent-of-origin effects (POE) play important roles in complex disease and thus understanding their regulation and associated molecular and phenotypic variation are warranted. Previous studies mainly focused on the detection of genomic regions or phenotypes regulated by POE. Understanding whether POE may be modified by environmental or genetic exposures is important for understanding of the source of POE-associated variation, but only a few case studies addressing modifiable POE exist. METHODS: in order to understand this high order of POE regulation, we screened 101 genetic and environmental factors such as âpredicted mRNA expression levelsâ of DNA methylation/imprinting machinery genes and environmental exposures. POE-mQTL-modifier interaction models were proposed to test the potential of these factors to modify POE at DNA methylation using data from Generation Scotland: The Scottish Family Health Study(N=2315). FINDINGS: a set of vulnerable/modifiable POE-CpGs were identified (modifiable-POE-regulated CpGs, N=3). Four factors, âlifetime smoking statusâ and âpredicted mRNA expression levelsâ of TET2, SIRT1 and KDM1A, were found to significantly modify the POE on the three CpGs in both discovery and replication datasets. We further identified plasma protein and health-related phenotypes associated with the methylation level of one of the identified CpGs. INTERPRETATION: the modifiable POE identified here revealed an important yet indirect path through which genetic background and environmental exposures introduce their effect on DNA methylation, motivating future comprehensive evaluation of the role of these modifiers in complex diseases. FUNDING: NSFC (81971270),H2020-MSCA-ITN(721815), Wellcome (204979/Z/16/Z,104036/Z/14/Z), MRC (MC_UU_00007/10, MC_PC_U127592696), CSO (CZD/16/6,CZB/4/276, CZB/4/710), SFC (HR03006), EUROSPAN (LSHG-CT-2006-018947), BBSRC (BBS/E/D/30002276), SYSU, Arthritis Research UK, NHLBI, NIH
Linking protein to phenotype with Mendelian Randomization detects 38 proteins with causal roles in human diseases and traits
To efficiently transform genetic associations into drug targets requires evidence that a particular gene, and its encoded protein, contribute causally to a disease. To achieve this, we employ a three-step proteome-by-phenome Mendelian Randomization (MR) approach. In step one, 154 protein quantitative trait loci (pQTLs) were identified and independently replicated. From these pQTLs, 64 replicated locally-acting variants were used as instrumental variables for proteome-by-phenome MR across 846 traits (step two). When its assumptions are met, proteome-by-phenome MR, is equivalent to simultaneously running many randomized controlled trials. Step 2 yielded 38 proteins that significantly predicted variation in traits and diseases in 509 instances. Step 3 revealed that amongst the 271 instances from GeneAtlas (UK Biobank), 77 showed little evidence of pleiotropy (HEIDI), and 92 evidence of colocalization (eCAVIAR). Results were wide ranging: including, for example, new evidence for a causal role of tyrosine-protein phosphatase non-receptor type substrate 1 (SHPS1; SIRPA) in schizophrenia, and a new finding that intestinal fatty acid binding protein (FABP2) abundance contributes to the pathogenesis of cardiovascular disease. We also demonstrated confirmatory evidence for the causal role of four further proteins (FGF5, IL6R, LPL, LTA) in cardiovascular disease risk
Assessing the role of genome-wide DNA methylation between smoking and risk of lung cancer using repeated measurements: the HUNT Study
Background - It is unclear if smoking-related DNA methylation represents a causal pathway between smoking and risk of lung cancer. We sought to identify novel smoking-related DNA methylation sites in blood, with repeated measurements, and to appraise the putative role of DNA methylation in the pathway between smoking and lung cancer development.
Methods - We derived a nested case-control study from the TrĂžndelag Health Study (HUNT), including 140 incident patients who developed lung cancer during 2009â13 and 140 controls. We profiled 850âK DNA methylation sites (Illumina Infinium EPIC array) in DNA extracted from blood that was collected in HUNT2 (1995â97) and HUNT3 (2006â08) for the same individuals. Epigenome-wide association studies (EWAS) were performed for a detailed smoking phenotype and for lung cancer. Two-step Mendelian randomization (MR) analyses were performed to assess the potential causal effect of smoking on DNA methylation as well as of DNA methylation (13 sites as putative mediators) on risk of lung cancer.
Results - The EWAS for smoking in HUNT2 identified associations at 76 DNA methylation sites (Pââ8), including 16 novel sites. Smoking was associated with DNA hypomethylation in a dose-response relationship among 83% of the 76 sites, which was confirmed by analyses using repeated measurements from blood that was collected at 11âyears apart for the same individuals. Two-step MR analyses showed evidence for a causal effect of smoking on DNA methylation but no evidence for a causal link between DNA methylation and the risk of lung cancer.
Conclusions - DNA methylation modifications in blood did not seem to represent a causal pathway linking smoking and the lung cancer risk
Genome-Wide Association Study of Non-Alcoholic Fatty Liver Disease using Electronic Health Records
Genomeâwide association studies (GWAS) have identified several risk loci for nonalcoholic fatty liver disease (NAFLD). Previous studies have largely relied on small sample sizes and have assessed quantitative traits. We performed a caseâcontrol GWAS in the UK Biobank using recorded diagnosis of NAFLD based on diagnostic codes recommended in recent consensus guidelines. We performed a GWAS of 4,761 cases of NAFLD and 373,227 healthy controls without evidence of NAFLD. Sensitivity analyses were performed excluding other coâexisting hepatic pathology, adjusting for body mass index (BMI) and adjusting for alcohol intake. A total of 9,723,654 variants were assessed by logistic regression adjusted for age, sex, genetic principal components, and genotyping batch. We performed a GWAS metaâanalysis using available summary association statistics. Six risk loci were identified (P < 5*10(â8)) (apolipoprotein E [APOE], patatinâlike phospholipase domain containing 3 [PNPLA3, transmembrane 6 superfamily member 2 [TM6SF2], glucokinase regulator [GCKR], mitochondrial amidoxime reducing component 1 [MARC1], and tribbles pseudokinase 1 [TRIB1]). All loci retained significance in sensitivity analyses without coâexistent hepatic pathology and after adjustment for BMI. PNPLA3 and TM6SF2 remained significant after adjustment for alcohol (alcohol intake was known in only 158,388 individuals), with others demonstrating consistent direction and magnitude of effect. All six loci were significant on metaâanalysis. Rs429358 (P = 2.17*10(â11)) is a missense variant within the APOE gene determining Ï”4 versus Ï”2/Ï”3 alleles. The Ï”4 allele of APOE offered protection against NAFLD (odds ratio for heterozygotes 0.84 [95% confidence interval 0.78â0.90] and homozygotes 0.64 [0.50â0.79]). Conclusion: This GWAS replicates six known NAFLDâsusceptibility loci and confirms that the Ï”4 allele of APOE is associated with protection against NAFLD. The results are consistent with published GWAS using histological and radiological measures of NAFLD, confirming that NAFLD identified through diagnostic codes from consensus guidelines is a valid alternative to more invasive and costly approaches
Evaluation of pragmatic oxygenation measurement as a proxy for Covid-19 severity
Choosing optimal outcome measures maximizes statistical power, accelerates discovery and improves reliability in early-phase trials. We devised and evaluated a modification to a pragmatic measure of oxygenation function, the [Formula: see text] ratio. Because of the ceiling effect in oxyhaemoglobin saturation, [Formula: see text] ratio ceases to reflect pulmonary oxygenation function at high [Formula: see text] values. We found that the correlation of [Formula: see text] with the reference standard ([Formula: see text]/[Formula: see text] ratio) improves substantially when excluding [Formula: see text] and refer to this measure as [Formula: see text]. Using observational data from 39,765 hospitalised COVID-19 patients, we demonstrate that [Formula: see text] is predictive of mortality, and compare the sample sizes required for trials using four different outcome measures. We show that a significant difference in outcome could be detected with the smallest sample size using [Formula: see text]. We demonstrate that [Formula: see text] is an effective intermediate outcome measure in COVID-19. It is a non-invasive measurement, representative of disease severity and provides greater statistical power
Evaluation of pragmatic oxygenation measurement as a proxy for Covid-19 severity
Choosing optimal outcome measures maximizes statistical power, accelerates discovery and improves reliability in early-phase trials. We devised and evaluated a modification to a pragmatic measure of oxygenation function, the [Formula: see text] ratio. Because of the ceiling effect in oxyhaemoglobin saturation, [Formula: see text] ratio ceases to reflect pulmonary oxygenation function at high [Formula: see text] values. We found that the correlation of [Formula: see text] with the reference standard ([Formula: see text]/[Formula: see text] ratio) improves substantially when excluding [Formula: see text] and refer to this measure as [Formula: see text]. Using observational data from 39,765 hospitalised COVID-19 patients, we demonstrate that [Formula: see text] is predictive of mortality, and compare the sample sizes required for trials using four different outcome measures. We show that a significant difference in outcome could be detected with the smallest sample size using [Formula: see text]. We demonstrate that [Formula: see text] is an effective intermediate outcome measure in COVID-19. It is a non-invasive measurement, representative of disease severity and provides greater statistical power
GWAS and meta-analysis identifies 49 genetic variants underlying critical COVID-19
Critical illness in COVID-19 is an extreme and clinically homogeneous disease phenotype that we have previously shown1 to be highly efficient for discovery of genetic associations2. Despite the advanced stage of illness at presentation, we have shown that host genetics in patients who are critically ill with COVID-19 can identify immunomodulatory therapies with strong beneficial effects in this group3. Here we analyse 24,202 cases of COVID-19 with critical illness comprising a combination of microarray genotype and whole-genome sequencing data from cases of critical illness in the international GenOMICC (11,440 cases) study, combined with other studies recruiting hospitalized patients with a strong focus on severe and critical disease: ISARIC4C (676 cases) and the SCOURGE consortium (5,934 cases). To put these results in the context of existing work, we conduct a meta-analysis of the new GenOMICC genome-wide association study (GWAS) results with previously published data. We find 49 genome-wide significant associations, of which 16 have not been reported previously. To investigate the therapeutic implications of these findings, we infer the structural consequences of protein-coding variants, and combine our GWAS results with gene expression data using a monocyte transcriptome-wide association study (TWAS) model, as well as gene and protein expression using Mendelian randomization. We identify potentially druggable targets in multiple systems, including inflammatory signalling (JAK1), monocyte-macrophage activation and endothelial permeability (PDE4A), immunometabolism (SLC2A5 and AK5), and host factors required for viral entry and replication (TMPRSS2 and RAB2A)
GWAS and Meta-Analysis Identifies 49 Genetic Variants Underlying Critical COVID-19
Critical illness in COVID-19 is an extreme and clinically homogeneous disease phenotype that we have previously shown1 to be highly efficient for discovery of genetic associations2. Despite the advanced stage of illness at presentation, we have shown that host genetics in patients who are critically ill with COVID-19 can identify immunomodulatory therapies with strong beneficial effects in this group3. Here we analyse 24,202 cases of COVID-19 with critical illness comprising a combination of microarray genotype and whole-genome sequencing data from cases of critical illness in the international GenOMICC (11,440 cases) study, combined with other studies recruiting hospitalized patients with a strong focus on severe and critical disease: ISARIC4C (676 cases) and the SCOURGE consortium (5,934 cases). To put these results in the context of existing work, we conduct a meta-analysis of the new GenOMICC genome-wide association study (GWAS) results with previously published data. We find 49 genome-wide significant associations, of which 16 have not been reported previously. To investigate the therapeutic implications of these findings, we infer the structural consequences of protein-coding variants, and combine our GWAS results with gene expression data using a monocyte transcriptome-wide association study (TWAS) model, as well as gene and protein expression using Mendelian randomization. We identify potentially druggable targets in multiple systems, including inflammatory signalling (JAK1), monocyte-macrophage activation and endothelial permeability (PDE4A), immunometabolism (SLC2A5 and AK5), and host factors required for viral entry and replication (TMPRSS2 and RAB2A)
Recommended from our members
Genomic and drug target evaluation of 90 cardiovascular proteins in 30,931 individuals.
Circulating proteins are vital in human health and disease and are frequently used as biomarkers for clinical decision-making or as targets for pharmacological intervention. Here, we map and replicate protein quantitative trait loci (pQTL) for 90 cardiovascular proteins in over 30,000 individuals, resulting in 451 pQTLs for 85 proteins. For each protein, we further perform pathway mapping to obtain trans-pQTL gene and regulatory designations. We substantiate these regulatory findings with orthogonal evidence for trans-pQTLs using mouse knockdown experiments (ABCA1 and TRIB1) and clinical trial results (chemokine receptors CCR2 and CCR5), with consistent regulation. Finally, we evaluate known drug targets, and suggest new target candidates or repositioning opportunities using Mendelian randomization. This identifies 11 proteins with causal evidence of involvement in human disease that have not previously been targeted, including EGF, IL-16, PAPPA, SPON1, F3, ADM, CASP-8, CHI3L1, CXCL16, GDF15 and MMP-12. Taken together, these findings demonstrate the utility of large-scale mapping of the genetics of the proteome and provide a resource for future precision studies of circulating proteins in human health