42 research outputs found
Recommended from our members
Protein-metabolite association studies identify novel proteomic determinants of metabolite levels in human plasma
Although many novel gene-metabolite and gene-protein associations have been identified using high-throughput biochemical profiling, systematic studies that leverage human genetics to illuminate causal relationships between circulating proteins and metabolites are lacking. Here, we performed protein-metabolite association studies in 3,626 plasma samples from three human cohorts. We detected 171,800 significant protein-metabolite pairwise correlations between 1,265 proteins and 365 metabolites, including established relationships in metabolic and signaling pathways such as the protein thyroxine-binding globulin and the metabolite thyroxine, as well as thousands of new findings. In Mendelian randomization (MR) analyses, we identified putative causal protein-to-metabolite associations. We experimentally validated top MR associations in proof-of-concept plasma metabolomics studies in three murine knockout strains of key protein regulators. These analyses identified previously unrecognized associations between bioactive proteins and metabolites in human plasma. We provide publicly available data to be leveraged for studies in human metabolism and disease
Genome-wide association studies identify 137 genetic loci for DNA methylation biomarkers of aging
BACKGROUND: Biological aging estimators derived from DNA methylation data are heritable and correlate with morbidity and mortality. Consequently, identification of genetic and environmental contributors to the variation in these measures in populations has become a major goal in the field. RESULTS: Leveraging DNA methylation and SNP data from more than 40,000 individuals, we identify 137 genome-wide significant loci, of which 113 are novel, from genome-wide association study (GWAS) meta-analyses of four epigenetic clocks and epigenetic surrogate markers for granulocyte proportions and plasminogen activator inhibitor 1 levels, respectively. We find evidence for shared genetic loci associated with the Horvath clock and expression of transcripts encoding genes linked to lipid metabolism and immune function. Notably, these loci are independent of those reported to regulate DNA methylation levels at constituent clock CpGs. A polygenic score for GrimAge acceleration showed strong associations with adiposity-related traits, educational attainment, parental longevity, and C-reactive protein levels. CONCLUSION: This study illuminates the genetic architecture underlying epigenetic aging and its shared genetic contributions with lifestyle factors and longevity
Genome-wide association studies identify 137 genetic loci for DNA methylation biomarkers of aging
BACKGROUND: Biological aging estimators derived from DNA methylation data are heritable and correlate with morbidity and mortality. Consequently, identification of genetic and environmental contributors to the variation in these measures in populations has become a major goal in the field. RESULTS: Leveraging DNA methylation and SNP data from more than 40,000 individuals, we identify 137 genome-wide significant loci, of which 113 are novel, from genome-wide association study (GWAS) meta-analyses of four epigenetic clocks and epigenetic surrogate markers for granulocyte proportions and plasminogen activator inhibitor 1 levels, respectively. We find evidence for shared genetic loci associated with the Horvath clock and expression of transcripts encoding genes linked to lipid metabolism and immune function. Notably, these loci are independent of those reported to regulate DNA methylation levels at constituent clock CpGs. A polygenic score for GrimAge acceleration showed strong associations with adiposity-related traits, educational attainment, parental longevity, and C-reactive protein levels. CONCLUSION: This study illuminates the genetic architecture underlying epigenetic aging and its shared genetic contributions with lifestyle factors and longevity
Powerful, Scalable and Resource-Efficient Meta-Analysis of Rare Variant Associations in Large Whole Genome Sequencing Studies
Meta-analysis of whole genome sequencing/whole exome sequencing (WGS/WES) studies provides an attractive solution to the problem of collecting large sample sizes for discovering rare variants associated with complex phenotypes. Existing rare variant meta-analysis approaches are not scalable to biobank-scale WGS data. Here we present MetaSTAAR, a powerful and resource-efficient rare variant meta-analysis framework for large-scale WGS/WES studies. MetaSTAAR accounts for relatedness and population structure, can analyze both quantitative and dichotomous traits and boosts the power of rare variant tests by incorporating multiple variant functional annotations. Through meta-analysis of four lipid traits in 30,138 ancestrally diverse samples from 14 studies of the Trans Omics for Precision Medicine (TOPMed) Program, we show that MetaSTAAR performs rare variant meta-analysis at scale and produces results comparable to using pooled data. Additionally, we identified several conditionally significant rare variant associations with lipid traits. We further demonstrate that MetaSTAAR is scalable to biobank-scale cohorts through meta-analysis of TOPMed WGS data and UK Biobank WES data of ~200,000 samples
Targeting the hypoxic fraction of tumours using hypoxia activated prodrugs
The presence of a microenvironment within most tumours containing regions of low oxygen tension or hypoxia has profound biological and therapeutic implications. Tumour hypoxia is known to promote the development of an aggressive phenotype, resistance to both chemotherapy and radiotherapy and is strongly associated with poor clinical outcome. Paradoxically, it is recognised as a high priority target and one therapeutic strategies designed to eradicate hypoxic cells in tumours are a group of compounds known collectively as hypoxia activated prodrugs (HAPs) or bioreductive drugs. These drugs are inactive prodrugs that require enzymatic activation (typically by 1 or 2 electron oxidoreductases) to generate cytotoxic species with selectivity for hypoxic cells being determined by (i) the ability of oxygen to either reverse or inhibit the activation process and (ii) the presence of elevated expression of oxidoreductases in tumours. The concepts underpinning HAP development were established over 40 years ago and have been refined over the years to produce a new generation of HAPs that are under preclinical and clinical development. The purpose of this article is to describe current progress in the development of HAPs focusing on the mechanisms of action, preclinical properties and clinical progress of leading examples
Clonal hematopoiesis associated with epigenetic aging and clinical outcomes
Clonal hematopoiesis of indeterminate potential (CHIP) is a common precursor state for blood cancers that most frequently occurs due to mutations in the DNA-methylation modifying enzymes DNMT3A or TET2. We used DNA-methylation array and whole-genome sequencing data from four cohorts together comprising 5522 persons to study the association between CHIP, epigenetic clocks, and health outcomes. CHIP was strongly associated with epigenetic age acceleration, defined as the residual after regressing epigenetic clock age on chronological age, in several clocks, ranging from 1.31 years (GrimAge, p < 8.6 Ă 10â7) to 3.08 years (EEAA, p < 3.7 Ă 10â18). Mutations in most CHIP genes except DNA-damage response genes were associated with increases in several measures of age acceleration. CHIP carriers with mutations in multiple genes had the largest increases in age acceleration and decrease in estimated telomere length. Finally, we found that ~40% of CHIP carriers had acceleration >0 in both Hannum and GrimAge (referred to as AgeAccelHG+). This group was at high risk of all-cause mortality (hazard ratio 2.90, p < 4.1 Ă 10â8) and coronary heart disease (CHD) (hazard ratio 3.24, p < 9.3 Ă 10â6) compared to those who were CHIPâ/AgeAccelHGâ. In contrast, the other ~60% of CHIP carriers who were AgeAccelHGâ were not at increased risk of these outcomes. In summary, CHIP is strongly linked to age acceleration in multiple clocks, and the combination of CHIP and epigenetic aging may be used to identify a population at high risk for adverse outcomes and who may be a target for clinical interventions
Genome-wide association studies identify 137 genetic loci for DNA methylation biomarkers of aging
Background Biological aging estimators derived from DNA methylation data are heritable and correlate with morbidity and mortality. Consequently, identification of genetic and environmental contributors to the variation in these measures in populations has become a major goal in the field. Results Leveraging DNA methylation and SNP data from more than 40,000 individuals, we identify 137 genome-wide significant loci, of which 113 are novel, from genome-wide association study (GWAS) meta-analyses of four epigenetic clocks and epigenetic surrogate markers for granulocyte proportions and plasminogen activator inhibitor 1 levels, respectively. We find evidence for shared genetic loci associated with the Horvath clock and expression of transcripts encoding genes linked to lipid metabolism and immune function. Notably, these loci are independent of those reported to regulate DNA methylation levels at constituent clock CpGs. A polygenic score for GrimAge acceleration showed strong associations with adiposity-related traits, educational attainment, parental longevity, and C-reactive protein levels. Conclusion This study illuminates the genetic architecture underlying epigenetic aging and its shared genetic contributions with lifestyle factors and longevity.Peer reviewe
Canonical correlation analysis for multi-omics: Application to cross-cohort analysis
Integrative approaches that simultaneously model multi-omics data have gained increasing popularity because they provide holistic system biology views of multiple or all components in a biological system of interest. Canonical correlation analysis (CCA) is a correlation-based integrative method designed to extract latent features shared between multiple assays by finding the linear combinations of featuresâreferred to as canonical variables (CVs)âwithin each assay that achieve maximal across-assay correlation. Although widely acknowledged as a powerful approach for multi-omics data, CCA has not been systematically applied to multi-omics data in large cohort studies, which has only recently become available. Here, we adapted sparse multiple CCA (SMCCA), a widely-used derivative of CCA, to proteomics and methylomics data from the Multi-Ethnic Study of Atherosclerosis (MESA) and Jackson Heart Study (JHS). To tackle challenges encountered when applying SMCCA to MESA and JHS, our adaptations include the incorporation of the Gram-Schmidt (GS) algorithm with SMCCA to improve orthogonality among CVs, and the development of Sparse Supervised Multiple CCA (SSMCCA) to allow supervised integration analysis for more than two assays. Effective application of SMCCA to the two real datasets reveals important findings. Applying our SMCCA-GS to MESA and JHS, we identified strong associations between blood cell counts and protein abundance, suggesting that adjustment of blood cell composition should be considered in protein-based association studies. Importantly, CVs obtained from two independent cohorts also demonstrate transferability across the cohorts. For example, proteomic CVs learned from JHS, when transferred to MESA, explain similar amounts of blood cell count phenotypic variance in MESA, explaining 39.0% ~ 50.0% variation in JHS and 38.9% ~ 49.1% in MESA. Similar transferability was observed for other omics-CV-trait pairs. This suggests that biologically meaningful and cohort-agnostic variation is captured by CVs. We anticipate that applying our SMCCA-GS and SSMCCA on various cohorts would help identify cohort-agnostic biologically meaningful relationships between multi-omics data and phenotypic traits
Whole-genome association analyses of sleep-disordered breathing phenotypes in the NHLBI TOPMed program
Background: Sleep-disordered breathing is a common disorder associated with significant morbidity. The genetic architecture of sleep-disordered breathing remains poorly understood. Through the NHLBI Trans-Omics for Precision Medicine (TOPMed) program, we performed the first whole-genome sequence analysis of sleep-disordered breathing. Methods: The study sample was comprised of 7988 individuals of diverse ancestry. Common-variant and pathway analyses included an additional 13,257 individuals. We examined five complementary traits describing different aspects of sleep-disordered breathing: the apnea-hypopnea index, average oxyhemoglobin desaturation per event, average and minimum oxyhemoglobin saturation across the sleep episode, and the percentage of sleep with oxyhemoglobin saturation < 90%. We adjusted for age, sex, BMI, study, and family structure using MMSKAT and EMMAX mixed linear model approaches. Additional bioinformatics analyses were performed with MetaXcan, GIGSEA, and ReMap. Results: We identified a multi-ethnic set-based rare-variant association (p = 3.48 Ă 10â8) on chromosome X with ARMCX3. Additional rare-variant associations include ARMCX3-AS1, MRPS33, and C16orf90. Novel common-variant loci were identified in the NRG1 and SLC45A2 regions, and previously associated loci in the IL18RAP and ATP2B4 regions were associated with novel phenotypes. Transcription factor binding site enrichment identified associations with genes implicated with respiratory and craniofacial traits. Additional analyses identified significantly associated pathways. Conclusions: We have identified the first gene-based rare-variant associations with objectively measured sleep-disordered breathing traits. Our results increase the understanding of the genetic architecture of sleep-disordered breathing and highlight associations in genes that modulate lung development, inflammation, respiratory rhythmogenesis, and HIF1A-mediated hypoxic response
Leveraging linkage evidence to identify low-frequency and rare variants on 16p13 associated with blood pressure using TOPMed whole genome sequencing data
In this study, we investigated low-frequency and rare variants associated with blood pressure (BP) by focusing on a linkage region on chromosome 16p13. We used whole genome sequencing (WGS) data obtained through the NHLBI Trans-Omics for Precision Medicine (TOPMed) program on 395 Cleveland Family Study (CFS) European Americans (CFS-EA). By analyzing functional coding variants and non-coding rare variants with CADD score > 10 residing within the chromosomal region in families with linkage evidence, we observed 25 genes with nominal statistical evidence (burden or SKAT p < 0.05). One of the genes is RBFOX1, an evolutionarily conserved RNA-binding protein that regulates tissue-specific alternative splicing that we previously reported to be associated with BP using exome array data in CFS. After follow-up analysis of the 25 genes in ten independent TOPMed studies with individuals of European, African, and East Asian ancestry, and Hispanics (N = 29,988), we identified variants in SLX4 (p = 2.19 Ă 10â4) to be significantly associated with BP traits when accounting for multiple testing. We also replicated the associations previously reported for RBFOX1 (p = 0.007). Follow-up analysis with GTEx eQTL data shows SLX4 variants are associated with gene expression in coronary artery, multiple brain tissues, and right atrial appendage of the heart. Our study demonstrates that linkage analysis of family data can provide an efficient approach for detecting rare variants associated with complex traits in WGS data