10 research outputs found
Identification of an imprinted master trans regulator at the KLF14 locus related to multiple metabolic phenotypes.
Genome-wide association studies have identified many genetic variants associated with complex traits. However, at only a minority of loci have the molecular mechanisms mediating these associations been characterized. In parallel, whereas cis regulatory patterns of gene expression have been extensively explored, the identification of trans regulatory effects in humans has attracted less attention. Here we show that the type 2 diabetes and high-density lipoprotein cholesterol-associated cis-acting expression quantitative trait locus (eQTL) of the maternally expressed transcription factor KLF14 acts as a master trans regulator of adipose gene expression. Expression levels of genes regulated by this trans-eQTL are highly correlated with concurrently measured metabolic traits, and a subset of the trans-regulated genes harbor variants directly associated with metabolic phenotypes. This trans-eQTL network provides a mechanistic understanding of the effect of the KLF14 locus on metabolic disease risk and offers a potential model for other complex traits.
Genomic analyses identify hundreds of variants associated with age at menarche and support a role for puberty timing in cancer risk
The timing of puberty is a highly polygenic childhood trait that is epidemiologically associated with various adult diseases. Using 1000 Genomes Project-imputed genotype data in up to ~370,000 women, we identify 389 independent signals (P < 5 × 10⁻⁸) for age at menarche, a milestone in female pubertal development. In Icelandic data, these signals explain ~7.4% of the population variance in age at menarche, corresponding to ~25% of the estimated heritability. We implicate ~250 genes via coding variation or associated expression, demonstrating significant enrichment in neural tissues. Rare variants near the imprinted genes MKRN3 and DLK1 were identified, exhibiting large effects when paternally inherited. Mendelian randomization analyses suggest causal inverse associations, independent of body mass index (BMI), between puberty timing and risks for breast and endometrial cancers in women and prostate cancer in men. In aggregate, our findings highlight the complexity of the genetic regulation of puberty timing and support causal links with cancer susceptibility.
Genetic variants associated with mosaic Y chromosome loss highlight cell cycle genes and overlap with cancer susceptibility.
The Y chromosome is frequently lost in hematopoietic cells, which represents the most common somatic alteration in men. However, the mechanisms that regulate mosaic loss of chromosome Y (mLOY), and its clinical relevance, are unknown. We used genotype-array-intensity data and sequence reads from 85,542 men to identify 19 genomic regions (P < 5 × 10⁻⁸) that are associated with mLOY. Cumulatively, these loci also predicted X chromosome loss in women (n = 96,123; P = 4 × 10⁻⁶). Additional epigenome-wide methylation analyses using whole blood highlighted 36 differentially methylated sites associated with mLOY. The genes identified converge on aspects of cell proliferation and cell cycle regulation, including DNA synthesis (NPAT), DNA damage response (ATM), mitosis (PMF1, CENPN and MAD1L1) and apoptosis (TP53). We highlight the shared genetic architecture between mLOY and cancer susceptibility, in addition to inferring a causal effect of smoking on mLOY. Collectively, our results demonstrate that genotype-array-intensity data enables a measure of cell cycle efficiency at population scale and identifies genes implicated in aneuploidy, genome instability and cancer susceptibility.
This research has been conducted using the UK Biobank Resource under Application Number 9905. This work was supported by the UK Medical Research Council (Unit Programme numbers MC_UU_12015/1 and MC_UU_12015/2). Research in the S. Jackson laboratory is funded by Cancer Research UK (CRUK; programme grant C6/A18796), with Institute core funding provided by CRUK (C6946/A14492) and the Wellcome Trust (WT092096). S. Jackson receives salary from the University of Cambridge, supplemented by CRUK.
Mapping cis- and trans-regulatory effects across multiple tissues in twins
Sequence-based variation in gene expression is a key driver of disease risk. Common variants regulating expression in cis have been mapped in many expression quantitative trait locus (eQTL) studies, typically in single tissues from unrelated individuals. Here, we present a comprehensive analysis of gene expression across multiple tissues conducted in a large set of mono- and dizygotic twins that allows systematic dissection of genetic (cis and trans) and non-genetic effects on gene expression. Using identity-by-descent estimates, we show that at least 40% of the total heritable cis effect on expression cannot be accounted for by common cis variants, a finding that reveals the contribution of low-frequency and rare regulatory variants with respect to both transcriptional regulation and complex trait susceptibility. We show that a substantial proportion of gene expression heritability is trans to the structural gene, and we identify several replicating trans variants that act predominantly in a tissue-restricted manner and may regulate the transcription of many genes.
Loci associated with ischaemic stroke and its subtypes (SiGN): a genome-wide association study
BACKGROUND:
The discovery of disease-associated loci through genome-wide association studies (GWAS) is the leading genetic approach to the identification of novel biological pathways underlying diseases in humans. Until recently, GWAS in ischaemic stroke have been limited by small sample sizes and have yielded few loci associated with ischaemic stroke. We did a large-scale GWAS to identify additional susceptibility genes for stroke and its subtypes.
METHODS:
To identify genetic loci associated with ischaemic stroke, we did a two-stage GWAS. In the first stage, we included 16 851 cases with state-of-the-art phenotyping data and 32 473 stroke-free controls. Cases were aged 16 to 104 years, recruited between 1989 and 2012, and subtypes of ischaemic stroke were recorded by centrally trained and certified investigators who used the web-based protocol, Causative Classification of Stroke (CCS). We constructed case-control strata by identifying samples that were genotyped on nearly identical arrays and were of similar genetic ancestral background. We cleaned and imputed data by use of dense imputation reference panels generated from whole-genome sequence data. We did genome-wide testing to identify stroke-associated loci within each stratum for each available phenotype, and we combined summary-level results using inverse variance-weighted fixed-effects meta-analysis. In the second stage, we did in-silico lookups of 1372 single nucleotide polymorphisms identified from the first stage GWAS in 20 941 cases and 364 736 unique stroke-free controls. The ischaemic stroke subtypes of these cases had previously been established with the Trial of Org 10172 in Acute Stroke Treatment (TOAST) classification system, in accordance with local standards. Results from the two stages were then jointly analysed in a final meta-analysis.
FINDINGS:
We identified a novel locus (G allele at rs12122341) at 1p13.2 near TSPAN2 that was associated with large artery atherosclerosis-related stroke (first stage odds ratio [OR] 1·21, 95% CI 1·13-1·30, p=4·50 × 10⁻⁸; joint OR 1·19, 1·12-1·26, p=1·30 × 10⁻⁹). Our results also supported robust associations with ischaemic stroke for four other loci that have been reported in previous studies, including PITX2 (first stage OR 1·39, 1·29-1·49, p=3·26 × 10⁻¹⁹; joint OR 1·37, 1·30-1·45, p=2·79 × 10⁻³²) and ZFHX3 (first stage OR 1·19, 1·11-1·27, p=2·93 × 10⁻⁷; joint OR 1·17, 1·11-1·23, p=2·29 × 10⁻¹⁰) for cardioembolic stroke, and HDAC9 (first stage OR 1·29, 1·18-1·42, p=3·50 × 10⁻⁸; joint OR 1·24, 1·15-1·33, p=4·52 × 10⁻⁹) for large artery atherosclerosis stroke. The 12q24 locus near ALDH2, which has previously been associated with all ischaemic stroke but not with any specific subtype, exceeded genome-wide significance in the meta-analysis of small artery stroke (first stage OR 1·20, 1·12-1·28, p=6·82 × 10⁻⁸; joint OR 1·17, 1·11-1·23, p=2·92 × 10⁻⁹). Other loci associated with stroke in previous studies, including NINJ2, were not confirmed.
INTERPRETATION:
Our results suggest that all ischaemic stroke-related loci previously implicated by GWAS are subtype specific. We identified a novel gene associated with large artery atherosclerosis stroke susceptibility. Follow-up studies will be necessary to establish whether the locus near TSPAN2 can be a target for a novel therapeutic approach to stroke prevention. In view of the subtype-specificity of the associations detected, the rich phenotyping data available in the Stroke Genetics Network (SiGN) are likely to be crucial for further genetic discoveries related to ischaemic stroke.
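The METHODS above combine per-stratum summary results by inverse variance-weighted fixed-effects meta-analysis. As a rough illustration of that general technique (not the SiGN pipeline itself), the following minimal sketch pools per-stratum log odds ratios, weighting each by the inverse of its squared standard error. The function name `ivw_fixed_effects` and the example numbers are hypothetical and are not taken from the study.

```python
import math


def ivw_fixed_effects(betas, ses):
    """Pool per-stratum effect estimates with inverse-variance weights.

    betas: per-stratum effect sizes (e.g. log odds ratios)
    ses:   their standard errors
    Returns (pooled_beta, pooled_se, z, two_sided_p).
    """
    if len(betas) != len(ses) or not betas:
        raise ValueError("betas and ses must be non-empty and the same length")
    # Each stratum is weighted by 1 / SE^2, so precise strata count for more.
    weights = [1.0 / se ** 2 for se in ses]
    total_weight = sum(weights)
    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    z = pooled_beta / pooled_se
    # Two-sided p-value under a standard normal null.
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return pooled_beta, pooled_se, z, p


if __name__ == "__main__":
    # Hypothetical log odds ratios and standard errors for three strata.
    example_betas = [0.18, 0.22, 0.15]
    example_ses = [0.06, 0.08, 0.05]
    beta, se, z, p = ivw_fixed_effects(example_betas, example_ses)
    print(f"pooled OR = {math.exp(beta):.3f}, SE(log OR) = {se:.4f}, p = {p:.2e}")
```

A fixed-effects model assumes that every stratum estimates the same underlying effect; where strata are heterogeneous, a random-effects model would usually be considered instead.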