36 research outputs found

    Genomic characterisation of Australian wild rice species

    Get PDF

    Power, false discovery rate and Winner's Curse in eQTL studies.

    Get PDF
    Investigation of the genetic architecture of gene expression traits has aided interpretation of disease and trait-associated genetic variants; however, key aspects of expression quantitative trait loci (eQTL) study design and analysis remain understudied. We used extensive, empirically driven simulations to explore eQTL study design and the performance of various analysis strategies. Across multiple testing correction methods, false discoveries of genes with eQTLs (eGenes) were substantially inflated when false discovery rate (FDR) control was applied to all tests and only appropriately controlled using hierarchical procedures. All multiple testing correction procedures had low power and inflated FDR for eGenes whose causal SNPs had small allele frequencies using small sample sizes (e.g. frequency 25%). Overestimation of eQTL effect sizes, so-called 'Winner's Curse', was common in low and moderate power settings. To address this, we developed a bootstrap method (BootstrapQTL) that led to more accurate effect size estimation. These insights provide a foundation for future eQTL studies, especially those with sampling constraints and subtly different conditions

    Genomic risk scores for juvenile idiopathic arthritis and its subtypes

    Get PDF
    Objectives: Juvenile idiopathic arthritis (JIA) is an autoimmune disease and a common cause of chronic disability in children. Diagnosis of JIA is based purely on clinical symptoms, which can be variable, leading to diagnosis and treatment delays. Despite JIA having substantial heritability, the construction of genomic risk scores (GRSs) to aid or expedite diagnosis has not been assessed. Here, we generate GRSs for JIA and its subtypes and evaluate their performance. Methods: We examined three case/control cohorts (UK, US-based and Australia) with genome-wide single nucleotide polymorphism (SNP) genotypes. We trained GRSs for JIA and its subtypes using lasso-penalised linear models in cross-validation on the UK cohort, and externally tested it in the other cohorts. Results: The JIA GRS alone achieved cross-validated area under the receiver operating characteristic curve (AUC)=0.670 in the UK cohort and externally-validated AUCs of 0.657 and 0.671 in the US-based and Australian cohorts, respectively. In logistic regression of case/control status, the corresponding odds ratios (ORs) per standard deviation (SD) of GRS were 1.831 (1.685 to 1.991) and 2.008 (1.731 to 2.345), and were unattenuated by adjustment for sex or the top 10 genetic principal components. Extending our analysis to JIA subtypes revealed that the enthesitis-related JIA had both the longest time-to-referral and the subtype GRS with the strongest predictive capacity overall across data sets: AUCs 0.82 in UK; 0.84 in Australian; and 0.70 in US-based. The particularly common oligoarthritis JIA also had a GRS that outperformed those for JIA overall, with AUCs of 0.72, 0.74 and 0.77, respectively. Conclusions: A GRS for JIA has potential to augment clinical JIA diagnosis protocols, prioritising higher-risk individuals for follow-up and treatment. Consistent with JIA heterogeneity, subtype-specific GRSs showed particularly high performance for enthesitis-related and oligoarthritis JIA

    Elevated serum alpha-1 antitrypsin is a major component of GlycA-associated risk for future morbidity and mortality

    Get PDF
    Background GlycA is a nuclear magnetic resonance (NMR) spectroscopy biomarker that predicts risk of disease from myriad causes. It is heterogeneous; arising from five circulating glycoproteins with dynamic concentrations: alpha-1 antitrypsin (AAT), alpha-1-acid glycoprotein (AGP), haptoglobin (HP), transferrin (TF), and alpha-1-antichymotrypsin (AACT). The contributions of each glycoprotein to the disease and mortality risks predicted by GlycA remain unknown. Methods We trained imputation models for AAT, AGP, HP, and TF from NMR metabolite measurements in 626 adults from a population cohort with matched NMR and immunoassay data. Levels of AAT, AGP, and HP were estimated in 11,861 adults from two population cohorts with eight years of follow-up, then each biomarker was tested for association with all common endpoints. Whole blood gene expression data was used to identify cellular processes associated with elevated AAT. Results Accurate imputation models were obtained for AAT, AGP, and HP but not for TF. While AGP had the strongest correlation with GlycA, our analysis revealed variation in imputed AAT levels was the most predictive of morbidity and mortality for the widest range of diseases over the eight year follow-up period, including heart failure (meta-analysis hazard ratio = 1.60 per standard deviation increase of AAT, P-value = 1×10−10), influenza and pneumonia (HR = 1.37, P = 6×10−10), and liver diseases (HR = 1.81, P = 1×10−6). Transcriptional analyses revealed association of elevated AAT with diverse inflammatory immune pathways. Conclusions This study clarifies the molecular underpinnings of the GlycA biomarker’s associated disease risk, and indicates a previously unrecognised association between elevated AAT and severe disease onset and mortality.Peer reviewe

    Acute effects of active breaks during prolonged sitting on subcutaneous adipose tissue gene expression: an ancillary analysis of a randomised controlled trial.

    Get PDF
    Active breaks in prolonged sitting has beneficial impacts on cardiometabolic risk biomarkers. The molecular mechanisms include regulation of skeletal muscle gene and protein expression controlling metabolic, inflammatory and cell development pathways. An active communication network exists between adipose and muscle tissue, but the effect of active breaks in prolonged sitting on adipose tissue have not been investigated. This study characterized the acute transcriptional events induced in adipose tissue by regular active breaks during prolonged sitting. We studied 8 overweight/obese adults participating in an acute randomized three-intervention crossover trial. Interventions were performed in the postprandial state and included: (i) prolonged uninterrupted sitting; or prolonged sitting interrupted with 2-minute bouts of (ii) light- or (iii) moderate-intensity treadmill walking every 20 minutes. Subcutaneous adipose tissue biopsies were obtained after each condition. Microarrays identified 36 differentially expressed genes between the three conditions (fold change ≥0.5 in either direction; p < 0.05). Pathway analysis indicated that breaking up of prolonged sitting led to differential regulation of adipose tissue metabolic networks and inflammatory pathways, increased insulin signaling, modulation of adipocyte cell cycle, and facilitated cross-talk between adipose tissue and other organs. This study provides preliminary insight into the adipose tissue regulatory systems that may contribute to the physiological effects of interrupting prolonged sitting

    Genomic risk prediction of coronary artery disease in nearly 500,000 adults: implications for early screening and primary prevention

    Get PDF
    Background Coronary artery disease (CAD) has substantial heritability and a polygenic architecture; however, genomic risk scores have not yet leveraged the totality of genetic information available nor been externally tested at population-scale to show potential utility in primary prevention. Methods Using a meta-analytic approach to combine large-scale genome-wide and targeted genetic association data, we developed a new genomic risk score for CAD (metaGRS), consisting of 1.7 million genetic variants. We externally tested metaGRS, individually and in combination with available conventional risk factors, in 22,242 CAD cases and 460,387 non-cases from UK Biobank. Findings In UK Biobank, a standard deviation increase in metaGRS had a hazard ratio (HR) of 1.71 (95% CI 1.68–1.73) for CAD, greater than any other externally tested genetic risk score. Individuals in the top 20% of the metaGRS distribution had a HR of 4.17 (95% CI 3.97–4.38) compared with those in the bottom 20%. The metaGRS had higher C-index (C=0.623, 95% CI 0.615–0.631) for incident CAD than any of four conventional factors (smoking, diabetes, hypertension, and body mass index), and addition of the metaGRS to a model of conventional risk factors increased C-index by 3.7%. In individuals on lipid-lowering or anti-hypertensive medications at recruitment, metaGRS hazard for incident CAD was significantly but only partially attenuated with HR of 2.83 (95% CI 2.61– 3.07) between the top and bottom 20% of the metaGRS distribution. Interpretation Recent genetic association studies have yielded enough information to meaningfully stratify individuals using the metaGRS for CAD risk in both early and later life, thus enabling targeted primary intervention in combination with conventional risk factors. The metaGRS effect was partially attenuated by lipid and blood pressure-lowering medication, however other prevention strategies will be required to fully benefit from earlier genomic risk stratification. Funding National Health and Medical Research Council of Australia, British Heart Foundation, Australian Heart Foundation.This study was supported by funding from National Health and Medical Research Council (NHMRC) grant APP1062227. Supported in part by the Victorian Government’s OIS Program. M.I. was supported by an NHMRC and Australian Heart Foundation Career Development Fellowship (no. 1061435). G.A. was supported by an NHMRC Early Career Fellowship (no. 1090462). N.J.S., C.P.N. and B.K. are supported by the British Heart Foundation and N.J.S. is a NIHR Senior Investigator. R.S.P. is supported by the British Heart Foundation (FS/14/76/30933). The MRC/BHF Cardiovascular Epidemiology Unit is supported by the UK Medical Research Council [MR/L003120/1], British Heart Foundation [RG/13/13/30194], and UK National Institute for Health Research Cambridge Biomedical Research Centre. J.D. is a British Heart Foundation Professor and NIHR Senior Investigator

    Neonatal genetics of gene expression reveal potential origins of autoimmune and allergic disease risk

    Get PDF
    Abstract: Chronic immune-mediated diseases of adulthood often originate in early childhood. To investigate genetic associations between neonatal immunity and disease, we map expression quantitative trait loci (eQTLs) in resting myeloid cells and CD4+ T cells from cord blood samples, as well as in response to lipopolysaccharide (LPS) or phytohemagglutinin (PHA) stimulation, respectively. Cis-eQTLs are largely specific to cell type or stimulation, and 31% and 52% of genes with cis-eQTLs have response eQTLs (reQTLs) in myeloid cells and T cells, respectively. We identified cis regulatory factors acting as mediators of trans effects. There is extensive colocalisation between condition-specific neonatal cis-eQTLs and variants associated with immune-mediated diseases, in particular CTSH had widespread colocalisation across diseases. Mendelian randomisation shows causal neonatal gene expression effects on disease risk for BTN3A2, HLA-C and others. Our study elucidates the genetics of gene expression in neonatal immune cells, and aetiological origins of autoimmune and allergic diseases

    Trajectories of childhood immune development and respiratory health relevant to asthma and allergy.

    Get PDF
    Events in early life contribute to subsequent risk of asthma; however, the causes and trajectories of childhood wheeze are heterogeneous and do not always result in asthma. Similarly, not all atopic individuals develop wheeze, and vice versa. The reasons for these differences are unclear. Using unsupervised model-based cluster analysis, we identified latent clusters within a prospective birth cohort with deep immunological and respiratory phenotyping. We characterised each cluster in terms of immunological profile and disease risk, and replicated our results in external cohorts from the UK and USA. We discovered three distinct trajectories, one of which is a high-risk 'atopic' cluster with increased propensity for allergic diseases throughout childhood. Atopy contributes varyingly to later wheeze depending on cluster membership. Our findings demonstrate the utility of unsupervised analysis in elucidating heterogeneity in asthma pathogenesis and provide a foundation for improving management and prevention of childhood asthma

    Comprehensive genetic analysis of the human lipidome identifies loci associated with lipid homeostasis with links to coronary artery disease

    Get PDF
    We integrated lipidomics and genomics to unravel the genetic architecture of lipid metabolism and identify genetic variants associated with lipid species putatively in the mechanistic pathway for coronary artery disease (CAD). We quantified 596 lipid species in serum from 4,492 individuals from the Busselton Health Study. The discovery GWAS identified 3,361 independent lipid-loci associations, involving 667 genomic regions (479 previously unreported), with validation in two independent cohorts. A meta-analysis revealed an additional 70 independent genomic regions associated with lipid species. We identified 134 lipid endophenotypes for CAD associated with 186 genomic loci. Associations between independent lipid-loci with coronary atherosclerosis were assessed in ∼ 456,000 individuals from the UK Biobank. Of the 53 lipid-loci that showed evidence of association (P \u3c 1 × 10−3), 43 loci were associated with at least one lipid endophenotype. These findings illustrate the value of integrative biology to investigate the aetiology of atherosclerosis and CAD, with implications for other complex diseases
    corecore