535 research outputs found

    Multi-omics of obesity and weight change in the post-genomic era

    Get PDF

    Erratum to: Assessment of predictive performance in incomplete data by combining internal validation and multiple imputation

    Get PDF
    Abstract Background Missing values are a frequent issue in human studies. In many situations, multiple imputation (MI) is an appropriate missing data handling strategy, whereby missing values are imputed multiple times, the analysis is performed in every imputed data set, and the obtained estimates are pooled. If the aim is to estimate (added) predictive performance measures, such as (change in) the area under the receiver-operating characteristic curve (AUC), internal validation strategies become desirable in order to correct for optimism. It is not fully understood how internal validation should be combined with multiple imputation. Methods In a comprehensive simulation study and in a real data set based on blood markers as predictors for mortality, we compare three combination strategies: Val-MI, internal validation followed by MI on the training and test parts separately, MI-Val, MI on the full data set followed by internal validation, and MI(-y)-Val, MI on the full data set omitting the outcome followed by internal validation. Different validation strategies, including bootstrap und cross-validation, different (added) performance measures, and various data characteristics are considered, and the strategies are evaluated with regard to bias and mean squared error of the obtained performance estimates. In addition, we elaborate on the number of resamples and imputations to be used, and adopt a strategy for confidence interval construction to incomplete data. Results Internal validation is essential in order to avoid optimism, with the bootstrap 0.632+ estimate representing a reliable method to correct for optimism. While estimates obtained by MI-Val are optimistically biased, those obtained by MI(-y)-Val tend to be pessimistic in the presence of a true underlying effect. Val-MI provides largely unbiased estimates, with a slight pessimistic bias with increasing true effect size, number of covariates and decreasing sample size. In Val-MI, accuracy of the estimate is more strongly improved by increasing the number of bootstrap draws rather than the number of imputations. With a simple integrated approach, valid confidence intervals for performance estimates can be obtained. Conclusions When prognostic models are developed on incomplete data, Val-MI represents a valid strategy to obtain estimates of predictive performance measures

    Anxiety Associated Increased CpG Methylation in the Promoter of Asb1: A Translational Approach Evidenced by Epidemiological and Clinical Studies and a Murine Model

    Get PDF
    Epigenetic regulation in anxiety is suggested, but evidence from large studies is needed. We conducted an epigenome-wide association study (EWAS) on anxiety in a population-based cohort and validated our finding in a clinical cohort as well as a murine model. In the KORA cohort, participants (n= 1522, age 32–72 years) were administered the Generalized Anxiety Disorder (GAD-7) instrument, whole blood DNA methylation was measured (Illumina 450K BeadChip), and circulating levels of hs-CRP and IL-18 were assessed in the association between anxiety and methylation. DNA methylation was measured using the same instrument in a study of patients with anxiety disorders recruited at the Max Planck Institute of Psychiatry (MPIP, 131 non-medicated cases and 169 controls). To expand our mechanistic understanding, these findings were reverse translated in a mouse model of acute social defeat stress. In the KORA study, participants were classified according to mild, moderate, or severe levels of anxiety (29.4%/6.0%/1.5%, respectively). Severe anxiety was associated with 48.5% increased methylation at a single CpG site (cg12701571) located in the promoter of the gene encoding Asb1 (β-coefficient = 0.56 standard error (SE) =0.10, p (Bonferroni) = 0.005), a protein hypothetically involved in regulation of cytokine signaling. An interaction between IL-18 and severe anxiety with methylation of this CpG cite showed a tendency towards significance in the total population (p =0.083) and a significant interaction among women (p =0.014). Methylation of the same CpG was positively associated with Panic and Agoraphobia scale (PAS) scores (β= 0.005, SE= 0.002, p=0.021, n= 131) among cases in the MPIP study. In a murine model of acute social defeat stress, Asb1 gene expression was significantly upregulated in a tissue-specific manner (p= 0.006), which correlated with upregulation of the neuroimmunomodulating cytokine interleukin 1 beta. Our findings suggest epigenetic regulation of the stress-responsive Asb1 gene in anxiety-related phenotypes. Further studies are necessary to elucidate the causal direction of this association and the potential role of Asb1-mediated immune dysregulation in anxiety disorders

    Genetic variants including markers from the exome chip and metabolite traits of type 2 diabetes

    Get PDF
    Diabetes-associated metabolites may aid the identification of new risk variants for type 2 diabetes. Using targeted metabolomics within a subsample of the German EPIC-Potsdam study (n = 2500), we tested previously published SNPs for their association with diabetes-associated metabolites and conducted an additional exploratory analysis using data from the exome chip including replication within 2,692 individuals from the German KORA F4 study. We identified a total of 16 loci associated with diabetes-related metabolite traits, including one novel association between rs499974 (MOGAT2) and a diacyl-phosphatidylcholine ratio (PC aa C40:5/PC aa C38:5). Gene-based tests on all exome chip variants revealed associations between GFRAL and PC aa C42:1/PC aa C42:0, BIN1 and SM (OH) C22:2/SM C18:0 and TFRC and SM (OH) C22:2/SM C16:1). Selecting variants for gene-based tests based on functional annotation identified one additional association between OR51Q1 and hexoses. Among single genetic variants consistently associated with diabetes-related metabolites, two (rs174550 (FADS1), rs3204953 (REV3L)) were significantly associated with type 2 diabetes in large-scale meta-analysis for type 2 diabetes. In conclusion, we identified a novel metabolite locus in single variant analyses and four genes within gene-based tests and confirmed two previously known mGWAS loci which might be relevant for the risk of type 2 diabetes

    Network reconstruction for trans acting genetic loci using multi-omics data and prior information

    Get PDF
    BACKGROUND: Molecular measurements of the genome, the transcriptome, and the epigenome, often termed multi-omics data, provide an in-depth view on biological systems and their integration is crucial for gaining insights in complex regulatory processes. These data can be used to explain disease related genetic variants by linking them to intermediate molecular traits (quantitative trait loci, QTL). Molecular networks regulating cellular processes leave footprints in QTL results as so-called trans-QTL hotspots. Reconstructing these networks is a complex endeavor and use of biological prior information can improve network inference. However, previous efforts were limited in the types of priors used or have only been applied to model systems. In this study, we reconstruct the regulatory networks underlying trans-QTL hotspots using human cohort data and data-driven prior information. METHODS: We devised a new strategy to integrate QTL with human population scale multi-omics data. State-of-the art network inference methods including BDgraph and glasso were applied to these data. Comprehensive prior information to guide network inference was manually curated from large-scale biological databases. The inference approach was extensively benchmarked using simulated data and cross-cohort replication analyses. Best performing methods were subsequently applied to real-world human cohort data. RESULTS: Our benchmarks showed that prior-based strategies outperform methods without prior information in simulated data and show better replication across datasets. Application of our approach to human cohort data highlighted two novel regulatory networks related to schizophrenia and lean body mass for which we generated novel functional hypotheses. CONCLUSIONS: We demonstrate that existing biological knowledge can improve the integrative analysis of networks underlying trans associations and generate novel hypotheses about regulatory mechanisms

    Characterization of missing values in untargeted MS-based metabolomics data and evaluation of missing data handling strategies.

    Get PDF
    BACKGROUND: Untargeted mass spectrometry (MS)-based metabolomics data often contain missing values that reduce statistical power and can introduce bias in biomedical studies. However, a systematic assessment of the various sources of missing values and strategies to handle these data has received little attention. Missing data can occur systematically, e.g. from run day-dependent effects due to limits of detection (LOD); or it can be random as, for instance, a consequence of sample preparation. METHODS: We investigated patterns of missing data in an MS-based metabolomics experiment of serum samples from the German KORA F4 cohort (n = 1750). We then evaluated 31 imputation methods in a simulation framework and biologically validated the results by applying all imputation approaches to real metabolomics data. We examined the ability of each method to reconstruct biochemical pathways from data-driven correlation networks, and the ability of the method to increase statistical power while preserving the strength of established metabolic quantitative trait loci. RESULTS: Run day-dependent LOD-based missing data accounts for most missing values in the metabolomics dataset. Although multiple imputation by chained equations performed well in many scenarios, it is computationally and statistically challenging. K-nearest neighbors (KNN) imputation on observations with variable pre-selection showed robust performance across all evaluation schemes and is computationally more tractable. CONCLUSION: Missing data in untargeted MS-based metabolomics data occur for various reasons. Based on our results, we recommend that KNN-based imputation is performed on observations with variable pre-selection since it showed robust results in all evaluation schemes.This work was supported by grants from the German Federal Ministry of Education and Research (BMBF), by BMBF Grant No. 01ZX1313C (project e:Athero-MED) and Grant No. 03IS2061B (project Gani_Med). Moreover, the research leading to these results has received funding from the European Union’s Seventh Framework Programme [FP7-Health-F5-2012] under grant agreement No. 305280 (MIMOmics) and from the European Research Council (starting grant “LatentCauses”). KS is supported by Biomedical Research Program funds at Weill Cornell Medical College in Qatar, a program funded by the Qatar Foundation. The KORA Augsburg studies were financed by the Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany and supported by grants from the German Federal Ministry of Education and Research (BMBF). Analyses in the EPIC-Norfolk study were supported by funding from the Medical Research Council (MC_PC_13048 and MC_UU_12015/1)

    Characterization of the metabolic profile associated with serum 25-hydroxyvitamin D : a cross-sectional analysis in population-based data

    Get PDF
    Background: Numerous observational studies have observed associations between vitamin D deficiency and cardiometabolic diseases, but these findings might be confounded by obesity. A characterization of the metabolic profile associated with serum 25-hydroxyvitamin D [25(OH)D] levels, in general and stratified by abdominal obesity, may help to untangle the relationship between vitamin D, obesity and cardiometabolic health. Methods: Serum metabolomics measurements were obtained from a nuclear magnetic resonance spectroscopy (NMR)- and a mass spectrometry (MS)-based platform. The discovery was conducted in 1726 participants of the population-based KORA-F4 study, in which the associations of the concentrations of 415 metabolites with 25(OH)D levels were assessed in linear models. The results were replicated in 6759 participants (NMR) and 609 (MS) participants, respectively, of the population-based FINRISK 1997 study. Results: Mean [standard deviation (SD)] 25(OH)D levels were 15.2 (7.5) ng/ml in KORA F4 and 13.8 (5.9) ng/ml in FINRISK 1997; 37 metabolites were associated with 25(OH) D in KORA F4 at P <0.05/415. Of these, 30 associations were replicated in FINRISK 1997 at P <0.05/37. Among these were constituents of (very) large very-low-density lipoprotein and small low-density lipoprotein subclasses and related measures like serum triglycerides as well as fatty acids and measures reflecting the degree of fatty acid saturation. The observed associations were independent of waist circumference and generally similar in abdominally obese and non-obese participants. Conclusions: Independently of abdominal obesity, higher 25(OH)D levels were associated with a metabolite profile characterized by lower concentrations of atherogenic lipids and a higher degree of fatty acid polyunsaturation. These results indicate that the relationship between vitamin D deficiency and cardiometabolic diseases is unlikely to merely reflect obesity-related pathomechanisms.Peer reviewe

    Genome-Wide Analysis of DNA Methylation and Fine Particulate Matter Air Pollution in Three Study Populations: KORA F3, KORA F4, and the Normative Aging Study

    Get PDF
    BACKGROUND: Epidemiological studies have reported associations between particulate matter (PM) concentrations and cancer and respiratory and cardiovascular diseases. DNA methylation has been identified as a possible link but so far it has only been analyzed in candidate sites. OBJECTIVES: We studied the association between DNA methylation and short-and mid-term air pollution exposure using genome-wide data and identified potential biological pathways for-additional investigation. METHODS: We collected whole blood samples from three independent studies-KORA F3 (2004-2005) and F4 (2006-2008) in Germany, and the Normative Aging Study (1999-2007) in the United States-and measured genome-wide DNA methylation proportions with the Illumina 450k BeadChip. PM concentration was measured daily at fixed monitoring stations and three different trailing averages were considered and regressed against DNA methylation: 2-day, 7-day and 28-day. Meta-analysis was performed to pool the study-specific results. RESULTS: Random-effect meta-analysis revealed 12 CpG (cytosine-guanine dinucleotide) sites as associated with PM concentration (1 for 2-day average, 1 for 7-day, and 10 for 28-day) at a genome-wide Bonferroni significance level (p 0.05 and I-2< 0.5: the site from the 7-day average results and 3 for the 28-day average. Applying false discovery rate, p-value < 0.05 was observed in 8 and 1,819 additional CpGs at 7- and 28-day average PM2.5 exposure respectively. CONCLUSION: The PM-related CpG sites found in our study suggest novel plausible systemic pathways linking ambient PM exposure to adverse health effect through variations in DNA methylation
    corecore