11 research outputs found

    Epigenetics, heritability and longitudinal analysis

    Get PDF
    Background Longitudinal data and repeated measurements in epigenome-wide association studies (EWAS) provide a rich resource for understanding epigenetics. We summarize 7 analytical approaches to the GAW20 data sets that addressed challenges and potential applications of phenotypic and epigenetic data. All contributions used the GAW20 real data set and employed either linear mixed effect (LME) models or marginal models through generalized estimating equations (GEE). These contributions were subdivided into 3 categories: (a) quality control (QC) methods for DNA methylation data; (b) heritability estimates pretreatment and posttreatment with fenofibrate; and (c) impact of drug response pretreatment and posttreatment with fenofibrate on DNA methylation and blood lipids. Results Two contributions addressed QC and identified large statistical differences with pretreatment and posttreatment DNA methylation, possibly a result of batch effects. Two contributions compared epigenome-wide heritability estimates pretreatment and posttreatment, with one employing a Bayesian LME and the other using a variance-component LME. Density curves comparing these studies indicated these heritability estimates were similar. Another contribution used a variance-component LME to depict the proportion of heritability resulting from a genetic and shared environment. By including environmental exposures as random effects, the authors found heritability estimates became more stable but not significantly different. Two contributions investigated treatment response. One estimated drug-associated methylation effects on triglyceride levels as the response, and identified 11 significant cytosine-phosphate-guanine (CpG) sites with or without adjusting for high-density lipoprotein. The second contribution performed weighted gene coexpression network analysis and identified 6 significant modules of at least 30 CpG sites, including 3 modules with topological differences pretreatment and posttreatment. Conclusions Four conclusions from this GAW20 working group are: (a) QC measures are an important consideration for EWAS studies that are investigating multiple time points or repeated measurements; (b) application of heritability estimates between time points for individual CpG sites is a useful QC measure for DNA methylation studies; (c) drug intervention demonstrated strong epigenome-wide DNA methylation patterns across the 2 time points; and (d) new statistical methods are required to account for the environmental contributions of DNA methylation across time. These contributions demonstrate numerous opportunities exist for the analysis of longitudinal data in future epigenetic studies

    Modeling dependency structures in 450k DNA methylation data

    Get PDF
    Motivation: DNA methylation has been shown to be spatially dependent across chromosomes. Previous studies have focused on the influence of genomic context on the dependency structure, while not considering differences in dependency structure between individuals. Results: We modeled spatial dependency with a flexible framework to quantify the dependency structure, focusing on inter-individual differences by exploring the association between dependency parameters and technical and biological variables. The model was applied to a subset of the Finnish Twin Cohort study (N = 1611 individuals). The estimates of the dependency parameters varied considerably across individuals, but were generally consistent across chromosomes within individuals. The variation in dependency parameters was associated with bisulfite conversion plate, zygosity, sex and age. The age differences presumably reflect accumulated environmental exposures and/or accumulated small methylation differences caused by stochastic mitotic events, establishing recognizable, individual patterns more strongly seen in older individuals.Peer reviewe

    An EPIC predictor of gestational age and its application to newborns conceived by assisted reproductive technologies

    Get PDF
    Background Gestational age is a useful proxy for assessing developmental maturity, but correct estimation of gestational age is difficult using clinical measures. DNA methylation at birth has proven to be an accurate predictor of gestational age. Previous predictors of epigenetic gestational age were based on DNA methylation data from the Illumina HumanMethylation 27 K or 450 K array, which have subsequently been replaced by the Illumina MethylationEPIC 850 K array (EPIC). Our aims here were to build an epigenetic gestational age clock specific for the EPIC array and to evaluate its precision and accuracy using the embryo transfer date of newborns from the largest EPIC-derived dataset to date on assisted reproductive technologies (ART). Methods We built an epigenetic gestational age clock using Lasso regression trained on 755 randomly selected non-ART newborns from the Norwegian Study of Assisted Reproductive Technologies (START)-a substudy of the Norwegian Mother, Father, and Child Cohort Study (MoBa). For the ART-conceived newborns, the START dataset had detailed information on the embryo transfer date and the specific ART procedure used for conception. The predicted gestational age was compared to clinically estimated gestational age in 200 non-ART and 838 ART newborns using MM-type robust regression. The performance of the clock was compared to previously published gestational age clocks in an independent replication sample of 148 newborns from the Prediction and Prevention of Preeclampsia and Intrauterine Growth Restrictions (PREDO) study-a prospective pregnancy cohort of Finnish women. Results Our new epigenetic gestational age clock showed higher precision and accuracy in predicting gestational age than previous gestational age clocks (R-2 = 0.724, median absolute deviation (MAD) = 3.14 days). Restricting the analysis to CpGs shared between 450 K and EPIC did not reduce the precision of the clock. Furthermore, validating the clock on ART newborns with known embryo transfer date confirmed that DNA methylation is an accurate predictor of gestational age (R-2 = 0.767, MAD = 3.7 days). Conclusions We present the first EPIC-based predictor of gestational age and demonstrate its robustness and precision in ART and non-ART newborns. As more datasets are being generated on the EPIC platform, this clock will be valuable in studies using gestational age to assess neonatal development.Peer reviewe

    Epigenetics, heritability and longitudinal analysis

    Get PDF
    © 2018 The Author(s). Background: Longitudinal data and repeated measurements in epigenome-wide association studies (EWAS) provide a rich resource for understanding epigenetics. We summarize 7 analytical approaches to the GAW20 data sets that addressed challenges and potential applications of phenotypic and epigenetic data. All contributions used the GAW20 real data set and employed either linear mixed effect (LME) models or marginal models through generalized estimating equations (GEE). These contributions were subdivided into 3 categories: (a) quality control (QC) methods for DNA methylation data; (b) heritability estimates pretreatment and posttreatment with fenofibrate; and (c) impact of drug response pretreatment and posttreatment with fenofibrate on DNA methylation and blood lipids. Results: Two contributions addressed QC and identified large statistical differences with pretreatment and posttreatment DNA methylation, possibly a result of batch effects. Two contributions compared epigenome-wide heritability estimates pretreatment and posttreatment, with one employing a Bayesian LME and the other using a variance-component LME. Density curves comparing these studies indicated these heritability estimates were similar. Another contribution used a variance-component LME to depict the proportion of heritability resulting from a genetic and shared environment. By including environmental exposures as random effects, the authors found heritability estimates became more stable but not significantly different. Two contributions investigated treatment response. One estimated drug-associated methylation effects on triglyceride levels as the response, and identified 11 significant cytosine-phosphate-guanine (CpG) sites with or without adjusting for high-density lipoprotein. The second contribution performed weighted gene coexpression network analysis and identified 6 significant modules of at least 30 CpG sites, including 3 modules with topological differences pretreatment and posttreatment. Conclusions: Four conclusions from this GAW20 working group are: (a) QC measures are an important consideration for EWAS studies that are investigating multiple time points or repeated measurements; (b) application of heritability estimates between time points for individual CpG sites is a useful QC measure for DNA methylation studies; (c) drug intervention demonstrated strong epigenome-wide DNA methylation patterns across the 2 time points; and (d) new statistical methods are required to account for the environmental contributions of DNA methylation across time. These contributions demonstrate numerous opportunities exist for the analysis of longitudinal data in future epigenetic studies

    Quality control for Illumina 450K methylation data in the absence of iDat files using correlation structure in pedigrees and repeated measures

    No full text
    Background An important feature in many genomic studies is quality control and normalization. This is particularly important when analyzing epigenetic data, where the process of obtaining measurements can be bias prone. The GAW20 data was from the Genetics of Lipid Lowering Drugs and Diet Network (GOLDN), a study with multigeneration families, where DNA cytosine-phosphate-guanine (CpG) methylation was measured pre- and posttreatment with fenofibrate. We performed quality control assessment of the GAW20 DNA methylation data, including normalization, assessment of batch effects and detection of sample swaps. Results We show that even after normalization, the GOLDN methylation data has systematic differences pre- and posttreatment. Through investigation of (a) CpGs sites containing a single nucleotide polymorphism, (b) the stability of breeding values for methylation across time points, and (c) autosomal gender-associated CpGs, 13 sample swaps were detected, 11 of which were posttreatment. Conclusions This paper demonstrates several ways to perform quality control of methylation data in the absence of raw data files and highlights the importance of normalization and quality control of the GAW20 methylation data from the GOLDN study

    A Bayesian mixed modeling approach for estimating heritability

    No full text
    Background A Bayesian mixed model approach using integrated nested Laplace approximations (INLA) allows us to construct flexible models that can account for pedigree structure. Using these models, we estimate genome-wide patterns of DNA methylation heritability (h2), which are currently not well understood, as well as h2 of blood lipid measurements. Methods We included individuals from the Genetics of Lipid Lowering Drugs and Diet Network (GOLDN) study with Infinium 450 K cytosine-phosphate-guanine (CpG) methylation and blood lipid data pre- and posttreatment with fenofibrate in families with up to three-generation pedigrees. For genome-wide patterns, we constructed 1 model per CpG with methylation as the response variable, with a random effect to model kinship, and age and gender as fixed effects. Results In total, 425,791 CpG sites pre-, but only 199,027 CpG sites posttreatment were found to have nonzero heritability. Across these CpG sites, the distributions of h2 estimates are similar in pre- and posttreatment (pre: median = 0.31, interquartile range [IQR] = 0.16; post: median = 0.34, IQR = 0.20). Blood lipid h2 estimates were similar pre- and posttreatment with overlapping 95% credibility intervals. Heritability was nonzero for treatment effect, that is, the difference between pre- and posttreatment blood lipids. Estimates for triglycerides h2 are 0.48 (pre), 0.42 (post), and 0.21 (difference); likewise for high-density lipoprotein cholesterol h2 the estimates are 0.61, 0.68, and 0.10. Conclusions We show that with INLA, a fully Bayesian approach to estimate DNA methylation h2 is possible on a genome-wide scale. This provides uncertainty assessment of the estimates, and allows us to perform model selection via deviance information criterion (DIC) to identify CpGs with strong evidence for nonzero heritability

    An examination of mediation by DNA methylation on birthweight differences induced by assisted reproductive technologies

    No full text
    Background Children born after assisted reproductive technologies (ART) differ in birthweight from those naturally conceived. It has been hypothesized that this might be explained by epigenetic mechanisms. We examined whether cord blood DNA methylation mediated the birthweight difference between 890 newborns conceived by ART (764 by fresh embryo transfer and 126 frozen thawed embryo transfer) and 983 naturally conceived newborns from the Norwegian Mother, Father, and Child Cohort Study (MoBa). DNA methylation was measured by the Illumina Infinium MethylationEPIC array. We conducted mediation analyses to assess whether differentially methylated CpGs mediated the differences in birthweight observed between: (1) fresh embryo transfer and natural conception and (2) frozen and fresh embryo transfer. Results We observed a difference in birthweight between fresh embryo transfer and naturally conceived offspring of − 120 g. 44% (95% confidence interval [CI] 26% to 81%) of this difference in birthweight between fresh embryo transfer and naturally conceived offspring was explained by differences in methylation levels at four CpGs near LOXL1, CDH20, and DRC1. DNA methylation differences at two CpGs near PTGS1 and RASGRP4 jointly mediated 22% (95% CI 8.1% to 50.3%) of the birthweight differences between fresh and frozen embryo transfer. Conclusion Our findings suggest that DNA methylation is an important mechanism in explaining birthweight differences according to the mode of conception. Further research should examine how gene regulation at these loci influences fetal growth

    Associations between epigenetic age acceleration and infertility

    No full text
    Study question: Is the use of ART, a proxy for infertility, associated with epigenetic age acceleration? Summary answer: The epigenetic age acceleration measured by Dunedin Pace of Aging methylation (DunedinPoAm) differed significantly between non-ART and ART mothers. What is known already: Among mothers who used ART, epigenetic age acceleration may be associated with low oocyte yield and poor ovarian response. However, the difference in epigenetic age acceleration between non-ART and ART mothers (or even fathers) has not been examined. Study design, size, duration: The Norwegian Mother, Father and Child Cohort Study (MoBa) recruited pregnant women and their partners across Norway at around 18 gestational weeks between 1999 and 2008. Approximately 95 000 mothers, 75 000 fathers and 114 000 children were included. Peripheral blood samples were taken from mothers and fathers at ultrasound appointments or from mothers at childbirth, and umbilical cord blood samples were collected from the newborns at birth. Participants/materials, setting, methods: Among the MoBa participants, we selected 1000 couples who conceived by coitus and 894 couples who conceived by IVF (n = 525) or ICSI (n = 369). We measured their DNA methylation (DNAm) levels using the Illumina MethylationEPIC array and calculated epigenetic age acceleration. A linear mixed model was used to examine the differences in five different epigenetic age accelerations between non-ART and ART parents. Main results and the role of chance: We found a significant difference in the epigenetic age acceleration calculated by DunedinPoAm between IVF and non-ART mothers (0.021 years, P-value = 2.89E-06) after adjustment for potential confounders. Further, we detected elevated DunedinPoAm in mothers with tubal factor infertility (0.030 years, P-value = 1.34E-05), ovulation factor (0.023 years, P-value = 0.0018) and unexplained infertility (0.023 years, P-value = 1.39E-04) compared with non-ART mothers. No differences in epigenetic age accelerations between non-ART and ICSI fathers were found. DunedinPoAm also showed stronger associations with smoking, education and parity than the other four epigenetic age accelerations. Limitations, reasons for caution: We were not able to determine the directionality of the causal pathway between the epigenetic age accelerations and infertility. Since parents' peripheral blood samples were collected after conception, we cannot rule out the possibility that the epigenetic profile of ART mothers was influenced by the ART treatment. Hence, the results should be interpreted with caution, and our results might not be generalizable to non-pregnant women. Wider implications of the findings: A plausible biological mechanism behind the reported association is that IVF mothers could be closer to menopause than non-ART mothers. The pace of decline of the ovarian reserve that eventually leads to menopause varies between females yet, in general, accelerates after the age of 30, and some studies show an increased risk of infertility in females with low ovarian reserve. Study funding/competing interest(s): This study was partly funded by the Research Council of Norway (Women's fertility, project no. 320656) and through its Centres of Excellence Funding Scheme (project no. 262700). M.C.M. has received funding from the European Research Council (ERC) under the European Union's Horizon 2020 research and innovation programme (grant agreement number 947684). The authors declare no conflict of interest.publishedVersio

    Blood-based epigenetic estimators of chronological age in human adults using DNA methylation data from the Illumina MethylationEPIC array

    Get PDF
    BackgroundEpigenetic clocks have been recognized for their precise prediction of chronological age, age-related diseases, and all-cause mortality. Existing epigenetic clocks are based on CpGs from the Illumina HumanMethylation450 BeadChip (450K) which has now been replaced by the latest platform, Illumina MethylationEPIC BeadChip (EPIC). Thus, it remains unclear to what extent EPIC contributes to increased precision and accuracy in the prediction of chronological age.ResultsWe developed three blood-based epigenetic clocks for human adults using EPIC-based DNA methylation (DNAm) data from the Norwegian Mother, Father and Child Cohort Study (MoBa) and the Gene Expression Omnibus (GEO) public repository: 1) an Adult Blood-based EPIC Clock (ABEC) trained on DNAm data from MoBa (n=1592, age-span: 19 to 59years), 2) an extended ABEC (eABEC) trained on DNAm data from MoBa and GEO (n=2227, age-span: 18 to 88years), and 3) a common ABEC (cABEC) trained on the same training set as eABEC but restricted to CpGs common to 450K and EPIC. Our clocks showed high precision (Pearson correlation between chronological and epigenetic age (r)>0.94) in independent cohorts, including GSE111165 (n=15), GSE115278 (n=108), GSE132203 (n=795), and the Epigenetics in Pregnancy (EPIPREG) study of the STORK Groruddalen Cohort (n=470). This high precision is unlikely due to the use of EPIC, but rather due to the large sample size of the training set.ConclusionsOur ABECs predicted adults' chronological age precisely in independent cohorts. As EPIC is now the dominant platform for measuring DNAm, these clocks will be useful in further predictions of chronological age, age-related diseases, and mortality.Peer reviewe
    corecore