143 research outputs found

    MethylPCA: a toolkit to control for confounders in methylome-wide association studies

    Get PDF
    Background In methylome-wide association studies (MWAS) there are many possible differences between cases and controls (e.g. related to life style, diet, and medication use) that may affect the methylome and produce false positive findings. An effective approach to control for these confounders is to first capture the major sources of variation in the methylation data and then regress out these components in the association analyses. This approach is, however, computationally very challenging due to the extremely large number of methylation sites in the human genome. Result We introduce MethylPCA that is specifically designed to control for potential confounders in studies where the number of methylation sites is extremely large. MethylPCA offers a complete and flexible data analysis including 1) an adaptive method that performs data reduction prior to PCA by empirically combining methylation data of neighboring sites, 2) an efficient algorithm that performs a principal component analysis (PCA) on the ultra high-dimensional data matrix, and 3) association tests. To accomplish this MethylPCA allows for parallel execution of tasks, uses C++ for CPU and I/O intensive calculations, and stores intermediate results to avoid computing the same statistics multiple times or keeping results in memory. Through simulations and an analysis of a real whole methylome MBD-seq study of 1,500 subjects we show that MethylPCA effectively controls for potential confounders. Conclusions MethylPCA provides users a convenient tool to perform MWAS. The software effectively handles the challenge in memory and speed to perform tasks that would be impossible to accomplish using existing software when millions of sites are interrogated with the sample sizes required for MWAS

    A New Method for Detecting Associations with Rare Copy-Number Variants

    Get PDF
    Copy number variants (CNVs) play an important role in the etiology of many diseases such as cancers and psychiatric disorders. Due to a modest marginal effect size or the rarity of the CNVs, collapsing rare CNVs together and collectively evaluating their effect serves as a key approach to evaluating the collective effect of rare CNVs on disease risk. While a plethora of powerful collapsing methods are available for sequence variants (e.g., SNPs) in association analysis, these methods cannot be directly applied to rare CNVs due to the CNV-specific challenges, i.e., the multi-faceted nature of CNV polymorphisms (e.g., CNVs vary in size, type, dosage, and details of gene disruption), and etiological heterogeneity (e.g., heterogeneous effects of duplications and deletions that occur within a locus or in different loci). Existing CNV collapsing analysis methods (a.k.a. the burden test) tend to have suboptimal performance due to the fact that these methods often ignore heterogeneity and evaluate only the marginal effects of a CNV feature. We introduce CCRET, a random effects test for collapsing rare CNVs when searching for disease associations. CCRET is applicable to variants measured on a multi-categorical scale, collectively modeling the effects of multiple CNV features, and is robust to etiological heterogeneity. Multiple confounders can be simultaneously corrected. To evaluate the performance of CCRET, we conducted extensive simulations and analyzed large-scale schizophrenia datasets. We show that CCRET has powerful and robust performance under multiple types of etiological heterogeneity, and has performance comparable to or better than existing methods when there is no heterogeneity

    Heritability of perinatal depression and genetic overlap with nonperinatal depression.

    Get PDF
    OBJECTIVE: The authors investigated the relative importance of genetic and environmental influences on perinatal depression, and the genetic overlap between perinatal depression and nonperinatal depression. METHOD: Analyses were conducted using structural equation modeling for 1) the lifetime version of the Edinburgh Postnatal Depression Scale in 3,427 Swedish female twins and 2) clinical diagnoses of depression separated into perinatal depression and nonperinatal depression in a Swedish population-based cohort of 580,006 sisters. RESULTS: In the twin study, the heritability of perinatal depression was estimated at 54% (95% CI=35%-70%), with the remaining variance attributable to nonshared environment (46%; 95% CI=31%-65%). In the sibling design, the heritability of perinatal depression was estimated at 44% (95% CI=35%-52%) and the heritability of nonperinatal depression at 32% (95% CI=24%-41%). Bivariate analysis showed that 14% of the total variance (or 33% of the genetic variance) in perinatal depression was unique for perinatal depression. CONCLUSIONS: The heritability of perinatal depression was estimated at 54% and 44%, respectively, in separate samples, and the heritability of nonperinatal depression at 32%. One-third of the genetic contribution was unique to perinatal depression and not shared with nonperinatal depression, suggesting only partially overlapping genetic etiologies for perinatal depression and nonperinatal depression. The authors suggest that perinatal depression constitutes a subset of depression that could be prioritized for genomic discovery efforts. The study findings have direct translational impact that can assist clinicians in the counseling of their patients regarding risk and prognosis of perinatal depression.The Swedish Research CouncilThe Swedish foundation for Strategic ResearchThe Swedish Brain foundationThe National Institute of Mental HealthAccepte

    One CNV Discordance in <span class="italic">NRXN1</span> Observed Upon Genome-wide Screening in 38 Pairs of Adult Healthy Monozygotic Twins

    Get PDF
    Monozygotic (MZ) twins stem from the same single fertilized egg and therefore share all their inherited genetic variation. This is one of the unequivocal facts on which genetic epidemiology and twin studies are based. To what extent this also implies that MZ twins share genotypes in adult tissues is not precisely established, but a common pragmatic assumption is that MZ twins are 100% genetically identical also in adult tissues. During the past decade, this view has been challenged by several reports, with observations of differences in post-zygotic copy number variations (CNVs) between members of the same MZ pair. In this study, we performed a systematic search for differences of CNVs within 38 adult MZ pairs who had been misclassified as dizygotic (DZ) twins by questionnaire-based assessment. Initial scoring by PennCNV suggested a total of 967 CNV discordances. The within-pair correlation in number of CNVs detected was strongly dependent on confidence score filtering and reached a plateau of r = 0.8 when restricting to CNVs detected with confidence score larger than 50. The top-ranked discordances were subsequently selected for validation by quantitative polymerase chain reaction (qPCR), from which one single ~120kb deletion in NRXN1 on chromosome 2 (bp 51017111-51136802) was validated. Despite involving an exon, no sign of cognitive/mental consequences was apparent in the affected twin pair, potentially reflecting limited or lack of expression of the transcripts containing this exon in nerve/brain

    Utilizing Twins as Controls for Non-Twin Case-Materials in Genome Wide Association Studies

    Get PDF
    Twin registries around the globe have collected DNA samples from large numbers of monozygotic and dizygotic twins. The twin sample collections are frequently used as controls in disease-specific studies together with non-twins. This approach is unbiased under the hypothesis that twins and singletons are comparable in terms of allele frequencies; i.e. there are no genetic variants associated with being a twin per se. To test this hypothesis we performed a genome-wide association study comparing the allele frequency of 572,352 single nucleotide polymorphisms (SNPs) in 1,413 monozygotic (MZ) and 5,451 dizygotic (DZ) twins with 3,720 healthy singletons. Twins and singletons have been genotyped using the same platform. SNPs showing association with being a twin at P-value < 1 × 10-5 were selected for replication analysis in 1,492 twins (463 MZ and 1,029 DZ) and 1,880 singletons from Finland. No SNPs reached genome-wide significance (P-value < 5 × 10-8) in the main analysis combining MZ and DZ twins. In a secondary analysis including only DZ twins two SNPs (rs2033541 close to ADAMTSL1 and rs4149283 close to ABCA1) were genome-wide significant after meta-analysis with the Finnish population. The estimated proportion of variance on the liability scale explained by all SNPs was 0.08 (P-value=0.003) when MZ and DZ were considered together and smaller for MZ (0.06, P-value=0.10) compared to DZ (0.09, P-value=0.003) when analyzed separately. In conclusion, twins and singletons can be used in genetic studies together with general population samples without introducing large bias. Further research is needed to explore genetic variances associated with DZ twinning

    Mental health indicators in Sweden over a 12-month period during the COVID-19 pandemic – Baseline data of the Omtanke2020 Study

    Get PDF
    Funding Information: This study was funded with grants from NordForsk (CovidMent, 105668 ), Horizon 2020 (CoMorMent, 847776 ), and the Karolinska Institutet . Funding Information: The Omtanke2020 study is supported by NordForsk (project No. 105668 ) and Karolinska Institute (Strategic Research Area in Epidemiology and Senior Researcher Award). We acknowledge The Swedish Twin Registry for access to contact information to participating twins. The Swedish Twin Registry is managed by Karolinska Institutet and receives funding through the Swedish Research Council under the grant no 2017-00641. The Funding Sources had no direct or indirect impact on the analysis and interpretation of the results. Publisher Copyright: © 2022 The AuthorsBackground: The ongoing COVID-19 pandemic has had an unprecedented impact on the lives of people globally and is expected to have profound effects on mental health. Here we aim to describe the mental health burden experienced in Sweden using baseline data of the Omtanke2020 Study. Method: We analysed self-reported, cross-sectional baseline data collected over a 12-month period (June 9, 2020–June 8, 2021) from the Omtanke2020 Study including 27,950 adults in Sweden. Participants were volunteers or actively recruited through existing cohorts and, after providing informed consent, responded to online questionnaires on socio-demographics, mental and physical health, as well as COVID-19 infection and impact. Poisson regression was fitted to assess the relative risk of demonstrating high level symptoms of depression, anxiety, and COVID-19 related distress. Result: The proportion of persons with high level of symptoms was 15.6 %, 9.5 % and 24.5 % for depression, anxiety, and COVID-19 specific post-traumatic stress disorder (PTSD), respectively. Overall, 43.4 % of the participants had significant, clinically relevant symptoms for at least one of the three mental health outcomes and 7.3 % had significant symptoms for all three outcomes. We also observed differences in the prevalence of these outcomes across strata of sex, age, recruitment type, COVID-19 status, region, and seasonality. Conclusion: While the proportion of persons with high mental health burden remains higher than the ones reported in pre-pandemic publications, our estimates are lower than previously reported levels of depression, anxiety, and PTSD during the pandemic in Sweden and elsewhere.Peer reviewe

    A methylome-wide study of aging using massively parallel sequencing of the methyl-CpG-enriched genomic fraction from blood in over 700 subjects

    Get PDF
    The central importance of epigenetics to the aging process is increasingly being recognized. Here we perform a methylome-wide association study (MWAS) of aging in whole blood DNA from 718 individuals, aged 25–92 years (mean = 55). We sequenced the methyl-CpG-enriched genomic DNA fraction, averaging 67.3 million reads per subject, to obtain methylation measurements for the ∼27 million autosomal CpGs in the human genome. Following extensive quality control, we adaptively combined methylation measures for neighboring, highly-correlated CpGs into 4 344 016 CpG blocks with which we performed association testing. Eleven age-associated differentially methylated regions (DMRs) passed Bonferroni correction (P-value < 1.15 × 10−8). Top findings replicated in an independent sample set of 558 subjects using pyrosequencing of bisulfite-converted DNA (min P-value < 10−30). To examine biological themes, we selected 70 DMRs with false discovery rate of <0.1. Of these, 42 showed hypomethylation and 28 showed hypermethylation with age. Hypermethylated DMRs were more likely to overlap with CpG islands and shores. Hypomethylated DMRs were more likely to be in regions associated with polycomb/regulatory proteins (e.g. EZH2) or histone modifications H3K27ac, H3K4m1, H3K4m2, H3K4m3 and H3K9ac. Among genes implicated by the top DMRs were protocadherins, homeobox genes, MAPKs and ryanodine receptors. Several of our DMRs are at genes with potential relevance for age-related disease. This study successfully demonstrates the application of next-generation sequencing to MWAS, by interrogating a large proportion of the methylome and returning potentially novel age DMRs, in addition to replicating several loci implicated in previous studies using microarrays
    • …
    corecore