20 research outputs found

    Genomics of clinal local adaptation in Pinus sylvestris under continuous environmental and spatial genetic setting

    Get PDF
    Understanding the consequences of local adaptation at the genomic diversity is a central goal in evolutionary genetics of natural populations. In species with large continuous geographical distributions the phenotypic signal of local adaptation is frequently clear, but the genetic basis often remains elusive. We examined the patterns of genetic diversity inPinus sylvestris, a keystone species in many Eurasian ecosystems with a huge distribution range and decades of forestry research showing that it is locally adapted to the vast range of environmental conditions. MakingP. sylvestrisan even more attractive subject of local adaptation study, population structure has been shown to be weak previously and in this study. However, little is known about the molecular genetic basis of adaptation, as the massive size of gymnosperm genomes has prevented large scale genomic surveys. We generated a both geographically and genomically extensive dataset using a targeted sequencing approach. By applying divergence-based and landscape genomics methods we identified several loci contributing to local adaptation, but only few with large allele frequency changes across latitude. We also discovered a very large (ca. 300 Mbp) putative inversion potentially under selection, which to our knowledge is the first such discovery in conifers. Our results call for more detailed analysis of structural variation in relation to genomic basis of local adaptation, emphasize the lack of large effect loci contributing to local adaptation in the coding regions and thus point out the need for more attention toward multi-locus analysis of polygenic adaptation

    Leveraging Northern European population history : novel low-frequency variants for polycystic ovary syndrome

    Get PDF
    STUDY QUESTION Can we identify novel variants associated with polycystic ovary syndrome (PCOS) by leveraging the unique population history of Northern Europe? SUMMARY ANSWER We identified three novel genome-wide significant associations with PCOS, with two putative independent causal variants in the checkpoint kinase 2 (CHEK2) gene and a third in myosin X (MYO10). WHAT IS KNOWN ALREADY PCOS is a common, complex disorder with unknown aetiology. While previous genome-wide association studies (GWAS) have mapped several loci associated with PCOS, the analysis of populations with unique population history and genetic makeup has the potential to uncover new low-frequency variants with larger effects. STUDY DESIGN, SIZE, DURATION A population-based case-control GWAS was carried out. PARTICIPANTS/MATERIALS, SETTING, METHODS We identified PCOS cases from national registers by ICD codes (ICD-10 E28.2, ICD-9 256.4, or ICD-8 256.90), and all remaining women were considered controls. We then conducted a three-stage case-control GWAS: in the discovery phase, we had a total of 797 cases and 140 558 controls from the FinnGen study. For validation, we used an independent dataset from the Estonian Biobank, including 2812 cases and 89 230 controls. Finally, we performed a joint meta-analysis of 3609 cases and 229 788 controls from both cohorts. Additionally, we reran the association analyses including BMI as a covariate, with 2169 cases and 160 321 controls from both cohorts. MAIN RESULTS AND THE ROLE OF CHANCE Two out of the three novel genome-wide significant variants associating with PCOS, rs145598156 (P = 3.6x10(-8), odds ratio (OR) = 3.01 [2.02-4.50] minor allele frequency (MAF) = 0.005) and rs182075939 (P = 1.9x10(-16), OR = 1.69 [1.49-1.91], MAF = 0.04), were found to be enriched in the Finnish and Estonian populations and are tightly linked to a deletion c.1100delC (r(2) = 0.95) and a missense I157T (r(2) = 0.83) in CHEK2. The third novel association is a common variant near MYO10 (rs9312937, P = 1.7 x 10(-8), OR = 1.16 [1.10-1.23], MAF = 0.44). We also replicated four previous reported associations near the genes Erb-B2 Receptor Tyrosine Kinase 4 (ERBB4), DENN Domain Containing 1A (DENND1A), FSH Subunit Beta (FSHB) and Zinc Finger And BTB Domain Containing 16 (ZBTB16). When adding BMI as a covariate only one of the novel variants remained genome-wide significant in the meta-analysis (the EstBB lead signal in CHEK2 rs182075939, P = 1.9x10(-16), OR = 1.74 [1.5-2.01]) possibly owing to reduced sample size. LARGE SCALE DATA The age- and BMI-adjusted GWAS meta-analysis summary statistics are available for download from the GWAS Catalog with accession numbers GCST90044902 and GCST90044903. LIMITATIONS, REASONS FOR CAUTION The main limitation was the low prevalence of PCOS in registers; however, the ones with the diagnosis most likely represent the most severe cases. Also, BMI data were not available for all (63% for FinnGen, 76% for EstBB), and the biobank setting limited the accessibility of PCOS phenotypes and laboratory values. WIDER IMPLICATIONS OF THE FINDINGS This study encourages the use of isolated populations to perform genetic association studies for the identification of rare variants contributing to the genetic landscape of complex diseases such as PCOS. STUDY FUNDING/COMPETING INTEREST(S) This work has received funding from the European Union's Horizon 2020 research and innovation programme under the MATER Marie Sklodowska-Curie grant agreement No. 813707 (N.P.-G., T.L., T.P.), the Estonian Research Council grant (PRG687, T.L.), the Academy of Finland grants 315921 (T.P.), 321763 (T.P.), 297338 (J.K.), 307247 (J.K.), 344695 (H.L.), Novo Nordisk Foundation grant NNF17OC0026062 (J.K.), the Sigrid Juselius Foundation project grants (T.L., J.K., T.P.), Finska Lakaresallskapet (H.L.) and Jane and Aatos Erkko Foundation (H.L.). The funders had no role in study design, data collection and analysis, publishing or preparation of the manuscript. The authors declare no conflicts of interest.Peer reviewe

    Taming the massive genome of Scots pine with PiSy50k, a new genotyping array for conifer research

    Get PDF
    Pinus sylvestris (Scots pine) is the most widespread coniferous tree in the boreal forests of Eurasia, with major economic and ecological importance. However, its large and repetitive genome presents a challenge for conducting genome-wide analyses such as association studies, genetic mapping and genomic selection. We present a new 50K single-nucleotide polymorphism (SNP) genotyping array for Scots pine research, breeding and other applications. To select the SNP set, we first genotyped 480 Scots pine samples on a 407 540 SNP screening array and identified 47 712 high-quality SNPs for the final array (called 'PiSy50k'). Here, we provide details of the design and testing, as well as allele frequency estimates from the discovery panel, functional annotation, tissue-specific expression patterns and expression level information for the SNPs or corresponding genes, when available. We validated the performance of the PiSy50k array using samples from Finland and Scotland. Overall, 39 678 (83.2%) SNPs showed low error rates (mean = 0.9%). Relatedness estimates based on array genotypes were consistent with the expected pedigrees, and the level of Mendelian error was negligible. In addition, array genotypes successfully discriminate between Scots pine populations of Finnish and Scottish origins. The PiSy50k SNP array will be a valuable tool for a wide variety of future genetic studies and forestry applications.Peer reviewe

    Evidence of a causal effect of genetic tendency to gain muscle mass on uterine leiomyomata

    No full text
    Many genetic factors that contribute to uterine leiomyomata (UL) - the most common tumours of the female genital tract - remain to be discovered. Here, the authors conduct a UL meta-genome-wide association study, and find loci related to altered muscle tissue biology that are associated with UL

    Evidence of a causal effect of genetic tendency to gain muscle mass on uterine leiomyomata

    No full text
    Uterine leiomyomata (UL) are the most common tumours of the female genital tract and the primary cause of surgical removal of the uterus. Genetic factors contribute to UL susceptibility. To add understanding to the heritable genetic risk factors, we conduct a genome-wide association study (GWAS) of UL in up to 426,558 European women from FinnGen and a previous UL meta-GWAS. In addition to the 50 known UL loci, we identify 22 loci that have not been associated with UL in prior studies. UL-associated loci harbour genes enriched for development, growth, and cellular senescence. Of particular interest are the smooth muscle cell differentiation and proliferation-regulating genes functioning on the myocardin-cyclin dependent kinase inhibitor 1 A pathway. Our results further suggest that genetic predisposition to increased fat-free mass may be causally related to higher UL risk, underscoring the involvement of altered muscle tissue biology in UL pathophysiology. Overall, our findings add to the understanding of the genetic pathways underlying UL, which may aid in developing novel therapeutics.peerReviewe

    Associations of polygenic risk scores for preeclampsia and blood pressure with hypertensive disorders of pregnancy

    Get PDF
    Background:Preexisting hypertension increases risk for preeclampsia. We examined whether a generic blood pressure polygenic risk score (BP-PRS), compared with a preeclampsia-specific polygenic risk score (PE-PRS), could better predict hypertensive disorders of pregnancy.Methods:Our study sample included 141 298 genotyped FinnGen study participants with at least one childbirth and followed from 1969 to 2021. We calculated PRSs for SBP and preeclampsia using summary statistics for greater than 1.1 million single nucleotide polymorphisms.Results:We observed 8488 cases of gestational hypertension (GHT) and 6643 cases of preeclampsia. BP-PRS was associated with GHT [multivariable-adjusted hazard ratio for 1SD increase in PRS (hazard ratio 1.38; 95% CI 1.35-1.41)] and preeclampsia (1.26, 1.23-1.29), respectively. The PE-PRS was also associated with GHT (1.16; 1.14-1.19) and preeclampsia (1.21, 1.18-1.24), but with statistically more modest magnitudes of effect (P = 0.01). The model c-statistic for preeclampsia improved when PE-PRS was added to clinical risk factors (P = 4.6 x 10(-15)). Additional increment in the c-statistic was observed when BP-PRS was added to a model already including both clinical risk factors and PE-PRS (P = 1.1 x 10(-14)).Conclusion:BP-PRS is strongly associated with hypertensive disorders of pregnancy. Our current observations suggest that the BP-PRS could capture the genetic architecture of preeclampsia better than the current PE-PRSs. These findings also emphasize the common pathways in the development of all BP disorders. The clinical utility of a BP-PRS for preeclampsia prediction warrants further investigation.Peer reviewe

    Genomics of clinal local adaptation in Pinus sylvestris under continuous environmental and spatial genetic setting

    No full text
    Abstract Understanding the consequences of local adaptation at the genomic diversity is a central goal in evolutionary genetics of natural populations. In species with large continuous geographical distributions the phenotypic signal of local adaptation is frequently clear, but the genetic basis often remains elusive. We examined the patterns of genetic diversity in Pinus sylvestris, a keystone species in many Eurasian ecosystems with a huge distribution range and decades of forestry research showing that it is locally adapted to the vast range of environmental conditions. Making P. sylvestris an even more attractive subject of local adaptation study, population structure has been shown to be weak previously and in this study. However, little is known about the molecular genetic basis of adaptation, as the massive size of gymnosperm genomes has prevented large scale genomic surveys. We generated a both geographically and genomically extensive dataset using a targeted sequencing approach. By applying divergence-based and landscape genomics methods we identified several loci contributing to local adaptation, but only few with large allele frequency changes across latitude. We also discovered a very large (ca. 300 Mbp) putative inversion potentially under selection, which to our knowledge is the first such discovery in conifers. Our results call for more detailed analysis of structural variation in relation to genomic basis of local adaptation, emphasize the lack of large effect loci contributing to local adaptation in the coding regions and thus point out the need for more attention toward multi-locus analysis of polygenic adaptation
    corecore