55 research outputs found

    Evidence of Early-Stage Selection on EPAS1 and GPR126 Genes in Andean High Altitude Populations.

    Get PDF
    The aim of this study is to identify genetic variants that harbour signatures of recent positive selection and may facilitate physiological adaptations to hypobaric hypoxia. To achieve this, we conducted whole genome sequencing and lung function tests in 19 Argentinean highlanders (>3500 m) comparing them to 16 Native American lowlanders. We developed a new statistical procedure using a combination of population branch statistics (PBS) and number of segregating sites by length (nSL) to detect beneficial alleles that arose since the settlement of the Andes and are currently present in 15-50% of the population. We identified two missense variants as significant targets of selection. One of these variants, located within the GPR126 gene, has been previously associated with the forced expiratory volume/forced vital capacity ratio. The other novel missense variant mapped to the EPAS1 gene encoding the hypoxia inducible factor 2α. EPAS1 is known to be the major selection candidate gene in Tibetans. The derived allele of GPR126 is associated with lung function in our sample of highlanders (p < 0.05). These variants may contribute to the physiological adaptations to hypobaric hypoxia, possibly by altering lung function. The new statistical approach might be a useful tool to detect selected variants in population studies

    Positive selection of AS3MT to arsenic water in Andean populations.

    Get PDF
    Arsenic is a carcinogen associated with skin lesions and cardiovascular diseases. The Colla population from the Puna region in Northwest Argentinean is exposed to levels of arsenic in drinking water exceeding the recommended maximum by a factor of 20. Yet, they thrive in this challenging environment since thousands of years and therefore we hypothesize strong selection signatures in genes involved in arsenic metabolism. We analyzed genome-wide genotype data for 730,000 loci in 25 Collas, considering 24 individuals of the neighbouring Calchaquíes and 24 Wichí from the Gran Chaco region in the Argentine province of Salta as control groups. We identified a strong signal of positive selection in the main arsenic methyltransferase AS3MT gene, which has been previously associated with lower concentrations of the most toxic product of arsenic metabolism monomethylarsonic acid. This study confirms recent studies reporting selection signals in the AS3MT gene albeit using different samples, tests and control populations

    Monitoring SARS-CoV-2 variant transitions using differences in diagnostic cycle threshold values of target genes

    Get PDF
    Monitoring the emergence of new SARS-CoV-2 variants is important to detect potential risks of increased transmission or disease severity. We investigated the identification of SARS-CoV-2 variants from real-time reverse transcriptase polymerase chain reaction (RT-PCR) routine diagnostics data. Cycle threshold (Ct) values of positive samples were collected from April 2021 to January 2022 in the Northern Metropolitan Area of Barcelona (n = 15,254). Viral lineage identification from whole genome sequencing (WGS) was available for 4618 (30.3%) of these samples. Pairwise differences in the Ct values between gene targets (ΔCt) were analyzed for variants of concern or interest circulating in our area. A specific delay in the Ct of the N-gene compared to the RdRp-gene (ΔCt) was observed for Alpha, Delta, Eta and Omicron. Temporal differences in ΔCt correlated with the dynamics of viral replacement of Alpha by Delta and of Delta by Omicron according to WGS results. Using ΔCt, prediction of new variants of concern at early stages of circulation was achieved with high sensitivity and specificity (91.1% and 97.8% for Delta; 98.5% and 90.8% for Omicron). Thus, tracking population-wide trends in ΔCt values obtained from routine diagnostics testing in combination with WGS could be useful for real-time management and response to local epidemics

    A Selective Sweep on a Deleterious Mutation in CPT1A in Arctic Populations.

    Get PDF
    Arctic populations live in an environment characterized by extreme cold and the absence of plant foods for much of the year and are likely to have undergone genetic adaptations to these environmental conditions in the time they have been living there. Genome-wide selection scans based on genotype data from native Siberians have previously highlighted a 3 Mb chromosome 11 region containing 79 protein-coding genes as the strongest candidates for positive selection in Northeast Siberians. However, it was not possible to determine which of the genes might be driving the selection signal. Here, using whole-genome high-coverage sequence data, we identified the most likely causative variant as a nonsynonymous G>A transition (rs80356779; c.1436C>T [p.Pro479Leu] on the reverse strand) in CPT1A, a key regulator of mitochondrial long-chain fatty-acid oxidation. Remarkably, the derived allele is associated with hypoketotic hypoglycemia and high infant mortality yet occurs at high frequency in Canadian and Greenland Inuits and was also found at 68% frequency in our Northeast Siberian sample. We provide evidence of one of the strongest selective sweeps reported in humans; this sweep has driven this variant to high frequency in circum-Arctic populations within the last 6-23 ka despite associated deleterious consequences, possibly as a result of the selective advantage it originally provided to either a high-fat diet or a cold environment.This research was supported by ERC Starting Investigator grant (FP7 - 261213) to T.K. http://erc.europa.eu/. CTS, YX, QA and MS were supported by the Wellcome Trust (098051). TA was supported by The Wellcome Trust (WT100066MA). M.M and R.V. were supported by EU ERDF Centre of Excellence in Genomics to EBC; T.K, M.M and R.V. by Estonian Institutional Research grant (IUT24-1), and M.M by Estonian Science Foundation (grant 8973).This is the accepted manuscript. The final version is available from Cell/Elsevier at http://www.cell.com/ajhg/abstract/S0002-9297%2814%2900422-4

    Epigenome-Wide Association Study of Incident Type 2 Diabetes in a British Population: EPIC-Norfolk Study.

    Get PDF
    Epigenetic changes may contribute substantially to risks of diseases of aging. Previous studies reported seven methylation variable positions (MVPs) robustly associated with incident type 2 diabetes mellitus (T2DM). However, their causal roles in T2DM are unclear. In an incident T2DM case-cohort study nested within the population-based European Prospective Investigation into Cancer and Nutrition (EPIC)-Norfolk cohort, we used whole blood DNA collected at baseline, up to 11 years before T2DM onset, to investigate the role of methylation in the etiology of T2DM. We identified 15 novel MVPs with robust associations with incident T2DM and robustly confirmed three MVPs identified previously (near to TXNIP, ABCG1, and SREBF1). All 18 MVPs showed directionally consistent associations with incident and prevalent T2DM in independent studies. Further conditional analyses suggested that the identified epigenetic signals appear related to T2DM via glucose and obesity-related pathways acting before the collection of baseline samples. We integrated genome-wide genetic data to identify methylation-associated quantitative trait loci robustly associated with 16 of the 18 MVPs and found one MVP, cg00574958 at CPT1A, with a possible direct causal role in T2DM. None of the implicated genes were previously highlighted by genetic association studies, suggesting that DNA methylation studies may reveal novel biological mechanisms involved in tissue responses to glycemia

    GWAS of epigenetic aging rates in blood reveals a critical role for TERT.

    Get PDF
    DNA methylation age is an accurate biomarker of chronological age and predicts lifespan, but its underlying molecular mechanisms are unknown. In this genome-wide association study of 9907 individuals, we find gene variants mapping to five loci associated with intrinsic epigenetic age acceleration (IEAA) and gene variants in three loci associated with extrinsic epigenetic age acceleration (EEAA). Mendelian randomization analysis suggests causal influences of menarche and menopause on IEAA and lipoproteins on IEAA and EEAA. Variants associated with longer leukocyte telomere length (LTL) in the telomerase reverse transcriptase gene (TERT) paradoxically confer higher IEAA (P < 2.7 × 10-11). Causal modeling indicates TERT-specific and independent effects on LTL and IEAA. Experimental hTERT-expression in primary human fibroblasts engenders a linear increase in DNA methylation age with cell population doubling number. Together, these findings indicate a critical role for hTERT in regulating the epigenetic clock, in addition to its established role of compensating for cell replication-dependent telomere shortening

    Identifying and correcting epigenetics measurements for systematic sources of variation

    Get PDF
    Abstract Background Methylation measures quantified by microarray techniques can be affected by systematic variation due to the technical processing of samples, which may compromise the accuracy of the measurement process and contribute to bias the estimate of the association under investigation. The quantification of the contribution of the systematic source of variation is challenging in datasets characterized by hundreds of thousands of features. In this study, we introduce a method previously developed for the analysis of metabolomics data to evaluate the performance of existing normalizing techniques to correct for unwanted variation. Illumina Infinium HumanMethylation450K was used to acquire methylation levels in over 421,000 CpG sites for 902 study participants of a case-control study on breast cancer nested within the EPIC cohort. The principal component partial R-square (PC-PR2) analysis was used to identify and quantify the variability attributable to potential systematic sources of variation. Three correcting techniques, namely ComBat, surrogate variables analysis (SVA) and a linear regression model to compute residuals were applied. The impact of each correcting method on the association between smoking status and DNA methylation levels was evaluated, and results were compared with findings from a large meta-analysis. Results A sizeable proportion of systematic variability due to variables expressing ‘batch’ and ‘sample position’ within ‘chip’ was identified, with values of the partial R2 statistics equal to 9.5 and 11.4% of total variation, respectively. After application of ComBat or the residuals’ methods, the contribution was 1.3 and 0.2%, respectively. The SVA technique resulted in a reduced variability due to ‘batch’ (1.3%) and ‘sample position’ (0.6%), and in a diminished variability attributable to ‘chip’ within a batch (0.9%). After ComBat or the residuals’ corrections, a larger number of significant sites (k = 600 and k = 427, respectively) were associated to smoking status than the SVA correction (k = 96). Conclusions The three correction methods removed systematic variation in DNA methylation data, as assessed by the PC-PR2, which lent itself as a useful tool to explore variability in large dimension data. SVA produced more conservative findings than ComBat in the association between smoking and DNA methylation

    Genomic and phenotypic insights from an atlas of genetic effects on DNA methylation

    Get PDF
    Characterizing genetic influences on DNA methylation (DNAm) provides an opportunity to understand mechanisms underpinning gene regulation and disease. In the present study, we describe results of DNAm quantitative trait locus (mQTL) analyses on 32,851 participants, identifying genetic variants associated with DNAm at 420,509 DNAm sites in blood. We present a database of >270,000 independent mQTLs, of which 8.5% comprise long-range (trans) associations. Identified mQTL associations explain 15–17% of the additive genetic variance of DNAm. We show that the genetic architecture of DNAm levels is highly polygenic. Using shared genetic control between distal DNAm sites, we constructed networks, identifying 405 discrete genomic communities enriched for genomic annotations and complex traits. Shared genetic variants are associated with both DNAm levels and complex diseases, but only in a minority of cases do these associations reflect causal relationships from DNAm to trait or vice versa, indicating a more complex genotype–phenotype map than previously anticipated

    Genomic and phenotypic insights from an atlas of genetic effects on DNA methylation.

    Get PDF
    Characterizing genetic influences on DNA methylation (DNAm) provides an opportunity to understand mechanisms underpinning gene regulation and disease. In the present study, we describe results of DNAm quantitative trait locus (mQTL) analyses on 32,851 participants, identifying genetic variants associated with DNAm at 420,509 DNAm sites in blood. We present a database of >270,000 independent mQTLs, of which 8.5% comprise long-range (trans) associations. Identified mQTL associations explain 15–17% of the additive genetic variance of DNAm. We show that the genetic architecture of DNAm levels is highly polygenic. Using shared genetic control between distal DNAm sites, we constructed networks, identifying 405 discrete genomic communities enriched for genomic annotations and complex traits. Shared genetic variants are associated with both DNAm levels and complex diseases, but only in a minority of cases do these associations reflect causal relationships from DNAm to trait or vice versa, indicating a more complex genotype–phenotype map than previously anticipated.C.L.R., G.D.S., G.S., J.L.M., K.B., M. Suderman, T.G.R. and T.R.G. are supported by the UK Medical Research Council (MRC) Integrative Epidemiology Unit at the University of Bristol (MC_UU_00011/1, MC_UU_00011/4, MC_UU_00011/5). C.L.R. receives support from a Cancer Research UK Programme grant (no. C18281/A191169). G.H. is funded by the Wellcome Trust and the Royal Society (208806/Z/17/Z). E.H. and J.M. were supported by MRC project grants (nos. MR/K013807/1 and MR/R005176/1 to J.M.) and an MRC Clinical Infrastructure award (no. MR/M008924/1 to J.M.). B.T.H. is supported by the Netherlands CardioVascular Research Initiative (the Dutch Heart Foundation, Dutch Federation of University Medical Centres, the Netherlands Organisation for Health Research and Development, and the Royal Netherlands Academy of Sciences) for the GENIUS project ‘Generating the best evidence-based pharmaceutical targets for atherosclerosis’ (CVON2011-19, CVON2017-20). J.T.B. was supported by the Economic and Social Research Council (grant no. ES/N000404/1). The present study was also supported by JPI HDHL-funded DIMENSION project (administered by the BBSRC UK, grant no. BB/S020845/1 to J.T.B., and by ZonMW the Netherlands, grant no. 529051021 to B.T.H). A.D.B. has been supported by a Wellcome Trust PhD Training Fellowship for Clinicians and the Edinburgh Clinical Academic Track programme (204979/Z/16/Z). J. Klughammer was supported by a DOC fellowship of the Austrian Academy of Sciences. Cohort-specific acknowledgements and funding are presented in the Supplementary Note

    Genomic analyses inform on migration events during the peopling of Eurasia

    Get PDF
    High-coverage whole-genome sequence studies have so far focused\ud on a limited number1 of geographically restricted populations2–5,\ud or been targeted at specific diseases, such as cancer6. Nevertheless,\ud the availability of high-resolution genomic data has led to the\ud development of new methodologies for inferring population\ud history7–9 and refuelled the debate on the mutation rate in humans10.\ud Here we present the Estonian Biocentre Human Genome Diversity\ud Panel (EGDP), a dataset of 483 high-coverage human genomes\ud from 148 populations worldwide, including 379 new genomes from\ud 125 populations, which we group into diversity and selection\ud sets. We analyse this dataset to refine estimates of continent-wide\ud patterns of heterozygosity, long- and short-distance gene flow, archaic\ud admixture, and changes in effective population size through time as\ud well as for signals of positive or balancing selection. We find a genetic\ud signature in present-day Papuans that suggests that at least 2% of\ud their genome originates from an early and largely extinct expansion\ud of anatomically modern humans (AMHs) out of Africa. Together\ud with evidence from the western Asian fossil record11, and admixture\ud between AMHs and Neanderthals predating the main Eurasian\ud expansion12, our results contribute to the mounting evidence for\ud the presence of AMHs out of Africa earlier than 75,000 years ago
    • …
    corecore