63 research outputs found

    The sequences of 150,119 genomes in the UK Biobank

    Get PDF
    Detailed knowledge of how diversity in the sequence of the human genome affects phenotypic diversity depends on a comprehensive and reliable characterization of both sequences and phenotypic variation. Over the past decade, insights into this relationship have been obtained from whole-exome sequencing or whole-genome sequencing of large cohorts with rich phenotypic data(1,2). Here we describe the analysis of whole-genome sequencing of 150,119 individuals from the UK Biobank(3). This constitutes a set of high-quality variants, including 585,040,410 single-nucleotide polymorphisms, representing 7.0% of all possible human single-nucleotide polymorphisms, and 58,707,036 indels. This large set of variants allows us to characterize selection based on sequence variation within a population through a depletion rank score of windows along the genome. Depletion rank analysis shows that coding exons represent a small fraction of regions in the genome subject to strong sequence conservation. We define three cohorts within the UK Biobank: a large British Irish cohort, a smaller African cohort and a South Asian cohort. A haplotype reference panel is provided that allows reliable imputation of most variants carried by three or more sequenced individuals. We identified 895,055 structural variants and 2,536,688 microsatellites, groups of variants typically excluded from large-scale whole-genome sequencing studies. Using this formidable new resource, we provide several examples of trait associations for rare variants with large effects not found previously through studies based on whole-exome sequencing and/or imputation

    Ancient genomics

    Get PDF
    The past decade has witnessed a revolution in ancient DNA (aDNA) research. Although the field's focus was previously limited to mitochondrial DNA and a few nuclear markers, whole genome sequences from the deep past can now be retrieved. This breakthrough is tightly connected to the massive sequence throughput of next generation sequencing platforms and the ability to target short and degraded DNA molecules. Many ancient specimens previously unsuitable for DNA analyses because of extensive degradation can now successfully be used as source materials. Additionally, the analytical power obtained by increasing the number of sequence reads to billions effectively means that contamination issues that have haunted aDNA research for decades, particularly in human studies, can now be efficiently and confidently quantified. At present, whole genomes have been sequenced from ancient anatomically modern humans, archaic hominins, ancient pathogens and megafaunal species. Those have revealed important functional and phenotypic information, as well as unexpected adaptation, migration and admixture patterns. As such, the field of aDNA has entered the new era of genomics and has provided valuable information when testing specific hypotheses related to the past.No Full Tex

    Sequence variants in malignant hyperthermia genes in Iceland: classification and actionable findings in a population database.

    Get PDF
    To access publisher's full text version of this article, please click on the hyperlink in Additional Links field or click on the hyperlink at the top of the page marked DownloadMalignant hyperthermia (MH) susceptibility is a rare life-threatening disorder that occurs upon exposure to a triggering agent. MH is commonly due to protein-altering variants in RYR1 and CACNA1S. The American College of Medical Genetics and Genomics recommends that when pathogenic and likely pathogenic variants in RYR1 and CACNA1S are incidentally found, they should be reported to the carriers. The detection of actionable variants allows the avoidance of exposure to triggering agents during anesthesia. First, we report a 10-year-old Icelandic proband with a suspected MH event, harboring a heterozygous missense variant NM_000540.2:c.6710G>A r.(6710g>a) p.(Cys2237Tyr) in the RYR1 gene that is likely pathogenic. The variant is private to four individuals within a three-generation family and absent from 62,240 whole-genome sequenced (WGS) Icelanders. Haplotype sharing and WGS revealed that the variant occurred as a somatic mosaicism also present in germline of the proband's paternal grandmother. Second, using a set of 62,240 Icelanders with WGS, we assessed the carrier frequency of actionable pathogenic and likely pathogenic variants in RYR1 and CACNA1S. We observed 13 actionable variants in RYR1, based on ClinVar classifications, carried by 43 Icelanders, and no actionable variant in CACNA1S. One in 1450 Icelanders carries an actionable variant for MH. Extensive sequencing allows for better classification and precise dating of variants, and WGS of a large fraction of the population has led to incidental findings of actionable MH genotypes.deCODE Genetics/Amgen Inc

    A population-based survey of FBN1 variants in Iceland reveals underdiagnosis of Marfan syndrome

    Get PDF
    Publisher Copyright: © 2023, The Author(s).Marfan syndrome (MFS) is an autosomal dominant condition characterized by aortic aneurysm, skeletal abnormalities, and lens dislocation, and is caused by variants in the FBN1 gene. To explore causes of MFS and the prevalence of the disease in Iceland we collected information from all living individuals with a clinical diagnosis of MFS in Iceland (n = 32) and performed whole-genome sequencing of those who did not have a confirmed genetic diagnosis (27/32). Moreover, to assess a potential underdiagnosis of MFS in Iceland we attempted a genotype-based approach to identify individuals with MFS. We interrogated deCODE genetics’ database of 35,712 whole-genome sequenced individuals to search for rare sequence variants in FBN1. Overall, we identified 15 pathogenic or likely pathogenic variants in FBN1 in 44 individuals, only 22 of whom were previously diagnosed with MFS. The most common of these variants, NM_000138.4:c.8038 C > T p.(Arg2680Cys), is present in a multi-generational pedigree, and was found to stem from a single forefather born around 1840. The p.(Arg2680Cys) variant associates with a form of MFS that seems to have an enrichment of abdominal aortic aneurysm, suggesting that this may be a particularly common feature of p.(Arg2680Cys)-associated MFS. Based on these combined genetic and clinical data, we show that MFS prevalence in Iceland could be as high as 1/6,600 in Iceland, compared to 1/10,000 based on clinical diagnosis alone, which indicates underdiagnosis of this actionable genetic disorder.Peer reviewe

    COPA syndrome in an Icelandic family caused by a recurrent missense mutation in COPA

    Get PDF
    To access publisher's full text version of this article, please click on the hyperlink in Additional Links field or click on the hyperlink at the top of the page marked FilesBackground: Rare missense mutations in the gene encoding coatomer subunit alpha (COPA) have recently been shown to cause autoimmune interstitial lung, joint and kidney disease, also known as COPA syndrome, under a dominant mode of inheritance. Case presentation: Here we describe an Icelandic family with three affected individuals over two generations with a rare clinical presentation of lung and joint disease and a histological diagnosis of follicular bronchiolitis. We performed whole-genome sequencing (WGS) of the three affected as well as three unaffected members of the family, and searched for rare genotypes associated with disease using 30,067 sequenced Icelanders as a reference population. We assessed all coding and splicing variants, prioritizing variants in genes known to cause interstitial lung disease. We detected a heterozygous missense mutation, p.Glu241Lys, in the COPA gene, private to the affected family members. The mutation occurred de novo in the paternal germline of the index case and was absent from 30,067 Icelandic genomes and 141,353 individuals from the genome Aggregation Database (gnomAD). The mutation occurs within the conserved and functionally important WD40 domain of the COPA protein. Conclusions: This is the second report of the p.Glu241Lys mutation in COPA, indicating the recurrent nature of the mutation. The mutation was reported to co-segregate with COPA syndrome in a large family from the USA with five affected members, and classified as pathogenic. The two separate occurrences of the p.Glu241Lys mutation in cases and its absence from a large number of sequenced genomes confirms its role in the pathogenesis of the COPA syndrome

    Genetics and epidemiology of mutational barcode-defined clonal hematopoiesis

    Get PDF
    Publisher Copyright: © 2023, The Author(s).Clonal hematopoiesis (CH) arises when a substantial proportion of mature blood cells is derived from a single hematopoietic stem cell lineage. Using whole-genome sequencing of 45,510 Icelandic and 130,709 UK Biobank participants combined with a mutational barcode method, we identified 16,306 people with CH. Prevalence approaches 50% in elderly participants. Smoking demonstrates a dosage-dependent impact on risk of CH. CH associates with several smoking-related diseases. Contrary to published claims, we find no evidence that CH is associated with cardiovascular disease. We provide evidence that CH is driven by genes that are commonly mutated in myeloid neoplasia and implicate several new driver genes. The presence and nature of a driver mutation alters the risk profile for hematological disorders. Nevertheless, most CH cases have no known driver mutations. A CH genome-wide association study identified 25 loci, including 19 not implicated previously in CH. Splicing, protein and expression quantitative trait loci were identified for CD164 and TCL1A.Peer reviewe

    Molecular benchmarks of a SARS-CoV-2 epidemic.

    Get PDF
    To access publisher's full text version of this article, please click on the hyperlink in Additional Links field or click on the hyperlink at the top of the page marked DownloadA pressing concern in the SARS-CoV-2 epidemic and other viral outbreaks, is the extent to which the containment measures are halting the viral spread. A straightforward way to assess this is to tally the active cases and the recovered ones throughout the epidemic. Here, we show how epidemic control can be assessed with molecular information during a well characterized epidemic in Iceland. We demonstrate how the viral concentration decreased in those newly diagnosed as the epidemic transitioned from exponential growth phase to containment phase. The viral concentration in the cases identified in population screening decreased faster than in those symptomatic and considered at high risk and that were targeted by the healthcare system. The viral concentration persists in recovering individuals as we found that half of the cases are still positive after two weeks. We demonstrate that accumulation of mutations in SARS-CoV-2 genome can be exploited to track the rate of new viral generations throughout the different phases of the epidemic, where the accumulation of mutations decreases as the transmission rate decreases in the containment phase. Overall, the molecular signatures of SARS-CoV-2 infections contain valuable epidemiological information that can be used to assess the effectiveness of containment measures

    Convergent genetic and expression data implicate immunity in Alzheimer's disease

    Get PDF
    Background Late–onset Alzheimer's disease (AD) is heritable with 20 genes showing genome wide association in the International Genomics of Alzheimer's Project (IGAP). To identify the biology underlying the disease we extended these genetic data in a pathway analysis. Methods The ALIGATOR and GSEA algorithms were used in the IGAP data to identify associated functional pathways and correlated gene expression networks in human brain. Results ALIGATOR identified an excess of curated biological pathways showing enrichment of association. Enriched areas of biology included the immune response (p = 3.27×10-12 after multiple testing correction for pathways), regulation of endocytosis (p = 1.31×10-11), cholesterol transport (p = 2.96 × 10-9) and proteasome-ubiquitin activity (p = 1.34×10-6). Correlated gene expression analysis identified four significant network modules, all related to the immune response (corrected p 0.002 – 0.05). Conclusions The immune response, regulation of endocytosis, cholesterol transport and protein ubiquitination represent prime targets for AD therapeutics

    GWAS Meta-Analysis of Suicide Attempt: Identification of 12 Genome-Wide Significant Loci and Implication of Genetic Risks for Specific Health Factors

    Get PDF

    Dissecting the Shared Genetic Architecture of Suicide Attempt, Psychiatric Disorders, and Known Risk Factors

    Get PDF
    Background Suicide is a leading cause of death worldwide, and nonfatal suicide attempts, which occur far more frequently, are a major source of disability and social and economic burden. Both have substantial genetic etiology, which is partially shared and partially distinct from that of related psychiatric disorders. Methods We conducted a genome-wide association study (GWAS) of 29,782 suicide attempt (SA) cases and 519,961 controls in the International Suicide Genetics Consortium (ISGC). The GWAS of SA was conditioned on psychiatric disorders using GWAS summary statistics via multitrait-based conditional and joint analysis, to remove genetic effects on SA mediated by psychiatric disorders. We investigated the shared and divergent genetic architectures of SA, psychiatric disorders, and other known risk factors. Results Two loci reached genome-wide significance for SA: the major histocompatibility complex and an intergenic locus on chromosome 7, the latter of which remained associated with SA after conditioning on psychiatric disorders and replicated in an independent cohort from the Million Veteran Program. This locus has been implicated in risk-taking behavior, smoking, and insomnia. SA showed strong genetic correlation with psychiatric disorders, particularly major depression, and also with smoking, pain, risk-taking behavior, sleep disturbances, lower educational attainment, reproductive traits, lower socioeconomic status, and poorer general health. After conditioning on psychiatric disorders, the genetic correlations between SA and psychiatric disorders decreased, whereas those with nonpsychiatric traits remained largely unchanged. Conclusions Our results identify a risk locus that contributes more strongly to SA than other phenotypes and suggest a shared underlying biology between SA and known risk factors that is not mediated by psychiatric disorders.Peer reviewe
    corecore