41 research outputs found

    The sequences of 150,119 genomes in the UK Biobank

    Get PDF
    Detailed knowledge of how diversity in the sequence of the human genome affects phenotypic diversity depends on a comprehensive and reliable characterization of both sequences and phenotypic variation. Over the past decade, insights into this relationship have been obtained from whole-exome sequencing or whole-genome sequencing of large cohorts with rich phenotypic data(1,2). Here we describe the analysis of whole-genome sequencing of 150,119 individuals from the UK Biobank(3). This constitutes a set of high-quality variants, including 585,040,410 single-nucleotide polymorphisms, representing 7.0% of all possible human single-nucleotide polymorphisms, and 58,707,036 indels. This large set of variants allows us to characterize selection based on sequence variation within a population through a depletion rank score of windows along the genome. Depletion rank analysis shows that coding exons represent a small fraction of regions in the genome subject to strong sequence conservation. We define three cohorts within the UK Biobank: a large British Irish cohort, a smaller African cohort and a South Asian cohort. A haplotype reference panel is provided that allows reliable imputation of most variants carried by three or more sequenced individuals. We identified 895,055 structural variants and 2,536,688 microsatellites, groups of variants typically excluded from large-scale whole-genome sequencing studies. Using this formidable new resource, we provide several examples of trait associations for rare variants with large effects not found previously through studies based on whole-exome sequencing and/or imputation

    Coding variants in RPL3L and MYZAP increase risk of atrial fibrillation

    Get PDF
    Source at https://doi.org/10.1038/s42003-018-0068-9. Most sequence variants identified hitherto in genome-wide association studies (GWAS) of atrial fibrillation are common, non-coding variants associated with risk through unknown mechanisms. We performed a meta-analysis of GWAS of atrial fibrillation among 29,502 cases and 767,760 controls from Iceland and the UK Biobank with follow-up in samples from Norway and the US, focusing on low-frequency coding and splice variants aiming to identify causal genes. We observe associations with one missense (OR = 1.20) and one splice-donor variant (OR = 1.50) in RPL3L, the first ribosomal gene implicated in atrial fibrillation to our knowledge. Analysis of 167 RNA samples from the right atrium reveals that the splice-donor variant in RPL3L results in exon skipping. We also observe an association with a missense variant in MYZAP (OR = 1.38), encoding a component of the intercalated discs of cardiomyocytes. Both discoveries emphasize the close relationship between the mechanical and electrical function of the heart

    Large-scale plasma proteomics comparisons through genetics and disease associations

    Get PDF
    Publisher Copyright: © 2023, The Author(s).High-throughput proteomics platforms measuring thousands of proteins in plasma combined with genomic and phenotypic information have the power to bridge the gap between the genome and diseases. Here we performed association studies of Olink Explore 3072 data generated by the UK Biobank Pharma Proteomics Project 1 on plasma samples from more than 50,000 UK Biobank participants with phenotypic and genotypic data, stratifying on British or Irish, African and South Asian ancestries. We compared the results with those of a SomaScan v4 study on plasma from 36,000 Icelandic people 2, for 1,514 of whom Olink data were also available. We found modest correlation between the two platforms. Although cis protein quantitative trait loci were detected for a similar absolute number of assays on the two platforms (2,101 on Olink versus 2,120 on SomaScan), the proportion of assays with such supporting evidence for assay performance was higher on the Olink platform (72% versus 43%). A considerable number of proteins had genomic associations that differed between the platforms. We provide examples where differences between platforms may influence conclusions drawn from the integration of protein levels with the study of diseases. We demonstrate how leveraging the diverse ancestries of participants in the UK Biobank helps to detect novel associations and refine genomic location. Our results show the value of the information provided by the two most commonly used high-throughput proteomics platforms and demonstrate the differences between them that at times provides useful complementarity.Peer reviewe

    Rare variants with large effects provide functional insights into the pathology of migraine subtypes, with and without aura

    Get PDF
    Migraine is a complex neurovascular disease with a range of severity and symptoms, yet mostly studied as one phenotype in genome-wide association studies (GWAS). Here we combine large GWAS datasets from six European populations to study the main migraine subtypes, migraine with aura (MA) and migraine without aura (MO). We identified four new MA-associated variants (in PRRT2, PALMD, ABO and LRRK2) and classified 13 MO-associated variants. Rare variants with large effects highlight three genes. A rare frameshift variant in brain-expressed PRRT2 confers large risk of MA and epilepsy, but not MO. A burden test of rare loss-of-function variants in SCN11A, encoding a neuron-expressed sodium channel with a key role in pain sensation, shows strong protection against migraine. Finally, a rare variant with cis-regulatory effects on KCNK5 confers large protection against migraine and brain aneurysms. Our findings offer new insights with therapeutic potential into the complex biology of migraine and its subtypes.</p

    The genetic architecture of age-related hearing impairment revealed by genome-wide association analysis.

    Get PDF
    To access publisher's full text version of this article, please click on the hyperlink in Additional Links field or click on the hyperlink at the top of the page marked DownloadAge-related hearing impairment (ARHI) is the most common sensory disorder in older adults. We conducted a genome-wide association meta-analysis of 121,934 ARHI cases and 591,699 controls from Iceland and the UK. We identified 21 novel sequence variants, of which 13 are rare, under either additive or recessive models. Of special interest are a missense variant in LOXHD1 (MAF = 1.96%) and a tandem duplication in FBF1 covering 4 exons (MAF = 0.22%) associating with ARHI (OR = 3.7 for homozygotes, P = 1.7 × 10-22 and OR = 4.2 for heterozygotes, P = 5.7 × 10-27, respectively). We constructed an ARHI genetic risk score (GRS) using common variants and showed that a common variant GRS can identify individuals at risk comparable to carriers of rare high penetrance variants. Furthermore, we found that ARHI and tinnitus share genetic causes. This study sheds a new light on the genetic architecture of ARHI, through several rare variants in both Mendelian deafness genes and genes not previously linked to hearing

    Genome-wide association identifies seven loci for pelvic organ prolapse in Iceland and the UK Biobank.

    Get PDF
    To access publisher's full text version of this article, please click on the hyperlink in Additional Links field or click on the hyperlink at the top of the page marked DownloadPelvic organ prolapse (POP) is a downward descent of one or more of the pelvic organs, resulting in a protrusion of the vaginal wall and/or uterus. We performed a genome-wide association study of POP using data from Iceland and the UK Biobank, a total of 15,010 cases with hospital-based diagnosis code and 340,734 female controls, and found eight sequence variants at seven loci associating with POP (P 5%) and one with minor allele frequency of 4.87%. Some of the variants associating with POP also associated with traits of similar pathophysiology. Of these, rs3820282, which may alter the estrogen-based regulation of WNT4, also associates with leiomyoma of uterus, gestational duration and endometriosis. Rs3791675 at EFEMP1, a gene involved in connective tissue homeostasis, also associates with hernias and carpal tunnel syndrome. Our results highlight the role of connective tissue metabolism and estrogen exposure in the etiology of POP.UCL Hospitals NIHR Biomedical Research Centr

    Sequence variant affects GCSAML splicing, mast cell specific proteins, and risk of urticaria

    Get PDF
    Funding Information: The authors thank the individuals who participated in this study and whose contributions made this work possible. We also thank our valued colleagues who contributed to the data collection and phenotypic characterization of clinical samples as well as to the genotyping and analysis of the whole-genome association data. This research has been conducted using the UK Biobank Resource under application numbers 24711 and 24898. Publisher Copyright: © 2023, The Author(s).Urticaria is a skin disorder characterized by outbreaks of raised pruritic wheals. In order to identify sequence variants associated with urticaria, we performed a meta-analysis of genome-wide association studies for urticaria with a total of 40,694 cases and 1,230,001 controls from Iceland, the UK, Finland, and Japan. We also performed transcriptome- and proteome-wide analyses in Iceland and the UK. We found nine sequence variants at nine loci associating with urticaria. The variants are at genes participating in type 2 immune responses and/or mast cell biology (CBLB, FCER1A, GCSAML, STAT6, TPSD1, ZFPM1), the innate immunity (C4), and NF-κB signaling. The most significant association was observed for the splice-donor variant rs56043070[A] (hg38: chr1:247556467) in GCSAML (MAF = 6.6%, OR = 1.24 (95%CI: 1.20–1.28), P-value = 3.6 × 10-44). We assessed the effects of the variants on transcripts, and levels of proteins relevant to urticaria pathophysiology. Our results emphasize the role of type 2 immune response and mast cell activation in the pathogenesis of urticaria. Our findings may point to an IgE-independent urticaria pathway that could help address unmet clinical need.Peer reviewe

    Molecular benchmarks of a SARS-CoV-2 epidemic.

    Get PDF
    To access publisher's full text version of this article, please click on the hyperlink in Additional Links field or click on the hyperlink at the top of the page marked DownloadA pressing concern in the SARS-CoV-2 epidemic and other viral outbreaks, is the extent to which the containment measures are halting the viral spread. A straightforward way to assess this is to tally the active cases and the recovered ones throughout the epidemic. Here, we show how epidemic control can be assessed with molecular information during a well characterized epidemic in Iceland. We demonstrate how the viral concentration decreased in those newly diagnosed as the epidemic transitioned from exponential growth phase to containment phase. The viral concentration in the cases identified in population screening decreased faster than in those symptomatic and considered at high risk and that were targeted by the healthcare system. The viral concentration persists in recovering individuals as we found that half of the cases are still positive after two weeks. We demonstrate that accumulation of mutations in SARS-CoV-2 genome can be exploited to track the rate of new viral generations throughout the different phases of the epidemic, where the accumulation of mutations decreases as the transmission rate decreases in the containment phase. Overall, the molecular signatures of SARS-CoV-2 infections contain valuable epidemiological information that can be used to assess the effectiveness of containment measures

    Rare variants with large effects provide functional insights into the pathology of migraine subtypes, with and without aura

    Get PDF
    Publisher Copyright: © 2023, The Author(s).Migraine is a complex neurovascular disease with a range of severity and symptoms, yet mostly studied as one phenotype in genome-wide association studies (GWAS). Here we combine large GWAS datasets from six European populations to study the main migraine subtypes, migraine with aura (MA) and migraine without aura (MO). We identified four new MA-associated variants (in PRRT2, PALMD, ABO and LRRK2) and classified 13 MO-associated variants. Rare variants with large effects highlight three genes. A rare frameshift variant in brain-expressed PRRT2 confers large risk of MA and epilepsy, but not MO. A burden test of rare loss-of-function variants in SCN11A, encoding a neuron-expressed sodium channel with a key role in pain sensation, shows strong protection against migraine. Finally, a rare variant with cis-regulatory effects on KCNK5 confers large protection against migraine and brain aneurysms. Our findings offer new insights with therapeutic potential into the complex biology of migraine and its subtypes.Peer reviewe
    corecore