The sequences of 150,119 genomes in the UK Biobank
Detailed knowledge of how diversity in the sequence of the human genome affects phenotypic diversity depends on a comprehensive and reliable characterization of both sequences and phenotypic variation. Over the past decade, insights into this relationship have been obtained from whole-exome sequencing or whole-genome sequencing of large cohorts with rich phenotypic data (1,2). Here we describe the analysis of whole-genome sequencing of 150,119 individuals from the UK Biobank (3). This constitutes a set of high-quality variants, including 585,040,410 single-nucleotide polymorphisms, representing 7.0% of all possible human single-nucleotide polymorphisms, and 58,707,036 indels. This large set of variants allows us to characterize selection based on sequence variation within a population through a depletion rank score of windows along the genome. Depletion rank analysis shows that coding exons represent a small fraction of regions in the genome subject to strong sequence conservation. We define three cohorts within the UK Biobank: a large British Irish cohort, a smaller African cohort and a South Asian cohort. A haplotype reference panel is provided that allows reliable imputation of most variants carried by three or more sequenced individuals. We identified 895,055 structural variants and 2,536,688 microsatellites, groups of variants typically excluded from large-scale whole-genome sequencing studies. Using this formidable new resource, we provide several examples of trait associations for rare variants with large effects not found previously through studies based on whole-exome sequencing and/or imputation.
A rare splice donor mutation in the haptoglobin gene associates with blood lipid levels and coronary artery disease.
Iceland screens, treats, or prevents multiple myeloma (iStopMM): a population-based screening study for monoclonal gammopathy of undetermined significance and randomized controlled trial of follow-up strategies
Monoclonal gammopathy of undetermined significance (MGUS) precedes multiple myeloma (MM). Population-based screening for MGUS could identify candidates for early treatment in MM. Here we describe the Iceland Screens, Treats, or Prevents Multiple Myeloma study (iStopMM), the first population-based screening study for MGUS including a randomized trial of follow-up strategies. Icelandic residents born before 1976 were offered participation. Blood samples are collected alongside blood sampling in the Icelandic healthcare system. Participants with MGUS are randomized to three study arms. Arm 1 is not contacted, arm 2 follows current guidelines, and arm 3 follows a more intensive strategy. Participants who progress are offered early treatment. Samples are collected longitudinally from arms 2 and 3 for the study biobank. All participants repeatedly answer questionnaires on various exposures and outcomes including quality of life and psychiatric health. National registries on health are cross-linked to all participants. Of the 148,704 individuals in the target population, 80,759 (54.3%) provided informed consent for participation. With a very high participation rate, the data from the iStopMM study will answer important questions on MGUS, including potential harms and benefits of screening. The study can lead to a paradigm shift in MM therapy towards screening and early therapy.
Sequence variants at the TERT-CLPTM1L locus associate with many cancer types
The common sequence variants that have recently been associated with cancer risk are particular to a single cancer type or at most two. Following up on our genome-wide scan of basal cell carcinoma, we found that rs401681[C] on chromosome 5p15.33 satisfied our threshold for genome-wide significance (OR = 1.25, P = 3.7 × 10^-12). We tested rs401681 for association with 16 additional cancer types in over 30,000 cancer cases and 45,000 controls and found association with lung cancer (OR = 1.15, P = 7.2 × 10^-8) and urinary bladder, prostate and cervix cancer (ORs = 1.07-1.31, all P < 4 × 10^-4). However, rs401681[C] seems to confer protection against cutaneous melanoma (OR = 0.88, P = 8.0 × 10^-4). Notably, most of these cancer types have a strong environmental component to their risk. Investigation of the region led us to rs2736098[A], which showed stronger association with some cancer types. However, neither variant could fully account for the association of the other. rs2736098 corresponds to A305A in the telomerase reverse transcriptase (TERT) protein and rs401681 is in an intron of the CLPTM1L gene.