
23 research outputs found

Whole genome sequencing identifies structural variants contributing to hematologic traits in the NHLBI TOPMed program

Author: Almasy Laura
Beyter Doruk
Blangero John
Curran Joanne E.
Halldórsson Bjarni V.
Mikhaylova Anna V.
Rao Shuquan
Stilp Adrienne M.
Wen Jia
Wheeler Marsha M.
Publication venue: ScholarWorks @ UTRGV
Publication date: 08/12/2022

Genome-wide association studies have identified thousands of single nucleotide variants and small indels that contribute to variation in hematologic traits. While structural variants are known to cause rare blood or hematopoietic disorders, the genome-wide contribution of structural variants to quantitative blood cell trait variation is unknown. Here we utilized whole genome sequencing data in ancestrally diverse participants of the NHLBI Trans Omics for Precision Medicine program (N = 50,675) to detect structural variants associated with hematologic traits. Using single variant tests, we assessed the association of common and rare structural variants with red cell-, white cell-, and platelet-related quantitative traits and observed 21 independent signals (12 common and 9 rare) reaching genome-wide significance. The majority of these associations (N = 18) replicated in independent datasets. In genome-editing experiments, we provide evidence that a deletion associated with lower monocyte counts leads to disruption of an S1PR3 monocyte enhancer and decreased S1PR3 expression

ScholarWorks @ UTRGV, Univ. of Texas Rio Grande Valley

GraphTyper2 enables population-scale genotyping of structural variation using pangenome graphs

Author: Beyter Doruk
Eggertsson Hannes
Gudbjartsson Daniel
Halldórsson Bjarni
Hardarson Marteinn
Jónsson Hákon
Kristmundsdóttir Snædís
Melsted Páll
Skúladóttir Ástrós
Stefansson Kari
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 27/11/2019

Publisher's version (published article).

Analysis of sequence diversity in the human genome is fundamental for genetic studies. Structural variants (SVs) are frequently omitted in sequence analysis studies, although each has a relatively large impact on the genome. Here, we present GraphTyper2, which uses pangenome graphs to genotype SVs and small variants using short reads. Comparison to the syndip benchmark dataset shows that our SV genotyping is sensitive and variant segregation in families demonstrates the accuracy of our approach. We demonstrate that incorporating public assembly data into our pipeline greatly improves sensitivity, particularly for large insertions. We validate 6,812 SVs on average per genome using long-read data of 41 Icelanders. We show that GraphTyper2 can simultaneously genotype tens of thousands of whole genomes by characterizing 60 million small variants and half a million SVs in 49,962 Icelanders, including 80 thousand SVs with high confidence.

We are grateful to our colleagues from deCODE genetics / Amgen Inc. for their contributions. We also wish to thank all research participants who provided a biological sample to deCODE genetics.

Peer reviewed.

Opin visindi

PopDel identifies medium-size deletions simultaneously in tens of thousands of genomes

Author: Beyter Doruk
Björnsson Eythór
Eggertsson Hannes P.
Halldórsson Bjarni V.
Jónsson Hákon
Kehr Birte
Niehus Sebastian
Schönberger Janina
Stefánsson Kári
Sulem Patrick
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2021

Thousands of genomic structural variants (SVs) segregate in the human population and can impact phenotypic traits and diseases. Their identification in whole-genome sequence data of large cohorts is a major computational challenge. Most current approaches identify SVs in single genomes and afterwards merge the identified variants into a joint call set across many genomes. We describe the approach PopDel, which directly identifies deletions of about 500 to at least 10,000 bp in length in data of many genomes jointly, eliminating the need for subsequent variant merging. PopDel scales to tens of thousands of genomes as we demonstrate in evaluations on up to 49,962 genomes. We show that PopDel reliably reports common, rare and de novo deletions. On genomes with available high-confidence reference call sets PopDel shows excellent recall and precision. Genotype inheritance patterns in up to 6794 trios indicate that genotypes predicted by PopDel are more reliable than those of previous SV callers. Furthermore, PopDel’s running time is competitive with the fastest tested previous tools. The demonstrated scalability and accuracy of PopDel enables routine scans for deletions in large-scale sequencing studies

University of Regensburg Publication Server

Landspítali University Hospital Research Archive

Directory of Open Access Journals

Deficit of homozygosity among 1.52 million individuals and genetic causes of recessive lethality

Author: Alfredsson Lars
Andreassen Ole
Arnadottir Gudny A.
Atlason Bjarni A.
Beyter Doruk
Brodersen Thorsten
Brunak Søren
Bruun Mie Topholm
Didriksen Maria
Djurovic Srdjan
Erikstrup Christian
Ferkingstad Egil
Frei Oleksandr
Fridriksdottir Run
Gudjonsson Sigurjon A.
Haavik Jan
Halldorsson Gisli H.
Hansen Thomas Folkmann
Haraldsson Ásgeir
Havdahl Alexandra
Helgason Hannes
Hjalgrim Henrik
Jacobsen Rikke Louise
Jensson Brynjar O.
Jónsson Hákon
Karjalainen Juha
Katrinardottir Hildigunnur
Kockum Ingrid
Kristmundsdottir Snædis
Lie Rolv T.
Moore Kristjan H.S.
Nielsen Henriette Svarre
Nielsen Kaspar Rene
Nyegaard Mette
Oddsson Asmundur
Olafsdottir Thorunn A
Olsson Tomas
Oskarsson Gudjon R.
Portilla A.L.
Rantapää Dahlqvist Solbritt
Selbæk Geir
Sonderby Ida Elken
Stefansson Olafur A.
Steingrimsdottir Thora
Steinthorsdottir Valgerdur
Stridh Pernilla
Sulem Patrick
Sveinbjornsson Gardar
Sævarsdottir Sædis
Thordardottir Helga B.
Tragante Vinicius
Tryggvadottir Laufey
Walters G. Bragi
Westergaard David
Publication venue: Springer Nature
Publication date: 01/01/2023

Genotypes causing pregnancy loss and perinatal mortality are depleted among living individuals and are therefore difficult to find. To explore genetic causes of recessive lethality, we searched for sequence variants with deficit of homozygosity among 1.52 million individuals from six European populations. In this study, we identified 25 genes harboring protein-altering sequence variants with a strong deficit of homozygosity (10% or less of predicted homozygotes). Sequence variants in 12 of the genes cause Mendelian disease under a recessive mode of inheritance, two under a dominant mode, but variants in the remaining 11 have not been reported to cause disease. Sequence variants with a strong deficit of homozygosity are over-represented among genes essential for growth of human cell lines and genes orthologous to mouse genes known to affect viability. The function of these genes gives insight into the genetics of intrauterine lethality. We also identified 1077 genes with homozygous predicted loss-of-function genotypes not previously described, bringing the total set of genes completely knocked out in humans to 4785.
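
The "10% or less of predicted homozygotes" criterion in the abstract above can be illustrated with a minimal sketch. The abstract does not say how predicted homozygotes are computed; the sketch assumes Hardy-Weinberg proportions (expected homozygotes = N × p²), and the function names, allele frequency, and counts are hypothetical values chosen only for illustration.

```python
def expected_homozygotes(n_individuals: int, allele_freq: float) -> float:
    """Expected homozygote count, ASSUMING Hardy-Weinberg proportions (N * p^2)."""
    return n_individuals * allele_freq ** 2


def has_strong_deficit(observed: int, n_individuals: int, allele_freq: float,
                       threshold: float = 0.10) -> bool:
    """True if observed homozygotes are at most `threshold` of the expected count."""
    expected = expected_homozygotes(n_individuals, allele_freq)
    if expected == 0:
        return False  # no homozygotes expected, so no deficit can be assessed
    return observed / expected <= threshold


# Hypothetical variant: 1% allele frequency in a cohort of 1.52 million people.
cohort_size = 1_520_000
freq = 0.01
print(expected_homozygotes(cohort_size, freq))    # roughly 152 expected homozygotes
print(has_strong_deficit(10, cohort_size, freq))  # 10 observed: within the 10% cutoff
print(has_strong_deficit(100, cohort_size, freq)) # 100 observed: no strong deficit
```

A real analysis would also have to account for population structure and genotyping error, which this sketch ignores.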

University of Bergen

The sequences of 150,119 genomes in the UK Biobank

Author: Asgeirsdottir Margret
Beyter Doruk
Brunak Søren
Eggertsson Hannes P
Eiriksson Ogmundur
Erikstrup Christian
Geirsson Arni J
Gudbjartsson Daniel F
Gudjonsson Sigurjon A
Gylfason Arnaldur
Halldorsson Bjarni V
Halldorsson Gisli H
Hardarson Marteinn T
Hauswedell Hannes
Helgason Agnar
Holley Guillaume
Holm Hilma
Jensson Brynjar O
Jonsdottir Ingileif
Jonsson Frosti
Jonsson Hakon
Jonsson Helgi
Jonsson Palmi
Kristinsson Kari
Kristmundsdottir Snaedis
Magnusdottir Droplaug N
Magnusson Olafur T
Masson Gisli
Melsted Pall
Moore Kristjan H S
Nielsen Kaspar René
Norland Kristjan
Oddsson Asmundur
Olafsson Isleifur
Olason Pall I
Ostrowski Sisse Rye
Palsson Gunnar
Pedersen Ole Birger
Rafnar Thorunn
Saemundsdottir Jona
Sigurdsson Brynjar
Sigurdsson Gunnar T
Sigurpalsdottir Brynja D
Snorradottir Steinunn
Sobech Emilia
Stefansson Hreinn
Stefansson Kari
Stefansson Olafur A
Styrkarsdottir Unnur
Sulem Patrick
Sveinbjornsson Gardar
Sverrisson Sverrir T
Thorleifsson Gudmar
Thorsteinsdottir Unnur
Tragante Vinicius
Ulfarsson Magnus O
Zink Florian
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2022

Detailed knowledge of how diversity in the sequence of the human genome affects phenotypic diversity depends on a comprehensive and reliable characterization of both sequences and phenotypic variation. Over the past decade, insights into this relationship have been obtained from whole-exome sequencing or whole-genome sequencing of large cohorts with rich phenotypic data(1,2). Here we describe the analysis of whole-genome sequencing of 150,119 individuals from the UK Biobank(3). This constitutes a set of high-quality variants, including 585,040,410 single-nucleotide polymorphisms, representing 7.0% of all possible human single-nucleotide polymorphisms, and 58,707,036 indels. This large set of variants allows us to characterize selection based on sequence variation within a population through a depletion rank score of windows along the genome. Depletion rank analysis shows that coding exons represent a small fraction of regions in the genome subject to strong sequence conservation. We define three cohorts within the UK Biobank: a large British Irish cohort, a smaller African cohort and a South Asian cohort. A haplotype reference panel is provided that allows reliable imputation of most variants carried by three or more sequenced individuals. We identified 895,055 structural variants and 2,536,688 microsatellites, groups of variants typically excluded from large-scale whole-genome sequencing studies. Using this formidable new resource, we provide several examples of trait associations for rare variants with large effects not found previously through studies based on whole-exome sequencing and/or imputation

Copenhagen University Research Information System

PubMed Central

VBN

Rare variants with large effects provide functional insights into the pathology of migraine subtypes, with and without aura

Author: Andreassen Ole A
Andresdottir Margret
Banasik Karina
Beyter Doruk
Bjornsdottir Anna
Bjornsdottir Gyda
Brunak Søren
Chalmer Mona A
Didriksen Maria
Einarsson Gudmundur
Erikstrup Christian
Ferkingstad Egil
Gretarsdottir Solveig
Gudbjartsson Daniel F.
Haavik Jan
Halldorsson Bjarni V
Halldorsson Gisli H
Hansen Thomas F
Helgadottir Anna
Helgason Hannes
Hjorleifsson Eldjarn Grimur
Holm Hilma
Igland Jannicke
Jonasdottir Adalbjorg
Jonasdottir Aslaug
Jonsdottir Ingileif
Knowlton Kirk U
Kogelman Lisette J A
Lie Rolv T
Ludvigsson Petur
Lund Sigrun H
Magnusson Olafur Th
Masson Gisli
Melsted Pall
Moore Kristjan H S
Nadauld Lincoln D
Nielsen Kaspar R
Nyegaard Mette
Oddsson Asmundur
Olason Pall I.
Olesen Jes
Ostrowski Sisse R
Pedersen Ole B.
Rohde Palle Duun
Rødevand Linn
Saemundsdottir Jona
Sigurdardottir Gudrun R
Sigurdsson Asgeir
Skuladottir Astros Th
Stefansdottir Lilja
Stefansson Hreinn
Stefansson Kari
Stefansson Olafur A
Sulem Patrick
Sveinbjornsson Gardar
Sveinsson Olafur A
Sørensen Erik
T Bruun Mie
Thorarensen Olafur
Thorgeirsson Thorgeir E
Thorleifsson Gudmar
Thorsteinsdottir Unnur
Tragante Vinicius
Ullum Henrik
Unnsteinsdottir Unnur
Walters G Bragi
Zink Florian
Publication date: 01/11/2023

Migraine is a complex neurovascular disease with a range of severity and symptoms, yet mostly studied as one phenotype in genome-wide association studies (GWAS). Here we combine large GWAS datasets from six European populations to study the main migraine subtypes, migraine with aura (MA) and migraine without aura (MO). We identified four new MA-associated variants (in PRRT2, PALMD, ABO and LRRK2) and classified 13 MO-associated variants. Rare variants with large effects highlight three genes. A rare frameshift variant in brain-expressed PRRT2 confers large risk of MA and epilepsy, but not MO. A burden test of rare loss-of-function variants in SCN11A, encoding a neuron-expressed sodium channel with a key role in pain sensation, shows strong protection against migraine. Finally, a rare variant with cis-regulatory effects on KCNK5 confers large protection against migraine and brain aneurysms. Our findings offer new insights with therapeutic potential into the complex biology of migraine and its subtypes.

VBN

Sequence variant affects GCSAML splicing, mast cell specific proteins, and risk of urticaria

Funding Information: The authors thank the individuals who participated in this study and whose contributions made this work possible. We also thank our valued colleagues who contributed to the data collection and phenotypic characterization of clinical samples as well as to the genotyping and analysis of the whole-genome association data. This research has been conducted using the UK Biobank Resource under application numbers 24711 and 24898.

Publisher Copyright: © 2023, The Author(s).

Urticaria is a skin disorder characterized by outbreaks of raised pruritic wheals. In order to identify sequence variants associated with urticaria, we performed a meta-analysis of genome-wide association studies for urticaria with a total of 40,694 cases and 1,230,001 controls from Iceland, the UK, Finland, and Japan. We also performed transcriptome- and proteome-wide analyses in Iceland and the UK. We found nine sequence variants at nine loci associating with urticaria. The variants are at genes participating in type 2 immune responses and/or mast cell biology (CBLB, FCER1A, GCSAML, STAT6, TPSD1, ZFPM1), the innate immunity (C4), and NF-κB signaling. The most significant association was observed for the splice-donor variant rs56043070[A] (hg38: chr1:247556467) in GCSAML (MAF = 6.6%, OR = 1.24 (95% CI: 1.20–1.28), P-value = 3.6 × 10⁻⁴⁴). We assessed the effects of the variants on transcripts, and levels of proteins relevant to urticaria pathophysiology. Our results emphasize the role of type 2 immune response and mast cell activation in the pathogenesis of urticaria. Our findings may point to an IgE-independent urticaria pathway that could help address unmet clinical need.

Peer reviewed.

Opin visindi

Rare variants with large effects provide functional insights into the pathology of migraine subtypes, with and without aura

Author: Andreassen Ole A.
Andrésdóttir Margrét
Banasik Karina
Bay Jakob
Beyter Doruk
Bjornsdottir Anna
Bjornsdottir Gyda
Boldsen Jens K.
Brodersen Thorsten
Brunak Søren
Bruun Mie T.
Burgdorf Kristoffer
Chalmer Mona A.
Didriksen Maria
Dinh Khoa M.
Dowsett Joseph
Einarsson Gudmundur
Erikstrup Christian
Feenstra Bjarke
Ferkingstad Egil
Geller Frank
Gretarsdottir Solveig
Gudbjartsson Daniel F.
Haavik Jan
Halldórsson Bjarni Vilhjálmur
Halldórsson Gísli Hreinn
Hansen Thomas F.
Helgadottir Anna
Helgason Hannes
Henriksen Alexander P.
Hindhede Lotte
Hjalgrim Henrik
Hjorleifsson Eldjarn Grimur
Holm Hilma
Igland Jannicke
Jacobsen Rikke L.
Jemec Gregor
Jonasdottir Adalbjorg
Jonasdottir Aslaug
Jónsdóttir Ingileif
Kaspersen Katrine
Kjerulf Bertram D.
Knowlton Kirk U.
Kogelman Lisette J.A.
Larsen Margit A.H.
Lie Rolv T.
Louloudis Ioannis
Ludvigsson Petur
Lund Sigrún Helga
Lundgaard Agnete
Magnusson Olafur Th
Masson Gisli
Melsted Páll
Mikkelsen Christina
Mikkelsen Susan
Moore Kristjan H.S.
Nadauld Lincoln D.
Nielsen Kaspar R.
Nissen Ioanna
Nyegaard Mette
Oddsson Asmundur
Olason Pall I.
Olesen Jes
Ostrowski Sisse R.
Pedersen Ole B.
Rohde Palle D.
Rostgaard Klaus
Rødevand Linn
Saemundsdottir Jona
Sigurdardottir Gudrun R.
Sigurdsson Asgeir
Skuladottir Astros Th
Stefansson Hreinn
Stefansson Olafur A.
Stefánsdóttir Lilja
Stefánsson Kári
Sulem Patrick
Sveinbjornsson Gardar
Sveinsson Ólafur Árni
Swinn Michael
Sørensen Erik
T. Bruun Mie
Thorarensen Ólafur
Thorgeirsson Thorgeir E.
Thorleifsson Gudmar
Thørner Lise W.
Tragante Vinicius
Ullum Henrik
Unnsteinsdottir Unnur
Walters Guðmundur Bragi
Werge Thomas
Westergaard David
Zink Florian
Þorsteinsdóttir Unnur
Publication date: 01/11/2023

Publisher Copyright: © 2023, The Author(s).

Migraine is a complex neurovascular disease with a range of severity and symptoms, yet mostly studied as one phenotype in genome-wide association studies (GWAS). Here we combine large GWAS datasets from six European populations to study the main migraine subtypes, migraine with aura (MA) and migraine without aura (MO). We identified four new MA-associated variants (in PRRT2, PALMD, ABO and LRRK2) and classified 13 MO-associated variants. Rare variants with large effects highlight three genes. A rare frameshift variant in brain-expressed PRRT2 confers large risk of MA and epilepsy, but not MO. A burden test of rare loss-of-function variants in SCN11A, encoding a neuron-expressed sodium channel with a key role in pain sensation, shows strong protection against migraine. Finally, a rare variant with cis-regulatory effects on KCNK5 confers large protection against migraine and brain aneurysms. Our findings offer new insights with therapeutic potential into the complex biology of migraine and its subtypes.

Peer reviewed.

Opin visindi

From Microbial Communities to Human Cancer: Methods for Exploring Diversity Across Varying Levels of Biological Organization

Author: Beyter Doruk
Publication venue: eScholarship, University of California
Publication date: 01/01/2017

Biological diversity can be defined as the total variation of life across levels of biological organization from genes/cells to communities/ecosystems. Exploiting the observed diversity can be of vital interest for environmental or clinical applications, as it may translate into improved responses in community management or patient treatment. Advancements in biological data acquisition technologies such as next-generation sequencing, tandem mass spectrometry or cell imaging enabled scientists to explore diversity in complex samples. The high volume of data, however, created the need for efficient and sensitive computational techniques to perform useful analyses. In this dissertation, I present three studies, where we explore the presence and the level of biological diversity together with the computational tools and analyses developed for three different data modalities.

First, I describe our computational analysis of the bacterial small subunit rRNA (16S) and the eukaryotic internal transcribed spacer 2 (ITS2) sequencing data of industrial scale open algae ponds, where we explored the associations of community composition and ecosystem variables over a year. We found that periods of high eukaryotic diversity were associated with high and more stable biomass productivity.

Second, I present ProteoStorm, our computational workflow for performing efficient and sensitive peptide identifications of metaproteomics samples on massive microbial protein databases. Our approach focuses on efficiently reducing the set of candidate peptides for each spectrum, thus obtaining 100 to 1000-fold speedup at the cost of a minimal loss in sensitivity. Our re-analysis of urinary tract infection datasets using a comprehensive database identified bacterial genera previously unknown to be associated with said samples.

Last, I present our study on the landscape of extrachromosomal DNA (ecDNA) in human cancer, where we employed whole-genome sequencing, structural modelling and cytogenetic analyses of 17 different cancer types, including metaphase of 2,572 dividing cells. I focus on the exploration of the presence and diversity of ecDNA in tumor cells, which we conducted using ECdetect, an image analysis software I developed. We discovered that ecDNA was found in nearly half of human cancers, and was almost never found in normal cells. Using ECdetect, we were also able to provide estimations on the ecDNA count diversity in tumor cell lines.

Ezid

eScholarship - University of California