Search CORE

42 research outputs found

Gene Expression in Chicken Reveals Correlation with Structural Genomic Features and Conserved Patterns of Transcription in the Terrestrial Vertebrates

Author: Aart Lammers
AE Vinogradov
AJ Hulbert
AM Boutanaev
BY Liao
CI Castillo-Davis
Darren P. Martin
DK Kim
DK Kim
E Eisenberg
ET Chan
Evert M. van Schothorst
GK Smyth
H Caron
H Nie
Haisheng Nie
Hendrik-Jan Megens
Jaap Keijer
Jack A. M. Leunissen
M Kimura
Martien A. M. Groenen
P Khaitovich
PB Neerincx
Pieter B. T. Neerincx
RC Gentleman
Richard P. M. A. Crooijmans
RW Morgan
S Durinck
S Falcon
S van Hemert
S van Hemert
T Mijalski
W Huber
W Zhang
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

Background - The chicken is an important agricultural and avian-model species. A survey of gene expression in a range of different tissues will provide a benchmark for understanding expression levels under normal physiological conditions in birds. With expression data for birds being very scant, this benchmark is of particular interest for comparative expression analysis among various terrestrial vertebrates. Methodology/Principal Findings - We carried out a gene expression survey in eight major chicken tissues using whole genome microarrays. A global picture of gene expression is presented for the eight tissues, and tissue specific as well as common gene expression were identified. A Gene Ontology (GO) term enrichment analysis showed that tissue-specific genes are enriched with GO terms reflecting the physiological functions of the specific tissue, and housekeeping genes are enriched with GO terms related to essential biological functions. Comparisons of structural genomic features between tissue-specific genes and housekeeping genes show that housekeeping genes are more compact. Specifically, coding sequence and particularly introns are shorter than genes that display more variation in expression between tissues, and in addition intergenic space was also shorter. Meanwhile, housekeeping genes are more likely to co-localize with other abundantly or highly expressed genes on the same chromosomal regions. Furthermore, comparisons of gene expression in a panel of five common tissues between birds, mammals and amphibians showed that the expression patterns across tissues are highly similar for orthologuous genes compared to random gene pairs within each pair-wise comparison, indicating a high degree of functional conservation in gene expression among terrestrial vertebrates. Conclusions - The housekeeping genes identified in this study have shorter gene length, shorter coding sequence length, shorter introns, and shorter intergenic regions, there seems to be selection pressure on economy in genes with a wide tissue distribution, i.e. these genes are more compact. A comparative analysis showed that the expression patterns of orthologous genes are conserved in the terrestrial vertebrates during evolutio

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Edinburgh Research Explorer

Wageningen University & Research Publications

Effect of host genetics on the gut microbiome in 7,738 participants of the Dutch Microbiome Project

Author: Andreu-Sánchez Sergio
Bolte Laura A
Brandao Gois Milla F
Chen Lianmin
Collij Valerie
Fu Jingyuan
Gacesa Ranko
Harmsen Hermie J M
Hu Shixian
Klaassen Marjolein A Y
Kurilshikov Alexander
Lopera-Maya Esteban A
Neerincx Pieter B T
Sanna Serena
Sinha Trishla
Swertz Morris A
van der Graaf Adriaan
Vila Arnau Vich
Weersma Rinse K
Wijmenga Cisca
Zhernakova Alexandra
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/02/2022
Field of study

Host genetics are known to influence the gut microbiome, yet their role remains poorly understood. To robustly characterize these effects, we performed a genome-wide association study of 207 taxa and 205 pathways representing microbial composition and function in 7,738 participants of the Dutch Microbiome Project. Two robust, study-wide significant (P < 1.89 × 10-10) signals near the LCT and ABO genes were found to be associated with multiple microbial taxa and pathways and were replicated in two independent cohorts. The LCT locus associations seemed modulated by lactose intake, whereas those at ABO could be explained by participant secretor status determined by their FUT2 genotype. Twenty-two other loci showed suggestive evidence (P < 5 × 10-8) of association with microbial taxa and pathways. At a more lenient threshold, the number of loci we identified strongly correlated with trait heritability, suggesting that much larger sample sizes are needed to elucidate the remaining effects of host genetics on the gut microbiome

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen

The Genome of the Netherlands:design, and project goals

Author: Abdellaoui Abdel
Beekman Marian
Boomsma Dorret I.
Byelas Heorhiy
Cao Hongzhi
Cao Sujie
Chen Ruoyan
de Bakker Paul I. W.
de Craen Anton J. M.
de Knijff Peter
Deelen Patrick
den Dunnen Johan T.
Dijkstra Martijn
Du Yuanping
Elbers Clara C.
Estrada Karol
Francioli Laurent C.
Guryev Victor
Hehir-Kwa Jayne Y.
Hofman Albert
Hottenga Jouke Jan
Houwing-Duistermaat Jeanine
Kanterakis Alexandros
Karssen Lennart C.
Kattenberg Mathijs
Koval Vyacheslav
Laros Jeroen F. J.
Li Ning
Li Qibin
Li Yingrui
Mai Hailiang
Menelaou Androniki
Neerincx Pieter B. T.
Oostra Ben
Pulit Sara L.
Rivadeneira Fernanodo
Slagboom Eline P.
Suchiman Eka H. D.
Swertz Morris A.
Uitterlinden Andre G.
van Dijk Freerk
van Duijn Cornelia M.
van Enckevort David
van Leeuwen Elisabeth M.
van Ommen Gert-Jan
van Setten Jessica
Vermaat Martijn
Wang Jun
Wijmenga Cisca
Willemsen Gonneke
Wolffenbuttel Bruce H.
Ye Kai
Publication venue
Publication date: 29/05/2013
Field of study

Within the Netherlands a national network of biobanks has been established (Biobanking and Biomolecular Research Infrastructure-Netherlands (BBMRI-NL)) as a national node of the European BBMRI. One of the aims of BBMRI-NL is to enrich biobanks with different types of molecular and phenotype data. Here, we describe the Genome of the Netherlands (GoNL), one of the projects within BBMRI-NL. GoNL is a whole-genome-sequencing project in a representative sample consisting of 250 trio-families from all provinces in the Netherlands, which aims to characterize DNA sequence variation in the Dutch population. The parent-offspring trios include adult individuals ranging in age from 19 to 87 years (mean = 53 years; SD = 16 years) from birth cohorts 1910-1994. Sequencing was done on blood-derived DNA from uncultured cells and accomplished coverage was 14-15x. The family-based design represents a unique resource to assess the frequency of regional variants, accurately reconstruct haplotypes by family-based phasing, characterize short indels and complex structural variants, and establish the rate of de novo mutational events. GoNL will also serve as a reference panel for imputation in the available genome-wide association studies in Dutch and other cohorts to refine association signals and uncover population-specific variants. GoNL will create a catalog of human genetic variation in this sample that is uniquely characterized with respect to micro-geographic location and a wide range of phenotypes. The resource will be made available to the research and medical community to guide the interpretation of sequencing projects. The present paper summarizes the global characteristics of the project.</p

Crossref

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

PubMed Central

Copenhagen University Research Information System

Dissertations of the University of Groningen

Recommended from our members

Improved imputation quality of low-frequency and rare variants in European samples using the ‘Genome of The Netherlands'

Author: Abdellaoui Abdel
Amin Najaf
Beekman Marian B
Boomsma Dorret I
Bot Jan
Bovenberg Jasper A
Byelas Heorhiy
Cao Hongzhi
Cao Sujie
Chen Ruoyan
committee Steering
Cox David R
de Bakker Paul I W
de Craen Anton J M
de Knijff Peter
Deelen Patrick
den Dunnen Johan T
Dijkstra Martijn
Du Yuanping
Elbers Clara C
Estrada Karol
Francesco Palamara Pier
Francioli Laurent C
Franke Lude
Guryev Victor
Gutierrez-Achury Javier
Handsaker Robert E
Hehir-Kwa Jayne Y
Hofman Albert
Hormozdiari Fereydoun
Hottenga Jouke Jan
Jan Hottenga Jouke
Kanterakis Alexandros
Karssen Lennart C
Kattenberg Mathijs
Kayser Manfred
Kloosterman Wigard P
Koval Vyacheslav
Kreiner-Møller Eskil
Lameijer Eric-Wubbo
Laros Jeroen F J
Li Mingkun
Li Ning
Li Qibin
Li Yingrui
Marschall Schönhuth
Marschall Tobias
Medina-Gomez Carolina
Mei Hailiang
Menelaou Androniki
Moed Matthijs H
Neerincx Pieter B T
Nijman Isaäuc J
Pe'er Itsik
Pitts Steven J
Platteel Mathieu
Potluri Shobha
Pulit Sara L
Rivadeneira Fernando
Slagboom P Eline
Sohail Mashaal
Stoneking Mark
Suchiman H Eka D
Sunyaev Shamil R
Swertz Morris A
van den Berg Leonard H
van der Velde K Joeri
van Dijk Freerk
van Duijn Cornelia
van Duijn Cornelia M
van Enckevort David
van Leeuwen Elisabeth M
van Ommen Gertjan B
van Oven Mannis
van Schaik Barbera D C
van Setten Jessica
Veldink Jan H
Vermaat Martijn
Wang Jun
Westra Harm-Jan
Wijmenga Cisca
Willemsen Gonneke
Wolffenbuttel Bruce H
Ye Kai
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 02/12/2014
Field of study

Although genome-wide association studies (GWAS) have identified many common variants associated with complex traits, low-frequency and rare variants have not been interrogated in a comprehensive manner. Imputation from dense reference panels, such as the 1000 Genomes Project (1000G), enables testing of ungenotyped variants for association. Here we present the results of imputation using a large, new population-specific panel: the Genome of The Netherlands (GoNL). We benchmarked the performance of the 1000G and GoNL reference sets by comparing imputation genotypes with ‘true' genotypes typed on ImmunoChip in three European populations (Dutch, British, and Italian). GoNL showed significant improvement in the imputation quality for rare variants (MAF 0.05–0.5%) compared with 1000G. In Dutch samples, the mean observed Pearson correlation, r2, increased from 0.61 to 0.71. We also saw improved imputation accuracy for other European populations (in the British samples, r2 improved from 0.58 to 0.65, and in the Italians from 0.43 to 0.47). A combined reference set comprising 1000G and GoNL improved the imputation of rare variants even further. The Italian samples benefitted the most from this combined reference (the mean r2 increased from 0.47 to 0.50). We conclude that the creation of a large population-specific reference is advantageous for imputing rare variants and that a combined reference panel across multiple populations yields the best imputation results

Harvard University - DASH

The Genome of the Netherlands: Design, and project goals

Author: Abdellaoui A. (Abdel)
Bakker P.I.W. (Paul) de
Beekman M. (Marian)
Boomsma D.I. (Dorret)
Byelas H. (Heorhiy)
Cao H. (Hongzhi)
Cao S. (Sherry)
Chen R. (Ruoyan)
Craen A.J. (Anton) de
Deelen P. (Patrick)
Dijk F. (Freerk) van
Dijkstra M. (Martijn)
Du Y. (Yangchun)
Duijn C.M. (Cornelia) van
Dunnen J.T. (Johan) den
Elbers C.C. (Clara)
Enckevort D. (David) van
Estrada Gil K. (Karol)
Francioli L.C. (Laurent)
Guryev V. (Victor)
Hehir-Kwa J. (Jayne)
Hofman A. (Albert)
Hottenga J.J. (Jouke Jan)
Houwing-Duistermaat J.J. (Jeanine)
Kanterakis A. (Alexandros)
Karssen L.C. (Lennart)
Kattenberg V.M. (Mathijs)
Knijff P. (Peter) de
Koval V. (Vyacheslav)
Laros J.F.J. (Jeroen F.)
Leeuwen E.M. (Elisa) van
Li N. (Ning)
Li Q. (Qibin)
Li Y. (Yingrui)
Mai H. (Hailiang)
Menelaou A. (Androniki)
Neerincx P.B.T. (Pieter B T)
Ommen G.J. (Gert) van
Oostra B.A. (Ben)
Pulit S.L. (Sara)
Rivadeneira Ramirez F. (Fernando)
Setten J. (Jessica) van
Slagboom P.E. (Eline)
Suchiman H.E.D. (Eka)
Swertz M. (Morris)
Uitterlinden A.G. (André)
Vermaat J.S. (Joost)
Wang J. (Jinxia)
Wijmenga C. (Cisca)
Willemsen G.A.H.M. (Gonneke)
Wolffenbuttel B.H.R. (Bruce)
Ye K. (Kai)
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/02/2014
Field of study

Within the Netherlands a national network of biobanks has been established (Biobanking and Biomolecular Research Infrastructure-Netherlands (BBMRI-NL)) as a national node of the European BBMRI. One of the aims of BBMRI-NL is to enrich biobanks with different types of molecular and phenotype data. Here, we describe the Genome of the Netherlands (GoNL), one of the projects within BBMRI-NL. GoNL is a whole-genome-sequencing project in a representative sample consisting of 250 trio-families from all provinces in the Netherlands, which aims to characterize DNA sequence variation in the Dutch population. The parent-offspring trios include adult individuals ranging in age from 19 to 87 years (mean=53 years; SD=16 years) from birth cohorts 1910-1994. Sequencing was done on blood-derived DNA from uncultured cells and accomplished coverage was 14-15x. The family-based design represents a unique resource to assess the frequency of regional variants, accurately reconstruct haplotypes by family-based phasing, characterize short indels and complex structural variants, and establish the rate of de novo mutational events. GoNL will also serve as a reference panel for imputation in the available genome-wide association studies in Dutch and other cohorts to refine association signals and uncover population-specific variants. GoNL will create a catalog of human genetic variation in this sample that is uniquely characterized with respect to micro-geographic location and a wide range of phenotypes. The resource will be made available to the research and medical community to guide the interpretation of sequencing projects. The present paper summarizes the global characteristics of the project

Erasmus University Digital Repository

A high-quality human reference panel reveals the complexity and distribution of genomic structural variants

Author: Abdellaoui A. (Abdel)
Amin N. (Najaf)
Baaijens J.A. (Jasmijn)
Bakker P.I.W. (Paul) de
Beekman M. (Marian)
Boomsma D.I. (Dorret)
Bot J. (Jan)
Bovenberg J.A. (Jasper)
Byelas G. (George)
Cao H. (Hongzhi)
Cao J.S. (Jeremy Sujie)
Cao R. (Rui)
Chen R. (Ruoyan)
Coe B.P. (Bradley)
Craen A.J.M. (Anton) de
Deelen P. (Patrick)
Dijk F. (Freerk) van
Dijkstra L.J. (Louis)
Dijkstra M. (Martijn)
Du Y. (Yuanping)
Duijn C.M. (Cornelia) van
Dunnen J.T. (Johan) den
Eichler E.E. (Evan)
Enckevort D. (David) van
Estrada K. (Karol)
Francioli L.C. (Laurent)
Guryev V. (Victor)
Handsaker R.E. (Robert)
Hehir-Kwa J.Y. (Jayne)
Hofman A. (Albert)
Hormozdiari F. (Fereydoun)
Hottenga J.-J. (Jouke-Jan)
Kanterakis A. (Alexandros)
Karssen L.C. (Lennart)
Kattenberg V.M. (Mathijs)
Kloosterman W.P. (Wigard)
Knijff P. (Peter) de
Ko A. (Arthur)
Koval V. (Vyacheslav)
Lameijer E.-W. (Eric-Wubbo)
Laros J.F.J. (Jeroen)
Ligt J. (Joep) de
Marschall T. (Tobias)
McCarroll S.A. (Steven)
Mei H. (Hailiang)
Neerincx P.B.T. (Pieter)
Nijman I.J. (Isaac)
Ommen G.-J.B. (Gert-Jan) van
Platteel M. (Mathieu)
Renkens I. (Ivo)
Rivadeneira F. (Fernando)
Santcroos M. (Mark)
Schaik B.D.C. (Barbera) van
Schönhuth A. (Alexander)
Slagboom P.E. (Eline)
Sudmant P. (Peter)
Sun Y. (Yushen)
Swertz M.A. (Morris)
Thung (), D.T. (Djie Tjwan)
Uitterlinden A.G. (André)
van Leeuwen E.M. (Elisa)
Vermaat M. (Martijn)
Wardenaar R. (René)
Wijmenga C. (Cisca)
Willemsen G. (Gonneke)
Wolffenbuttel B. (Bruce)
Ye K. (Kai)
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 06/10/2016
Field of study

Structural variation (SV) represents a major source of differences between individual human genomes and has been linked to disease phenotypes. However, the majority of studies provide neither a global view of the full spectrum of these variants nor integrate them into reference panels of genetic variation. Here, we analyse whole genome sequencing data of 769 individuals from 250 Dutch families, and provide a haplotype-resolved map of 1.9 million genome variants across 9 different variant classes, including novel forms of complex indels, and retrotransposition-mediated insertions of mobile elements and processed RNAs. A large proportion are previously under reported variants sized between 21 and 100 bp. We detect 4 megabases of novel sequence, encoding 11 new transcripts. Finally, we show 191 known, trait-associated SNPs to be in strong linkage disequilibrium with SVs and demonstrate that our panel facilitates accurate imputation of SVs in unrelated individuals

CWI's Institutional Repository

WGS-based telomere length analysis in Dutch family trios implicates stronger maternal inheritance and a role for RRM1 gene

Author: Abdellaoui A. (Abdel)
Amin N. (Najaf)
Arakelyan A. (Arsen)
Beekman M. (Marian)
Boomsma D.I. (Dorret)
Bot J. (Jan)
Bovenberg J.A. (Jasper)
Byelas H. (Heorhiy)
Cao H. (Hongzhi)
Cao S. (Sujie)
Chen R. (Ruoyan)
Cox D.R. (David R.)
Craen A.J.M. (Anton) de
de Bakker P.I.W. (Paul I. W.)
Deelen P. (Patrick)
Dijk F. (Freerk) van
Dijkstra M. (Martijn)
Du Y. (Yuanping)
Duijn C.M. (Cornelia) van
Dunnen J.T. (Johan) den
Elbers C.C. (Clara C.)
Enckevort D. (David) van
Estrada K. (Karol)
Francioli L.C. (Laurent)
Guryev V. (Victor)
Handsaker R.E. (Robert)
Hehir-Kwa J.Y. (Jayne)
Hofman A. (Albert)
Hormozdiari F. (Fereydoun)
Isaacs A. (Aaron)
Jan Hottenga J. (Jouke)
Kanterakis A. (Alexandros)
Karssen L.C. (Lennart)
Kattenberg M. (Mathijs)
Kayser M. (Manfred)
Kloosterman W.P. (Wigard)
Knijff P. (Peter) de
Koval V. (Vyacheslav)
Lameijer E.-W. (Eric-Wubbo)
Laros J.F.J. (Jeroen)
Li M. (Mingkun)
Li N. (Ning)
Li Q. (Qibin)
Li Y. (Yingrui)
Marschall T. (Tobias)
McCarroll S.A. (Steven A.)
Medina-Gomez C. (Carolina)
Mei H. (Hailiang)
Menelaou A. (Androniki)
Moed M.H. (Matthijs H.)
Neerincx P.B.T. (Pieter)
Nersisyan L. (Lilit)
Nijman I.J. (Isaac)
Nikoghosyan M. (Maria)
Ommen G.-J.B. (Gert-Jan) van
Oostra B. (Ben)
Palamara P.F. (Pier Francesco)
Pe’er I. (Itsik)
Pitts S.J. (Steven J.)
Platteel M. (Mathieu)
Polak P. (Paz)
Potluri S. (Shobha)
Pulit S.L. (Sara L.)
Renkens I. (Ivo)
Rivadeneira F. (Fernando)
Schaik B.D.C. (Barbera) van
Schönhuth A. (Alexander)
Slagboom P.E. (Eline)
Sohail M. (Mashaal)
Stoneking M. (Mark)
Suchiman H.E.D. (H. Eka D.)
Sundar P. (Purnima)
Sunyaev S.R. (Shamil R.)
Swertz M.A. (Morris A.)
The Genome of the Netherlands Consortium
Uitterlinden A.G. (André)
van den Berg L.H. (Leonard H.)
van der Velde K.J. (K. Joeri)
van Leeuwen E.M. (Elisabeth M.)
van Oven M. (Mannis)
van Setten J. (Jessica)
Veldink J. (Jan)
Vermaat M. (Martijn)
Vuzman D. (Dana)
Wang J. (Jun)
Wijmenga C. (Cisca)
Willemsen G. (Gonneke)
Ye K. (Kai)
Ye K. (Kai)
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 10/12/2019
Field of study

Telomere length (TL) regulation is an important factor in ageing, reproduction and cancer development. Genetic, hereditary and environmental factors regulating TL are currently widely investigated, however, their relative contribution to TL variability is still understudied. We have used whole genome sequencing data of 250 family trios from the Genome of the Netherlands project to perform computational measurement of TL and a series of regression and genome-wide association analyses to reveal TL inheritance patterns and associated genetic factors. Our results confirm that TL is a largely heritable trait, primarily with mother’s, and, to a lesser extent, with father’s TL having the strongest influence on the offspring. In this cohort, mother’s, but not father’s age at conception was positively linked to offspring TL. Age-related TL attrition of 40 bp/year had relatively small influence on TL variability. Finally, we have identified TL-associated variations in ribonuclease reductase catalytic subunit M1 (RRM1 gene), which is known to regulate telomere maintenance in yeast. We also highlight the importance of multivariate approach and the limitations of existing tools for the analysis of TL as a polygenic heritable quantitative trait

CWI's Institutional Repository

Erasmus University Digital Repository

Computational pan-genomics: Status, promises and challenges

Author: Abeel T. (Thomas)
Alkan C. (Can)
Baaijens J.A. (Jasmijn)
Bakker P.I.W. (Paul) de
Boeva V. (Valentina)
Bonnal R.J.P. (Raoul)
Chiaromonte F. (Francesca)
Chikhi R. (Rayan)
Ciccarelli F.D. (Francesca)
Cijvat C.P. (Robin)
Datema E. (Erwin)
Dijkstra L.J. (Louis)
Duijn C.M. (Cornelia) van
Dutilh B.E. (Bas)
Eichler E.E. (Evan)
El-Kebir M. (Mohammed)
Ernst C. (Corinna)
Eskin E. (Eleazar)
Garrison E. (Erik)
Ghaffaari A. (Ali)
Guryev V. (Victor)
Kersey P. (Paul)
Klau G.W. (Gunnar)
Kloosterman W.P. (Wigard)
Korbel J.O. (Jan)
Lameijer E.-W. (Eric-Wubbo)
Langmead B. (Benjamin)
Marschall T. (Tobias)
Martin M. (Marcel)
Marz M. (Manja)
Medvedev P. (Paul)
Mu J.C. (John)
Mäkinen V. (Veli)
Neerincx P.B.T. (Pieter)
Novak A.M. (Adam)
Ouwens K. (Klaasjan)
Paten B. (Benedict)
Peterlongo P. (Pierre)
Pisanti N. (Nadia)
Porubsky D. (David)
Rahmann S. (Sven)
Raphael B.J. (Benjamin)
Reinert K. (Knut)
Ridder D. (Dick) de
Ridder J. (Jeroen) de
Rivals E. (Eric)
Sanders A.D. (Ashley)
Schlesner M. (Matthias)
Schulz-Trieglaff O. (Ole)
Schönhuth A. (Alexander)
Sheikhizadeh S. (Siavash)
Shneider C. (Carl)
Smit S. (Sandra)
The Computational Pan-Genomics Consortium
Valenzuela D. (Daniel)
Vandin F. (Fabio)
Wang J. (Jiayin)
Wessels L.F.A. (Lodewyk)
Ye K. (Kai)
Zhang Y. (Ying)
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2018
Field of study

Many disciplines, from human genetics and oncology to plant breeding, microbiology and virology, commonly face the challenge of analyzing rapidly increasing numbers of genomes. In case of Homo sapiens, the number of sequenced genomes will approach hundreds of thousands in the next few years. Simply scaling up established bioinformatics pipelines will not be sufficient for leveraging the full potential of such rich genomic data sets. Instead, novel, qualitatively different Computational methods and paradigms are needed.We will witness the rapid extension of Computational pan-genomics, a new sub-area of research in Computational biology. In this article, we generalize existing definitions and understand a pangenome as any collection of genomic sequences to be analyzed jointly or to be used as a reference. We examine already available approaches to construct and use pan-genomes, discuss the potential benefits of future technologies and methodologies and review open challenges from the vantage point of the above-mentioned biological disciplines. As a prominent example for a Computational paradigm shift, we particularly highlight the transition from the representation of reference genomes as strings to representations

CWI's Institutional Repository

Erasmus University Digital Repository

Recommended from our members

A framework for the detection of de novo mutations in family-based sequencing data

Author: Abdellaoui Abdel
Amin Najaf
Banks Eric
Beekman Marian B
Boomsma Dorret I
Bot Jan
Bovenberg Jasper A
Brandsma Margreet
Byelas Heorhiy
Cao Hongzhi
Cao Sujie
Chen Ruoyan
Cox David R
Cretu-Stancu Mircea
Daly Mark J
de Bakker Paul IW
de Craen Anton JM
de Knijff Peter
Deelen Patrick
den Dunnen Johan T
DePristo Mark A
Dijkstra Martijn
Du Yuanping
Elbers Clara C
Estrada Karol
Francesco Palamara Pier
Francioli Laurent C
Fromer Menachem
Garimella Kiran V
Guryev Victor
Handsaker Robert E
Hehir-Kwa Jayne Y
Hofman Albert
Hormozdiari Fereydoun
Hottenga Jouke Jan
Investigator Principal
Isaacs Aaron
Kanterakis Alexandros
Karssen Lennart C
Kattenberg Mathijs
Kayser Manfred
Kloosterman Wigard P
Koval Vyacheslav
Lameijer Eric-Wubbo
Laros Jeroen FJ
Li Mingkun
Li Ning
Li Qibin
Li Yingrui
Marschall Tobias
McCarroll Steven A
Medina-Gomez Carolina
Mei Hailiang
Menelaou Androniki
Moed Matthijs H
Neale Benjamin M
Neerincx Pieter BT
Nijman Isaäc J
Oostra Ben
Pe'er Itsik
Pitts Steven J
Platteel Mathieu
Polak Paz
Potluri Shobha
Pulit Sara L
Renkens Ivo
Rivadeneira Fernando
Samocha Kaitlin E
Schönhuth Alexander
Slagboom P Eline
Slagboom PEline
Sohail Mashaal
Stoneking Mark
Suchiman H Eka D
Sundar Purnima
Sunyaev Shamil R
Swertz Morris A
Uitterlinden André G
van den Berg Leonard H
van der Velde K Joeri
van Dijk Freerk
van Duijn Cornelia M
van Enckevort David
van Leeuwen Elisabeth M
van Ommen Gertjan B
van Oven Mannis
van Schaik Barbera DC
van Setten Jessica
Veldink Jan H
Vermaat Martijn
Vuzman Dana
Wang Jun
Wijmenga Cisca
Willemsen Gonneke
Ye Kai
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 29/03/2017
Field of study

Germline mutation detection from human DNA sequence data is challenging due to the rarity of such events relative to the intrinsic error rates of sequencing technologies and the uneven coverage across the genome. We developed PhaseByTransmission (PBT) to identify de novo single nucleotide variants and short insertions and deletions (indels) from sequence data collected in parent-offspring trios. We compute the joint probability of the data given the genotype likelihoods in the individual family members, the known familial relationships and a prior probability for the mutation rate. Candidate de novo mutations (DNMs) are reported along with their posterior probability, providing a systematic way to prioritize them for validation. Our tool is integrated in the Genome Analysis Toolkit and can be used together with the ReadBackedPhasing module to infer the parental origin of DNMs based on phase-informative reads. Using simulated data, we show that PBT outperforms existing tools, especially in low coverage data and on the X chromosome. We further show that PBT displays high validation rates on empirical parent-offspring sequencing data for whole-exome data from 104 trios and X-chromosome data from 249 parent-offspring families. Finally, we demonstrate an association between father's age at conception and the number of DNMs in female offspring's X chromosome, consistent with previous literature reports

Harvard University - DASH

Skewed X-inactivation is common in the general female population

Author: Abdellaoui A. (Abdel)
Amin N. (Najaf)
Arindrarto W. (Wibowo)
Beekman M. (Marian)
Beekman M. (Marian)
Berg L.H. (Leonard) van den
Boomsma D. (Di)
Boomsma D.I. (Dorret)
Bot J. (Jan)
Bot J.J. (Jan)
Bovenberg J.A. (Jasper)
Breggen R. (Ruud) van der
Byelas H. (Heorhiy)
Cao H. (H.)
Cao S. (Sherry)
Chen R. (R.)
Chuva De Sousa Lopes S.M. (Susana M.)
Cox D. (Dr)
de Bakker P. (Pi)
de Craen A. (Aj)
Deelen J. (Joris)
Deelen P. (Patrick)
Deelen P. (Patrick)
den Dunnen J. (Jt)
Dijk F. (Freerk) van
Dijkstra M. (Martijn)
Dongen J. (Jenny) van
Draisma G. (Gerrit)
Du Y. (Y.)
Duijn C.M. (Cornelia) van
Dunnen J.T. (Johan) den
Elbers C.C. (Clara)
Enckevort D. (David) van
Estrada Gil K. (Karol)
Francioli L.C. (Laurent)
Franke L. (Lude)
Franke L. (Lude)
Gagalova K. (Kristina)
Greevenbroek M.M. van
Guryev V. (Victor)
Handsaker R.E. (Robert)
Heemst D. (Diana) van
Hehir-Kwa J. (Jayne)
Heijmans B.T. (Bastiaan T)
Heijmans B.T. (Bastiaan T.)
Hofman B. (Ba)
Hofman B.A. (Bert A)
Hormozdiari F. (Fereydoun)
Hottenga J. (Jj)
Hottenga J.J. (Jouke Jan)
Isaacs A.J. (Aaron)
Isaacs A.J. (Aaron)
Iterson M. (Maarten) van
Jan Bonder M. (Marc)
Jansen R. (Rick)
Jansen R. (Rick)
Jhamai P.M. (Mila)
Kallen C.J. van der
Kanterakis A. (Alexandros)
Karssen L.C. (Lennart)
Kattenberg V.M. (Mathijs)
Kayser M.H. (Manfred)
Kielbasa S.M. (Szymon M.)
Kloosterman W. (Wp)
Knijff P. (Peter) de
Koval V. (Vyacheslav)
Lakenberg N. (Nico)
Lameijer E. (Ew)
Laros J. (Jf)
Li M. (M.)
Li N. (N.)
Li Q. (Q.)
Li Y. (Y.)
Luijk R. (René)
Marschall T. (Tanja)
McCarroll S. (Sa)
Medina-Gomez M.C. (Carolina)
Mei H. (H.)
Mei S. (Shan)
Menelaou A. (Androniki)
Meurs J.B.J. (Joyce) van
Moed H. (Heleen)
Moed M. (Mh)
Monajemi R. (Ramin)
Neerincx P.B.T. (Pieter B T)
Nijman I. (Ij)
Nooren I. (Irene)
Oostra B.A. (Ben)
Oven M. (Mannis) van
Palamara P.F. (Pier Francesco)
Peer I. (Itsik)
Pitts S. (Sj)
Platteel I. (Inge)
Polak P.
Pool R. (Reńe)
Potluri S. (Shobha)
Pulit S.L. (Sara)
Renkens I. (Ivo)
Rivadeneira Ramirez F. (Fernando)
Santen G.W.E. (Gijs)
Schalkwijk C.G. (Casper)
Schönhuth A. (A.)
Setten J. (Jessica) van
Shvetsova E. (Ekaterina)
Slagboom P. (Pe)
Slagboom P.E. (Eline)
Sofronova A. (Alina)
Sohail M. (Mashaal)
Stehouwer C.D. (Coen Da)
Stoneking M. (Mark)
Suchiman H. (He)
Suchiman H.E.D. (H Eka D)
Sundar P. (Purnima)
Sunyaev S. (Sr)
Swertz M. (Ma)
Swertz M.A. (Morris A)
Tigchelaar E.F. (Ettje F.)
Uitterlinden A. (Ag)
Uitterlinden A.G. (André)
van den Berg L. (Lh)
van der Velde K. (Kj)
van Dijk F. (Freerk)
van Duijn C. (Cm)
Van Galen M. (Michiel)
van Leeuwen E. (Em)
van Meurs J. (Joyce)
van Ommen G. (Gj)
van Rooij J. (Jeroen)
van Schaik B. (Bd)
van ’t Hof P. (Peter)
Veldink J. (Jh)
Veldink J.H. (Jan)
Verbiest M.M.P.J. (Michael)
Verkerk M. (Marijn)
Vermaat M. (Martijn)
Vermaat M. (Martijn)
Vuzman D. (Dana)
Wang J. (J.)
White S.J. (Stefan)
Wijmenga C. (C.)
Wijmenga C. (Cisca)
Willemsen G.A.H.M. (Gonneke)
Ye K. (K.)
Zhernakova D.V. (Dasha V)
Zhernakova S. (Sasha)
Zwet E.W. (Erik) van
‘t Hoen P.A.C. (Peter A. C.)
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

X-inactivation is a well-established dosage compensation mechanism ensuring that X-chromosomal genes are expressed at comparable levels in males and females. Skewed X-inactivation is often explained by negative selection of one of the alleles. We demonstrate that imbalanced expression of the paternal and maternal X-chromosomes is common in the general population and that the random nature of the X-inactivation mechanism can be sufficient to explain the imbalance. To this end, we analyzed blood-derived RNA and whole-genome sequencing data from 79 female children and their parents from the Genome of the Netherlands project. We calculated the median ratio of the paternal over total counts at all X-chromosomal heterozygous single-nucleotide variants with coverage ≥10. We identified two individuals where the same X-chromosome was inactivated in all cells. Imbalanced expression of the two X-chromosomes (ratios ≤0.35 or ≥0.65) was observed in nearly 50% of the population. The empirically observed skewing is explained by a theoretical model where X-inactivation takes place in an embryonic stage in which eight cells give rise to the hematopoietic compartment. Genes escaping X-inactivation are expressed from both alleles and therefore demonstrate less skewing than inactivated genes. Using this characteristic, we identified three novel escapee genes (SSR4, REPS2, and SEPT6), but did not find support for many previously reported escapee genes in blood. Our collective data suggest that skewed X-inactivation is common in the general population. This may contribute to manifestation of symptoms in carriers of recessive X-linked disorders. We recommend that X-inactivation results should not be used lightly in the interpretation of X-linked variants

Erasmus University Digital Repository