Search CORE

38 research outputs found

Population genetics of identity by descent

Author: Palamara Pier Francesco
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2014
Field of study

Recent improvements in high-throughput genotyping and sequencing technologies have afforded the collection of massive, genome-wide datasets of DNA information from hundreds of thousands of individuals. These datasets, in turn, provide unprecedented opportunities to reconstruct the history of human populations and detect genotype-phenotype association. Recently developed computational methods can identify long-range chromosomal segments that are identical across samples, and have been transmitted from common ancestors that lived tens to hundreds of generations in the past. These segments reveal genealogical relationships that are typically unknown to the carrying individuals. In this work, we demonstrate that such identical-by-descent (IBD) segments are informative about a number of relevant population genetics features: they enable the inference of details about past population size fluctuations, migration events, and they carry the genomic signature of natural selection. We derive a mathematical model, based on coalescent theory, that allows for a quantitative description of IBD sharing across purportedly unrelated individuals, and develop inference procedures for the reconstruction of recent demographic events, where classical methodologies are statistically underpowered. We analyze IBD sharing in several contemporary human populations, including representative communities of the Jewish Diaspora, Kenyan Maasai samples, and individuals from several Dutch provinces, in all cases retrieving evidence of fine-scale demographic events from recent history. Finally, we expand the presented model to describe distributions for those sites in IBD shared segments that harbor mutation events, showing how these may be used for the inference of mutation rates in humans and other species.Comment: Ph.D. thesi

arXiv.org e-Print Archive

Columbia University Academic Commons

The variance of identity-by-descent sharing in the Wright-Fisher model

Author: Ariel Darvasi
Bennet
Hollenbeck
Itsik Pe’er
Kong
Pier Francesco Palamara
Shai Carmi
Todd Lencz
Vladimir Vacic
Publication venue: 'Genetics Society of America'
Publication date: 12/08/2013
Field of study

Widespread sharing of long, identical-by-descent (IBD) genetic segments is a hallmark of populations that have experienced recent genetic drift. Detection of these IBD segments has recently become feasible, enabling a wide range of applications from phasing and imputation to demographic inference. Here, we study the distribution of IBD sharing in the Wright-Fisher model. Specifically, using coalescent theory, we calculate the variance of the total sharing between random pairs of individuals. We then investigate the cohort-averaged sharing: the average total sharing between one individual and the rest of the cohort. We find that for large cohorts, the cohort-averaged sharing is distributed approximately normally. Surprisingly, the variance of this distribution does not vanish even for large cohorts, implying the existence of "hyper-sharing" individuals. The presence of such individuals has consequences for the design of sequencing studies, since, if they are selected for whole-genome sequencing, a larger fraction of the cohort can be subsequently imputed. We calculate the expected gain in power of imputation by IBD, and subsequently, in power to detect an association, when individuals are either randomly selected or specifically chosen to be the hyper-sharing individuals. Using our framework, we also compute the variance of an estimator of the population size that is based on the mean IBD sharing and the variance in the sharing between inbred siblings. Finally, we study IBD sharing in an admixture pulse model, and show that in the Ashkenazi Jewish population the admixture fraction is correlated with the cohort-averaged sharing.Comment: Includes Supplementary Materia

arXiv.org e-Print Archive

Crossref

Length Distributions of Identity by Descent Reveal Fine-Scale Demographic History

Author: Darvasi Ariel
Lencz Todd
Palamara Pier Francesco
Pe’er Itsik
Publication venue: The American Society of Human Genetics. Published by Elsevier Inc.
Publication date: 02/11/2012
Field of study

Data-driven studies of identity by descent (IBD) were recently enabled by high-resolution genomic data from large cohorts and scalable algorithms for IBD detection. Yet, haplotype sharing currently represents an underutilized source of information for population-genetics research. We present analytical results on the relationship between haplotype sharing across purportedly unrelated individuals and a population’s demographic history. We express the distribution of IBD sharing across pairs of individuals for segments of arbitrary length as a function of the population’s demography, and we derive an inference procedure to reconstruct such demographic history. The accuracy of the proposed reconstruction methodology was extensively tested on simulated data. We applied this methodology to two densely typed data sets: 500 Ashkenazi Jewish (AJ) individuals and 56 Kenyan Maasai (MKK) individuals (HapMap 3 data set). Reconstructing the demographic history of the AJ cohort, we recovered two subsequent population expansions, separated by a severe founder event, consistent with previous analysis of lower-throughput genetic data and historical accounts of AJ history. In the MKK cohort, high levels of cryptic relatedness were detected. The spectrum of IBD sharing is consistent with a demographic model in which several small-sized demes intermix through high migration rates and result in enrichment of shared long-range haplotypes. This scenario of historically structured demographies might explain the unexpected abundance of runs of homozygosity within several populations

Elsevier - Publisher Connector

PubMed Central

Recommended from our members

Fast and accurate long-range phasing in a UK Biobank cohort

Author: Loh Po-Ru
Palamara Pier Francesco
Price Alkes L
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 03/01/2017
Field of study

Recent work has leveraged the extensive genotyping of the Icelandic population to perform long-range phasing (LRP), enabling accurate imputation and association analysis of rare variants in target samples typed on genotyping arrays. Here, we develop a fast and accurate LRP method, Eagle, that extends this paradigm to populations with much smaller proportions of genotyped samples by harnessing long (>4cM) identical-by-descent (IBD) tracts shared among distantly related individuals. We applied Eagle to N≈150,000 samples (0.2% of the British population) from the UK Biobank, and we determined that it is 1–2 orders of magnitude faster than existing methods while achieving similar or better phasing accuracy (switch error rate ≈0.3%, corresponding to perfect phase in a majority of 10Mb segments). We also observed that when used within an imputation pipeline, Eagle pre-phasing improved downstream imputation accuracy compared to pre-phasing in batches using existing methods (as necessary to achieve comparable computational cost)

Harvard University - DASH

A minimal descriptor of an ancestral recombinations graph

Author: Asif Javed
B Padhukasahasram
C Wiuf
GAT McVean
GK Chen
J Hein
L L Liang
L Parida
L Parida
Laxmi Parida
M Arenas
M Jobling
P Marjoram
Pier Francesco Palamara
R Bürger
RC Griffiths
RR Hudson
RR Hudson
S Schaffner
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Ancestral Recombinations Graph (ARG) is a phylogenetic structure that encodes both duplication events, such as mutations, as well as genetic exchange events, such as recombinations: this captures the (genetic) dynamics of a population evolving over generations. Results In this paper, we identify structure-preserving and samples-preserving core of an ARG <it>G</it> and call it the minimal descriptor ARG of <it>G</it>. Its structure-preserving characteristic ensures that all the branch lengths of the marginal trees of the minimal descriptor ARG are identical to that of <it>G</it> and the samples-preserving property asserts that the patterns of genetic variation in the samples of the minimal descriptor ARG are exactly the same as that of <it>G</it>. We also prove that even an unbounded <it>G</it> has a finite minimal descriptor, that continues to preserve certain (graph-theoretic) properties of <it>G</it> and for an appropriate class of ARGs, our estimate (Eqn 8) as well as empirical observation is that the expected reduction in the number of vertices is exponential. Conclusions Based on the definition of this lossless and bounded structure, we derive local properties of the vertices of a minimal descriptor ARG, which lend itself very naturally to the design of efficient sampling algorithms. We further show that a class of minimal descriptors, that of binary ARGs, models the standard coalescent exactly (Thm 6).</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Recommended from our members

Improved imputation quality of low-frequency and rare variants in European samples using the ‘Genome of The Netherlands'

Author: Abdellaoui Abdel
Amin Najaf
Beekman Marian B
Boomsma Dorret I
Bot Jan
Bovenberg Jasper A
Byelas Heorhiy
Cao Hongzhi
Cao Sujie
Chen Ruoyan
committee Steering
Cox David R
de Bakker Paul I W
de Craen Anton J M
de Knijff Peter
Deelen Patrick
den Dunnen Johan T
Dijkstra Martijn
Du Yuanping
Elbers Clara C
Estrada Karol
Francesco Palamara Pier
Francioli Laurent C
Franke Lude
Guryev Victor
Gutierrez-Achury Javier
Handsaker Robert E
Hehir-Kwa Jayne Y
Hofman Albert
Hormozdiari Fereydoun
Hottenga Jouke Jan
Jan Hottenga Jouke
Kanterakis Alexandros
Karssen Lennart C
Kattenberg Mathijs
Kayser Manfred
Kloosterman Wigard P
Koval Vyacheslav
Kreiner-Møller Eskil
Lameijer Eric-Wubbo
Laros Jeroen F J
Li Mingkun
Li Ning
Li Qibin
Li Yingrui
Marschall Schönhuth
Marschall Tobias
Medina-Gomez Carolina
Mei Hailiang
Menelaou Androniki
Moed Matthijs H
Neerincx Pieter B T
Nijman Isaäuc J
Pe'er Itsik
Pitts Steven J
Platteel Mathieu
Potluri Shobha
Pulit Sara L
Rivadeneira Fernando
Slagboom P Eline
Sohail Mashaal
Stoneking Mark
Suchiman H Eka D
Sunyaev Shamil R
Swertz Morris A
van den Berg Leonard H
van der Velde K Joeri
van Dijk Freerk
van Duijn Cornelia
van Duijn Cornelia M
van Enckevort David
van Leeuwen Elisabeth M
van Ommen Gertjan B
van Oven Mannis
van Schaik Barbera D C
van Setten Jessica
Veldink Jan H
Vermaat Martijn
Wang Jun
Westra Harm-Jan
Wijmenga Cisca
Willemsen Gonneke
Wolffenbuttel Bruce H
Ye Kai
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 02/12/2014
Field of study

Although genome-wide association studies (GWAS) have identified many common variants associated with complex traits, low-frequency and rare variants have not been interrogated in a comprehensive manner. Imputation from dense reference panels, such as the 1000 Genomes Project (1000G), enables testing of ungenotyped variants for association. Here we present the results of imputation using a large, new population-specific panel: the Genome of The Netherlands (GoNL). We benchmarked the performance of the 1000G and GoNL reference sets by comparing imputation genotypes with ‘true' genotypes typed on ImmunoChip in three European populations (Dutch, British, and Italian). GoNL showed significant improvement in the imputation quality for rare variants (MAF 0.05–0.5%) compared with 1000G. In Dutch samples, the mean observed Pearson correlation, r2, increased from 0.61 to 0.71. We also saw improved imputation accuracy for other European populations (in the British samples, r2 improved from 0.58 to 0.65, and in the Italians from 0.43 to 0.47). A combined reference set comprising 1000G and GoNL improved the imputation of rare variants even further. The Italian samples benefitted the most from this combined reference (the mean r2 increased from 0.47 to 0.50). We conclude that the creation of a large population-specific reference is advantageous for imputing rare variants and that a combined reference panel across multiple populations yields the best imputation results

Harvard University - DASH

WGS-based telomere length analysis in Dutch family trios implicates stronger maternal inheritance and a role for RRM1 gene

Author: Abdellaoui A. (Abdel)
Amin N. (Najaf)
Arakelyan A. (Arsen)
Beekman M. (Marian)
Boomsma D.I. (Dorret)
Bot J. (Jan)
Bovenberg J.A. (Jasper)
Byelas H. (Heorhiy)
Cao H. (Hongzhi)
Cao S. (Sujie)
Chen R. (Ruoyan)
Cox D.R. (David R.)
Craen A.J.M. (Anton) de
de Bakker P.I.W. (Paul I. W.)
Deelen P. (Patrick)
Dijk F. (Freerk) van
Dijkstra M. (Martijn)
Du Y. (Yuanping)
Duijn C.M. (Cornelia) van
Dunnen J.T. (Johan) den
Elbers C.C. (Clara C.)
Enckevort D. (David) van
Estrada K. (Karol)
Francioli L.C. (Laurent)
Guryev V. (Victor)
Handsaker R.E. (Robert)
Hehir-Kwa J.Y. (Jayne)
Hofman A. (Albert)
Hormozdiari F. (Fereydoun)
Isaacs A. (Aaron)
Jan Hottenga J. (Jouke)
Kanterakis A. (Alexandros)
Karssen L.C. (Lennart)
Kattenberg M. (Mathijs)
Kayser M. (Manfred)
Kloosterman W.P. (Wigard)
Knijff P. (Peter) de
Koval V. (Vyacheslav)
Lameijer E.-W. (Eric-Wubbo)
Laros J.F.J. (Jeroen)
Li M. (Mingkun)
Li N. (Ning)
Li Q. (Qibin)
Li Y. (Yingrui)
Marschall T. (Tobias)
McCarroll S.A. (Steven A.)
Medina-Gomez C. (Carolina)
Mei H. (Hailiang)
Menelaou A. (Androniki)
Moed M.H. (Matthijs H.)
Neerincx P.B.T. (Pieter)
Nersisyan L. (Lilit)
Nijman I.J. (Isaac)
Nikoghosyan M. (Maria)
Ommen G.-J.B. (Gert-Jan) van
Oostra B. (Ben)
Palamara P.F. (Pier Francesco)
Pe’er I. (Itsik)
Pitts S.J. (Steven J.)
Platteel M. (Mathieu)
Polak P. (Paz)
Potluri S. (Shobha)
Pulit S.L. (Sara L.)
Renkens I. (Ivo)
Rivadeneira F. (Fernando)
Schaik B.D.C. (Barbera) van
Schönhuth A. (Alexander)
Slagboom P.E. (Eline)
Sohail M. (Mashaal)
Stoneking M. (Mark)
Suchiman H.E.D. (H. Eka D.)
Sundar P. (Purnima)
Sunyaev S.R. (Shamil R.)
Swertz M.A. (Morris A.)
The Genome of the Netherlands Consortium
Uitterlinden A.G. (André)
van den Berg L.H. (Leonard H.)
van der Velde K.J. (K. Joeri)
van Leeuwen E.M. (Elisabeth M.)
van Oven M. (Mannis)
van Setten J. (Jessica)
Veldink J. (Jan)
Vermaat M. (Martijn)
Vuzman D. (Dana)
Wang J. (Jun)
Wijmenga C. (Cisca)
Willemsen G. (Gonneke)
Ye K. (Kai)
Ye K. (Kai)
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 10/12/2019
Field of study

Telomere length (TL) regulation is an important factor in ageing, reproduction and cancer development. Genetic, hereditary and environmental factors regulating TL are currently widely investigated, however, their relative contribution to TL variability is still understudied. We have used whole genome sequencing data of 250 family trios from the Genome of the Netherlands project to perform computational measurement of TL and a series of regression and genome-wide association analyses to reveal TL inheritance patterns and associated genetic factors. Our results confirm that TL is a largely heritable trait, primarily with mother’s, and, to a lesser extent, with father’s TL having the strongest influence on the offspring. In this cohort, mother’s, but not father’s age at conception was positively linked to offspring TL. Age-related TL attrition of 40 bp/year had relatively small influence on TL variability. Finally, we have identified TL-associated variations in ribonuclease reductase catalytic subunit M1 (RRM1 gene), which is known to regulate telomere maintenance in yeast. We also highlight the importance of multivariate approach and the limitations of existing tools for the analysis of TL as a polygenic heritable quantitative trait

CWI's Institutional Repository

Erasmus University Digital Repository

Recommended from our members

A framework for the detection of de novo mutations in family-based sequencing data

Author: Abdellaoui Abdel
Amin Najaf
Banks Eric
Beekman Marian B
Boomsma Dorret I
Bot Jan
Bovenberg Jasper A
Brandsma Margreet
Byelas Heorhiy
Cao Hongzhi
Cao Sujie
Chen Ruoyan
Cox David R
Cretu-Stancu Mircea
Daly Mark J
de Bakker Paul IW
de Craen Anton JM
de Knijff Peter
Deelen Patrick
den Dunnen Johan T
DePristo Mark A
Dijkstra Martijn
Du Yuanping
Elbers Clara C
Estrada Karol
Francesco Palamara Pier
Francioli Laurent C
Fromer Menachem
Garimella Kiran V
Guryev Victor
Handsaker Robert E
Hehir-Kwa Jayne Y
Hofman Albert
Hormozdiari Fereydoun
Hottenga Jouke Jan
Investigator Principal
Isaacs Aaron
Kanterakis Alexandros
Karssen Lennart C
Kattenberg Mathijs
Kayser Manfred
Kloosterman Wigard P
Koval Vyacheslav
Lameijer Eric-Wubbo
Laros Jeroen FJ
Li Mingkun
Li Ning
Li Qibin
Li Yingrui
Marschall Tobias
McCarroll Steven A
Medina-Gomez Carolina
Mei Hailiang
Menelaou Androniki
Moed Matthijs H
Neale Benjamin M
Neerincx Pieter BT
Nijman Isaäc J
Oostra Ben
Pe'er Itsik
Pitts Steven J
Platteel Mathieu
Polak Paz
Potluri Shobha
Pulit Sara L
Renkens Ivo
Rivadeneira Fernando
Samocha Kaitlin E
Schönhuth Alexander
Slagboom P Eline
Slagboom PEline
Sohail Mashaal
Stoneking Mark
Suchiman H Eka D
Sundar Purnima
Sunyaev Shamil R
Swertz Morris A
Uitterlinden André G
van den Berg Leonard H
van der Velde K Joeri
van Dijk Freerk
van Duijn Cornelia M
van Enckevort David
van Leeuwen Elisabeth M
van Ommen Gertjan B
van Oven Mannis
van Schaik Barbera DC
van Setten Jessica
Veldink Jan H
Vermaat Martijn
Vuzman Dana
Wang Jun
Wijmenga Cisca
Willemsen Gonneke
Ye Kai
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 29/03/2017
Field of study

Germline mutation detection from human DNA sequence data is challenging due to the rarity of such events relative to the intrinsic error rates of sequencing technologies and the uneven coverage across the genome. We developed PhaseByTransmission (PBT) to identify de novo single nucleotide variants and short insertions and deletions (indels) from sequence data collected in parent-offspring trios. We compute the joint probability of the data given the genotype likelihoods in the individual family members, the known familial relationships and a prior probability for the mutation rate. Candidate de novo mutations (DNMs) are reported along with their posterior probability, providing a systematic way to prioritize them for validation. Our tool is integrated in the Genome Analysis Toolkit and can be used together with the ReadBackedPhasing module to infer the parental origin of DNMs based on phase-informative reads. Using simulated data, we show that PBT outperforms existing tools, especially in low coverage data and on the X chromosome. We further show that PBT displays high validation rates on empirical parent-offspring sequencing data for whole-exome data from 104 trios and X-chromosome data from 249 parent-offspring families. Finally, we demonstrate an association between father's age at conception and the number of DNMs in female offspring's X chromosome, consistent with previous literature reports

Harvard University - DASH

Genome of the Netherlands population-specific imputations identify an ABCA6 variant associated with cholesterol levels

Author: Abdellaoui Abdel
Amin Najaf
Bartz Traci M.
Beekman Marian
Bis Joshua C.
Boomsma Dorret I.
Borecki Ingrid B.
Bot Jan
Bovenberg Jasper A.
Brody Jennifer A.
Buckley Brendan M.
Byelas Heorhiy
Campbell Harry
Cao Hongzhi
Cao Sujie
Chen Ruoyan
Cox David R.
Cupples L. Adrienne
De Bakker Paul I W
de Bakker Paul I W
De Craen Anton J M
De Geus Eco J.
De Knijff Peter
Deelen Joris
Deelen Patrick
Den Dunnen Johan T.
Dijkstra Martijn
Du Yuanping
Duan Qing
Elbers Clara C.
Eline Slagboom P.
Feitosa Mary F.
Francioli Laurent C.
Franco Oscar H.
Guryev Victor
Handsaker Robert E.
Hayward Caroline
Hehir-Kwa Jayne Y.
Hocking Lynne J.
Hofman Albert
Hormozdiari Fereydoun
Hottenga Jouke Jan
Huffman Jennifer E.
Isaacs Aaron
Joshi Peter K.
Jukema J. Wouter
Kanterakis Alexandros
Karssen Lennart C.
Kattenberg Mathijs
Kayser Manfred
Kearney Patricia M.
Kloosterman Wigard P.
Koval Vyacheslav
Lameijer Eric Wubbo
Lange Leslie A.
Laros Jeroen F J
Leach Irene Mateo
Li Mingkun
Li Ning
Li Qibin
Li Yingrui
Manichaikul Ani
Marschall Tobias
Mbarek Hamdi
Medina-Gomez Carolina
Mei Hailiang
Menelaou Androniki
Milaneschi Yuri
Moed Matthijs H.
Mychaleckyj Josyf C.
Neerincx Pieter B T
Nijman Isaäc J.
Oostra Ben A.
Packard Chris J.
Palamara Pier Francesco
Peer Itsik
Peloso Gina M.
Penninx Brenda W J H
Pitts Steven J.
Platteel Mathieu
Polasek Ozren
Porteous David J.
Postmus Iris
Potluri Shobha
Psaty Bruce M.
Pulit Sara L.
Rich Stephen S.
Rivadeneira Fernando
Rotter Jerome I.
Rudan Igor
Sattar Naveed
Schönhuth Alexander
Sijbrands Eric J.
Sohail Mashaal
Stoneking Mark
Stott David J.
Suchiman H. Eka D
Sunyaev Shamil R.
Swertz Morris A.
Trochet Holly
Trompet Stella
Uh Hae Won
Uitterlinden Andre G.
Van Den Berg Leonard H.
Van Der Harst Pim
Van Der Velde K. Joeri
Van Dijk Freerk
Van Duijn Cornelia M.
Van Enckevort David J.
Van Leeuwen Elisabeth M.
Van Ommen Gert Jan B
Van Oven Mannis
Van Schaik Barbera D C
Van Setten Jessica
Veldink Jan H.
Vermaat Martijn
Verweij Niek
Vitart Veronique
Wang Jun
White Charles C.
Wijmenga Cisca
Willemsen Gonneke
Wilson James F.
Wolffenbuttel Bruce H.
Wright Alan F.
Ye Kai
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. Acknowledgements: We especially thank all volunteers who participated in our study. This study made use of data generated by the ‘Genome of the Netherlands’ project, which is funded by the Netherlands Organization for Scientific Research (grant no. 184021007). The data were made available as a Rainbow Project of BBMRI-NL. Samples were contributed by LifeLines (http://lifelines.nl/lifelines-research/general), the Leiden Longevity Study (http://www.healthy-ageing.nl; http://www.langleven.net), the Netherlands Twin Registry (NTR: http://www.tweelingenregister.org), the Rotterdam studies (http://www.erasmus-epidemiology.nl/rotterdamstudy) and the Genetic Research in Isolated Populations programme (http://www.epib.nl/research/geneticepi/research.html#gip). The sequencing was carried out in collaboration with the Beijing Institute for Genomics (BGI). Cardiovascular Health Study: This CHS research was supported by NHLBI contracts HHSN268201200036C, HHSN268200800007C, HHSN268200960009C, N01HC55222, N01HC85079, N01HC85080, N01HC85081, N01HC85082, N01HC85083, N01HC85086; and NHLBI grants HL080295, HL087652, HL105756 and HL103612 with additional contribution from the National Institute of Neurological Disorders and Stroke (NINDS). Additional support was provided through AG023629 from the National Institute on Aging (NIA). A full list of CHS investigators and institutions can be found at http://www.chs-nhlbi.org/pi.htm. The CROATIA cohorts would like to acknowledge the invaluable contributions of the recruitment teams in Vis, Korcula and Split (including those from the Institute of Anthropological Research in Zagreb and the Croatian Centre for Global Health at the University of Split), the administrative teams in Croatia and Edinburgh and the people of Vis, Korcula and Split. SNP genotyping was performed at the Wellcome Trust Clinical Research Facility in Edinburgh for CROATIA-Vis, by Helmholtz Zentrum München, GmbH, Neuherberg, Germany for CROATIA-Korcula and by AROS Applied Biotechnology, Aarhus, Denmark for CROATIA-Split. They would also like to thank Jared O’Connell for performing the pre-phasing for all cohorts before imputation. The ERF study as a part of EuroSPAN (European Special Populations Research Network) was supported by European Commission FP-6 STRP grant number 018947 (LSHG-CT-2006-01947) and also received funding from the European Community's Seventh Framework Programme (FP7/2007-2013)/grant agreement HEALTH-F4-2007-201413 by the European Commission under the programme ‘Quality of Life and Management of the Living Resources’ of 5th Framework Programme (no. QLG2-CT-2002-01254). High-throughput analysis of the ERF data was supported by joint grant from the Netherlands Organisation for Scientific Research and the Russian Foundation for Basic Research (NWO-RFBR 047.017.043). This research was financially supported by BBMRI-NL, a Research Infrastructure financed by the Dutch government (NWO 184.021.007). Statistical analyses for the ERF study were carried out on the Genetic Cluster Computer (http://www.geneticcluster.org), which is financially supported by the Netherlands Scientific Organization (NWO 480-05-003 PI: Posthuma) along with a supplement from the Dutch Brain Foundation and the VU University Amsterdam. We are grateful to all study participants and their relatives, general practitioners and neurologists for their contributions and to P. Veraart for her help in genealogy, J. Vergeer for the supervision of the laboratory work and P. Snijders for his help in data collection. The FamHS is funded by a NHLBI grant 5R01HL08770003, and NIDDK grants 5R01DK06833603 and 5R01DK07568102. The Framingham Heart Study SHARe Project for GWAS scan was supported by the NHLBI Framingham Heart Study (Contract No. N01-HC-25195) and its contract with Affymetrix Inc for genotyping services (Contract No. N02-HL-6-4278). DNA isolation and biochemistry were partly supported by NHLBI HL-54776. A portion of this research utilized the Linux Cluster for Genetic Analysis (LinGA-II) funded by the Robert Dawson Evans Endowment of the Department of Medicine at the Boston University School of Medicine and Boston Medical Center. We are grateful to Han Chen for conducting the 1000G imputation. The Family Heart Study was supported by the by grants R01-HL-087700 and R01-HL-088215 from the National Heart, Lung, and Blood Institute (NHLBI). We would like to acknowledge the invaluable contributions of the families who took part in the Generation Scotland: Scottish Family Health Study, the general practitioners and Scottish School of Primary Care for their help in recruiting them, and the whole Generation Scotland team, which includes academic researchers, IT staff, laboratory technicians, statisticians and research managers. SNP genotyping was performed at the Wellcome Trust Clinical Research Facility in Edinburgh. GS:SFHS is funded by the Scottish Executive Health Department, Chief Scientist Office, grant number CZD/16/6. SNP genotyping was funded by the Medical Research Council, United Kingdom. We wish to acknowledge the services of the LifeLines Cohort Study, the contributing research centres delivering data to LifeLines and all the study participants. MESA Whites and the MESA SHARe project are conducted and supported by contracts N01-HC-95159 through N01-HC-95169 and RR-024156 from the NHLBI. Funding for MESA SHARe genotyping was provided by NHLBI Contract N02.HL.6.4278. MESA Family is conducted and supported in collaboration with MESA investigators; support is provided by grants and contracts R01HL071051, R01HL071205, R01HL071250, R01HL071251, R01HL071252, R01HL071258 and R01HL071259. We thank the participants of the MESA study, the Coordinating Center, MESA investigators and study staff for their valuable contributions. A full list of participating MESA investigators and institutions can be found at http://www.mesa-nhlbi.org. Netherland Twin Register (NTR) and Netherlands Study of Depression and Anxiety (NESDA): Funding was obtained from the Netherlands Organization for Scientific Research (NWO) and MagW/ZonMW grants Middelgroot-911-09-032, Spinozapremie 56-464-14192, Geestkracht programme of the Netherlands Organization for Health Research and Development (Zon-MW, grant number 10-000-1002), Center for Medical Systems Biology (CSMB, NWO Genomics), NBIC/BioAssist/RK(2008.024), Biobanking and Biomolecular Resources Research Infrastructure (BBMRI-NL, 184.021.007), VU University’s Institute for Health and Care Research (EMGO+) and Neuroscience Campus Amsterdam (NCA); the European Science Foundation (ESF, EU/QLRT-2001-01254), the European Community’s Seventh Framework Program (FP7/2007-2013), ENGAGE (HEALTH-F4-2007-201413); the European Science Council (ERC Advanced, 230374); and the European Research Council (ERC-284167). Part of the genotyping and analyses were funded by the Genetic Association Information Network (GAIN) of the Foundation for the National Institutes of Health, Rutgers University Cell and DNA Repository (NIMH U24 MH068457-06), the Avera Institute, Sioux Falls, South Dakota (USA) and the National Institutes of Health (NIH R01 HD042157-01A1, MH081802, Grand Opportunity grants 1RC2 MH089951 and 1RC2 MH089995). PREVEND genetics is supported by the Dutch Kidney Foundation (Grant E033), the EU project grant GENECURE (FP-6 LSHM CT 2006 037697), the National Institutes of Health (grant 2R01LM010098), The Netherlands Organisation for Health Research and Development (NWO-Groot grant 175.010.2007.006, NWO VENI grant 916.761.70, ZonMw grant 90.700.441) and the Dutch Inter University Cardiology Institute Netherlands (ICIN). The PROSPER study was supported by an investigator-initiated grant obtained from Bristol-Myers Squibb. J.W.J is an Established Clinical Investigator of the Netherlands Heart Foundation (grant 2001 D 032). Genotyping was supported by the seventh framework programme of the European commission (grant 223004) and by the Netherlands Genomics Initiative (Netherlands Consortium for Healthy Aging grant 050-060-810). The Rotterdam Study is funded by Erasmus Medical Center and Erasmus University, Rotterdam, Netherlands Organization for the Health Research and Development (ZonMw), the Research Institute for Diseases in the Elderly (RIDE), the Ministry of Education, Culture and Science, the Ministry for Health, Welfare and Sports, the European Commission (DG XII) and the Municipality of Rotterdam. We are grateful to the study participants, the staff from the Rotterdam Study and the participating general practitioners and pharmacists. The generation and management of GWAS genotype data for the Rotterdam Study is supported by the Netherlands Organisation of Scientific Research NWO Investments (nr. 175.010.2005.011, 911-03-012). This study is funded by the Research Institute for Diseases in the Elderly (014-93-015; RIDE2), the Netherlands Genomics Initiative (NGI)/Netherlands Organisation for Scientific Research (NWO) project no. 050-060-810. We thank Pascal Arp, Mila Jhamai, Marijn Verkerk, Lizbeth Herrera and Marjolein Peters for their help in creating the GWAS database.Peer reviewedPublisher PD

Aberdeen University Research

VU Research Portal

University of Groningen

Edinburgh Research Explorer

Leiden University Scholary Publications

Enlighten

Erasmus University Digital Repository

MPG.PuRe

Maastricht University Research Portal

Proceedings - University of Groningen

Crossref

CWI's Institutional Repository

Harvard University - DASH

ARTS repository - University of Groningen

Copenhagen University Research Information System

EUR Research Repository

Utrecht University Repository

Dissertations of the University of Groningen

Skewed X-inactivation is common in the general female population

Author: Abdellaoui A. (Abdel)
Amin N. (Najaf)
Arindrarto W. (Wibowo)
Beekman M. (Marian)
Beekman M. (Marian)
Berg L.H. (Leonard) van den
Boomsma D. (Di)
Boomsma D.I. (Dorret)
Bot J. (Jan)
Bot J.J. (Jan)
Bovenberg J.A. (Jasper)
Breggen R. (Ruud) van der
Byelas H. (Heorhiy)
Cao H. (H.)
Cao S. (Sherry)
Chen R. (R.)
Chuva De Sousa Lopes S.M. (Susana M.)
Cox D. (Dr)
de Bakker P. (Pi)
de Craen A. (Aj)
Deelen J. (Joris)
Deelen P. (Patrick)
Deelen P. (Patrick)
den Dunnen J. (Jt)
Dijk F. (Freerk) van
Dijkstra M. (Martijn)
Dongen J. (Jenny) van
Draisma G. (Gerrit)
Du Y. (Y.)
Duijn C.M. (Cornelia) van
Dunnen J.T. (Johan) den
Elbers C.C. (Clara)
Enckevort D. (David) van
Estrada Gil K. (Karol)
Francioli L.C. (Laurent)
Franke L. (Lude)
Franke L. (Lude)
Gagalova K. (Kristina)
Greevenbroek M.M. van
Guryev V. (Victor)
Handsaker R.E. (Robert)
Heemst D. (Diana) van
Hehir-Kwa J. (Jayne)
Heijmans B.T. (Bastiaan T)
Heijmans B.T. (Bastiaan T.)
Hofman B. (Ba)
Hofman B.A. (Bert A)
Hormozdiari F. (Fereydoun)
Hottenga J. (Jj)
Hottenga J.J. (Jouke Jan)
Isaacs A.J. (Aaron)
Isaacs A.J. (Aaron)
Iterson M. (Maarten) van
Jan Bonder M. (Marc)
Jansen R. (Rick)
Jansen R. (Rick)
Jhamai P.M. (Mila)
Kallen C.J. van der
Kanterakis A. (Alexandros)
Karssen L.C. (Lennart)
Kattenberg V.M. (Mathijs)
Kayser M.H. (Manfred)
Kielbasa S.M. (Szymon M.)
Kloosterman W. (Wp)
Knijff P. (Peter) de
Koval V. (Vyacheslav)
Lakenberg N. (Nico)
Lameijer E. (Ew)
Laros J. (Jf)
Li M. (M.)
Li N. (N.)
Li Q. (Q.)
Li Y. (Y.)
Luijk R. (René)
Marschall T. (Tanja)
McCarroll S. (Sa)
Medina-Gomez M.C. (Carolina)
Mei H. (H.)
Mei S. (Shan)
Menelaou A. (Androniki)
Meurs J.B.J. (Joyce) van
Moed H. (Heleen)
Moed M. (Mh)
Monajemi R. (Ramin)
Neerincx P.B.T. (Pieter B T)
Nijman I. (Ij)
Nooren I. (Irene)
Oostra B.A. (Ben)
Oven M. (Mannis) van
Palamara P.F. (Pier Francesco)
Peer I. (Itsik)
Pitts S. (Sj)
Platteel I. (Inge)
Polak P.
Pool R. (Reńe)
Potluri S. (Shobha)
Pulit S.L. (Sara)
Renkens I. (Ivo)
Rivadeneira Ramirez F. (Fernando)
Santen G.W.E. (Gijs)
Schalkwijk C.G. (Casper)
Schönhuth A. (A.)
Setten J. (Jessica) van
Shvetsova E. (Ekaterina)
Slagboom P. (Pe)
Slagboom P.E. (Eline)
Sofronova A. (Alina)
Sohail M. (Mashaal)
Stehouwer C.D. (Coen Da)
Stoneking M. (Mark)
Suchiman H. (He)
Suchiman H.E.D. (H Eka D)
Sundar P. (Purnima)
Sunyaev S. (Sr)
Swertz M. (Ma)
Swertz M.A. (Morris A)
Tigchelaar E.F. (Ettje F.)
Uitterlinden A. (Ag)
Uitterlinden A.G. (André)
van den Berg L. (Lh)
van der Velde K. (Kj)
van Dijk F. (Freerk)
van Duijn C. (Cm)
Van Galen M. (Michiel)
van Leeuwen E. (Em)
van Meurs J. (Joyce)
van Ommen G. (Gj)
van Rooij J. (Jeroen)
van Schaik B. (Bd)
van ’t Hof P. (Peter)
Veldink J. (Jh)
Veldink J.H. (Jan)
Verbiest M.M.P.J. (Michael)
Verkerk M. (Marijn)
Vermaat M. (Martijn)
Vermaat M. (Martijn)
Vuzman D. (Dana)
Wang J. (J.)
White S.J. (Stefan)
Wijmenga C. (C.)
Wijmenga C. (Cisca)
Willemsen G.A.H.M. (Gonneke)
Ye K. (K.)
Zhernakova D.V. (Dasha V)
Zhernakova S. (Sasha)
Zwet E.W. (Erik) van
‘t Hoen P.A.C. (Peter A. C.)
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

X-inactivation is a well-established dosage compensation mechanism ensuring that X-chromosomal genes are expressed at comparable levels in males and females. Skewed X-inactivation is often explained by negative selection of one of the alleles. We demonstrate that imbalanced expression of the paternal and maternal X-chromosomes is common in the general population and that the random nature of the X-inactivation mechanism can be sufficient to explain the imbalance. To this end, we analyzed blood-derived RNA and whole-genome sequencing data from 79 female children and their parents from the Genome of the Netherlands project. We calculated the median ratio of the paternal over total counts at all X-chromosomal heterozygous single-nucleotide variants with coverage ≥10. We identified two individuals where the same X-chromosome was inactivated in all cells. Imbalanced expression of the two X-chromosomes (ratios ≤0.35 or ≥0.65) was observed in nearly 50% of the population. The empirically observed skewing is explained by a theoretical model where X-inactivation takes place in an embryonic stage in which eight cells give rise to the hematopoietic compartment. Genes escaping X-inactivation are expressed from both alleles and therefore demonstrate less skewing than inactivated genes. Using this characteristic, we identified three novel escapee genes (SSR4, REPS2, and SEPT6), but did not find support for many previously reported escapee genes in blood. Our collective data suggest that skewed X-inactivation is common in the general population. This may contribute to manifestation of symptoms in carriers of recessive X-linked disorders. We recommend that X-inactivation results should not be used lightly in the interpretation of X-linked variants

Erasmus University Digital Repository