14 research outputs found

Increased mutation and gene conversion within human segmental duplications

Author: DeWitt William S.
Dishuck Philip C.
Guitart Xavi
Harvey William T.
Marco Sola Santiago
Vollger Mitchell R.
Publication venue: Nature Research
Publication date: 01/01/2023

Single-nucleotide variants (SNVs) in segmental duplications (SDs) have not been systematically assessed because of the limitations of mapping short-read sequencing data1,2. Here we constructed 1:1 unambiguous alignments spanning high-identity SDs across 102 human haplotypes and compared the pattern of SNVs between unique and duplicated regions3,4. We find that human SNVs are elevated 60% in SDs compared to unique regions and estimate that at least 23% of this increase is due to interlocus gene conversion (IGC) with up to 4.3 megabase pairs of SD sequence converted on average per human haplotype. We develop a genome-wide map of IGC donors and acceptors, including 498 acceptor and 454 donor hotspots affecting the exons of about 800 protein-coding genes. These include 171 genes that have ‘relocated’ on average 1.61 megabase pairs in a subset of human haplotypes. Using a coalescent framework, we show that SD regions are slightly evolutionarily older when compared to unique sequences, probably owing to IGC. SNVs in SDs, however, show a distinct mutational spectrum: a 27.1% increase in transversions that convert cytosine to guanine or the reverse across all triplet contexts and a 7.6% reduction in the frequency of CpG-associated mutations when compared to unique DNA. We reason that these distinct mutational properties help to maintain an overall higher GC content of SD DNA compared to that of unique DNA, probably driven by GC-biased conversion between paralogous sequences5,6.

We thank T. Brown for help in editing this manuscript, P. Green for valuable suggestions, and R. Seroussi and his staff for their generous donation of time and resources. This work was supported in part by grants from the US National Institutes of Health (NIH 5R01HG002385, 5U01HG010971 and 1U01HG010973 to E.E.E.; K99HG011041 to P.H.; and F31AI150163 to W.S.D.). W.S.D. was supported in part by a Fellowship in Understanding Dynamic and Multi-scale Systems from the James S. McDonnell Foundation. E.E.E. is an investigator of the Howard Hughes Medical Institute (HHMI). This article is subject to HHMI’s Open Access to Publications policy. HHMI laboratory heads have previously granted a nonexclusive CC BY 4.0 licence to the public and a sublicensable licence to HHMI in their research articles. Pursuant to those licences, the author-accepted manuscript of this article can be made freely available under a CC BY 4.0 licence immediately on publication.

Peer Reviewed

Article signed by 19 authors: Mitchell R. Vollger, Philip C. Dishuck, William T. Harvey, William S. DeWitt, Xavi Guitart, Michael E. Goldberg, Allison N. Rozanski, Julian Lucas, Mobin Asri, Human Pangenome Reference Consortium, Katherine M. Munson, Alexandra P. Lewis, Kendra Hoekzema, Glennis A. Logsdon, David Porubsky, Benedict Paten, Kelley Harris, PingHsun Hsieh & Evan E. Eichler

Postprint (published version)

UPCommons. Portal del coneixement obert de la UPC

A high-quality bonobo genome refines the analysis of hominid evolution

The divergence of chimpanzee and bonobo provides one of the few examples of recent hominid speciation1,2. Here we describe a fully annotated, high-quality bonobo genome assembly, which was constructed without guidance from reference genomes by applying a multiplatform genomics approach. We generate a bonobo genome assembly in which more than 98% of genes are completely annotated and 99% of the gaps are closed, including the resolution of about half of the segmental duplications and almost all of the full-length mobile elements. We compare the bonobo genome to those of other great apes1,3,4,5 and identify more than 5,569 fixed structural variants that specifically distinguish the bonobo and chimpanzee lineages. We focus on genes that have been lost, changed in structure or expanded in the last few million years of bonobo evolution. We produce a high-resolution map of incomplete lineage sorting and estimate that around 5.1% of the human genome is genetically closer to chimpanzee or bonobo and that more than 36.5% of the genome shows incomplete lineage sorting if we consider a deeper phylogeny including gorilla and orangutan. We also show that 26% of the segments of incomplete lineage sorting between human and chimpanzee or human and bonobo are non-randomly distributed and that genes within these clustered segments show significant excess of amino acid replacement compared to the rest of the genome.

Archivio istituzionale della ricerca - Università di Bari

PubMed Central

Sequence diversity analyses of an improved rhesus macaque genome enhance its biomedical utility

The rhesus macaque (Macaca mulatta) is the most widely studied nonhuman primate (NHP) in biomedical research. We present an updated reference genome assembly (Mmul_10, contig N50 = 46 Mbp) that increases the sequence contiguity 120-fold and annotate it using 6.5 million full-length transcripts, thus improving our understanding of gene content, isoform diversity, and repeat organization. With the improved assembly of segmental duplications, we discovered new lineage-specific genes and expanded gene families that are potentially informative in studies of evolution and disease susceptibility. Whole-genome sequencing (WGS) data from 853 rhesus macaques identified 85.7 million single-nucleotide variants (SNVs) and 10.5 million indel variants, including potentially damaging variants in genes associated with human autism and developmental delay, providing a framework for developing noninvasive NHP models of human disease.

Louisiana State University

The structure, function and evolution of a complete human chromosome 8

The complete assembly of each human chromosome is essential for understanding human biology and evolution1,2. Here we use complementary long-read sequencing technologies to complete the linear assembly of human chromosome 8. Our assembly resolves the sequence of five previously long-standing gaps, including a 2.08-Mb centromeric alpha-satellite array, a 644-kb copy number polymorphism in the beta-defensin gene cluster that is important for disease risk, and an 863-kb variable number tandem repeat at chromosome 8q21.2 that can function as a neocentromere. We show that the centromeric alpha-satellite array is generally methylated except for a 73-kb hypomethylated region of diverse higher-order alpha-satellites enriched with CENP-A nucleosomes, consistent with the location of the kinetochore. In addition, we confirm the overall organization and methylation pattern of the centromere in a diploid human genome. Using a dual long-read sequencing approach, we complete high-quality draft assemblies of the orthologous centromere from chromosome 8 in chimpanzee, orangutan and macaque to reconstruct its evolutionary history. Comparative and phylogenetic analyses show that the higher-order alpha-satellite structure evolved in the great ape ancestor with a layered symmetry, in which more ancient higher-order repeats locate peripherally to monomeric alpha-satellites. We estimate that the mutation rate of centromeric satellite DNA is accelerated by more than 2.2-fold compared to the unique portions of the genome, and this acceleration extends into the flanking sequence.

Archivio istituzionale della ricerca - Università di Bari

Assembly of 43 human Y chromosomes reveals extensive complexity and variation.

Author: Audano Peter A
Beck Christine R
Bonder Marc Jan
Dishuck Philip C
Ebert Peter
Eichler Evan E
Hallast Pille
Harvey William T
Hasenfeld Patrick
Hoekzema Kendra
Hoyt Savannah J
Human Genome Structural Variation Consortium (HGSVC)
Höps Wolfram
Kim Kwondo
Konkel Miriam K
Korbel Jan O
Kordosky Jennifer
Kwon Jee Young
Lee Charles
Lewis Alexandra P
Li Chong
Loftus Mark
Logsdon Glennis A
Marschall Tobias
Munson Katherine M
O'Neill Rachel J
Porubsky David
Shi Xinghua
Tsetsos Fotios
Tyler-Smith Chris
Yilmaz Feyza
Zhou Weichen
Zhu Qihui
Publication venue: The Mouseion at the JAXlibrary
Publication date: 01/09/2023

The prevalence of highly repetitive sequences within the human Y chromosome has prevented its complete assembly to date1 and led to its systematic omission from genomic analyses. Here we present de novo assemblies of 43 Y chromosomes spanning 182,900 years of human evolution and report considerable diversity in size and structure. Half of the male-specific euchromatic region is subject to large inversions with a greater than twofold higher recurrence rate compared with all other chromosomes2. Ampliconic sequences associated with these inversions show differing mutation rates that are sequence context dependent, and some ampliconic genes exhibit evidence for concerted evolution with the acquisition and purging of lineage-specific pseudogenes. The largest heterochromatic region in the human genome, Yq12, is composed of alternating repeat arrays that show extensive variation in the number, size and distribution, but retain a 1:1 copy-number ratio. Finally, our data suggest that the boundary between the recombining pseudoautosomal region 1 and the non-recombining portions of the X and Y chromosomes lies 500 kb away from the currently established1 boundary. The availability of fully sequence-resolved Y chromosomes from multiple individuals provides a unique opportunity for identifying new associations of traits with specific Y-chromosomal variants and garnering insights into the evolution and function of complex regions of the human genome.

The Jackson Laboratory: The Mouseion at the JAXlibrary

ARTS repository - University of Groningen

Dissertations of the University of Groningen

A high-quality bonobo genome refines the analysis of hominid evolution.

Author: Antonacci Francesca
Audano Peter A
Baker Carl
Batzer Mark A
Catacchio Claudia R
Diekhans Mark
Dishuck Philip C
Eichler Evan E
Fernandes Jason D
Fiddes Ian T
Gordon David S
Harvey William T
Hastie Alex R
Haukness Marina
Hillier LaDeana W
Hoekzema Kendra
Hoffman Jinna
Hsieh PingHsun
Huang Tzu-Hsueh
Lee Joyce
Lewis Alexandra P
Li Ruiyang
Mao Yafei
Mercuri Ludovica
Montinaro Francesco
Munson Katherine M
Murali Shwetha Canchi
Pang Andy W C
Paten Benedict
Piccolo Ilaria
Porubsky David
Salama Sofie R
Sorensen Melanie
Storer Jessica M
Sulovari Arvis
Thibaud-Nissen Françoise
Underwood Jason G
Ventura Mario
Walker Jerilyn A
Publication venue: Providence St. Joseph Health Digital Commons
Publication date: 01/01/2021

PubMed Central

Archivio istituzionale della ricerca - Università di Bari

Providence St. Joseph Health Digital Commons

A high-quality bonobo genome refines the analysis of hominid evolution

Author: Antonacci Francesca
Audano Peter A
Baker Carl
Batzer Mark A
Catacchio Claudia R
Diekhans Mark
Dishuck Philip C
Eichler Evan E
Fernandes Jason D
Fiddes Ian T
Gordon David S
Harvey William T
Hastie Alex R
Haukness Marina
Hillier LaDeana W
Hoekzema Kendra
Hoffman Jinna
Hsieh PingHsun
Huang Tzu-Hsueh
Lee Joyce
Lewis Alexandra P
Li Ruiyang
Mao Yafei
Mercuri Ludovica
Montinaro Francesco
Munson Katherine M
Murali Shwetha Canchi
Pang Andy W C
Paten Benedict
Piccolo Ilaria
Porubsky David
Salama Sofie R
Sorensen Melanie
Storer Jessica M
Sulovari Arvis
Thibaud-Nissen Françoise
Underwood Jason G
Ventura Mario
Walker Jerilyn A
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2021

Crossref

PubMed Central

Archivio istituzionale della ricerca - Università di Bari

Providence St. Joseph Health Digital Commons

Long-read sequence and assembly of segmental duplications

Author: A Artyomenko
A Auton
AJ Sharp
AnneMarie E. Welch
BS Emanuel
C Alkan
C Jain
CS Chin
D Aguiar
D Gordon
DM Bickhart
DM Church
DR Kelley
ES Lander
Evan E. Eichler
EW Myers
H Li
IT Fiddes
J Chen
J Huddleston
JA Bailey
JD Parsons
JS Seo
KM Steinberg
L Shi
LM Abegglen
M Florio
M Jain
M Patterson
M Pop
Mark J. P. Chaisson
Max L. Dougherty
Melanie Sorensen
Mitchell R. Vollger
MJ Chaisson
MJP Chaisson
ML Dougherty
MY Dennis
N Ailon
P Bonizzoni
P Stankiewicz
PA Pevzner
PH Sudmant
Philip C. Dishuck
Richard K. Wilson
S Das
S Koren
Tina A. Graves-Lindsay
Vy Dang
X Nuttle
Z Puljiz
ZN Kronenberg
Publication venue: Springer Science and Business Media LLC

Crossref

Assembly of 43 human Y chromosomes reveals extensive complexity and variation

Author: Audano Peter A.
Beck Christine R.
Bonder Marc Jan
Clemson University
Dishuck Philip C.
Ebert Peter
Eichler Evan E.
Faculteit Medische Wetenschappen/UMCG
Genome Biology Unit
German Cancer Research Center
Hallast Pille
Harvey William T.
Hasenfeld Patrick
Hoekzema Kendra
Hoyt Savannah J.
Höps Wolfram
Jackson Laboratory
Kim Kwondo
Konkel Miriam K.
Korbel Jan O.
Kordosky Jennifer
Kwon Jee Young
Lee Charles
Lewis Alexandra P.
Li Chong
Loftus Mark
Logsdon Glennis A.
Marschall Tobias
Munson Katherine M.
O’Neill Rachel J.
Porubsky David
Shi Xinghua
Temple University
The University of Connecticut Health Center
Tsetsos Fotios
Tyler-Smith Chris
University of Connecticut
University of Duesseldorf
University of Michigan Medical School
University of Washington School of Medicine
University of Washington Seattle
Wellcome Sanger Institute
Yilmaz Feyza
Zhou Weichen
Zhu Qihui
Publication venue: figshare
Publication date: 01/01/2023

The uploaded file contains Supplementary Tables S1-S61 for the manuscript "Assembly of 43 human Y chromosomes reveals extensive complexity and variation".

Dissertations of the University of Groningen

Assembly of 43 human Y chromosomes reveals extensive complexity and variation

Author: Audano Peter A.
Beck Christine R.
Bonder Marc Jan
Clemson University
Dishuck Philip C.
Ebert Peter
Eichler Evan E.
Genome Biology Unit
German Cancer Research Center
Hallast Pille
Harvey William T.
Hasenfeld Patrick
Hoekzema Kendra
Howard Hughes Medical Institute
Hoyt Savannah J.
Höps Wolfram
Jackson Laboratory
Kim Kwondo
Konkel Miriam K.
Korbel Jan O.
Kordosky Jennifer
Kwon Jee Young
Lee Charles
Lewis Alexandra P.
Li Chong
Loftus Mark
Logsdon Glennis A.
Marschall Tobias
Munson Katherine M.
O’Neill Rachel J.
Porubsky David
Shi Xinghua
Temple University
The University of Connecticut Health Center
Tsetsos Fotios
Tyler-Smith Chris
University of Connecticut
University of Duesseldorf
University of Groningen
University of Michigan Medical School
University of Washington School of Medicine
Wellcome Sanger Institute
Yilmaz Feyza
Zhou Weichen
Zhu Qihui
Publication venue: figshare
Publication date: 01/01/2023

The uploaded file contains Supplementary Tables S1-S61 for the manuscript "Assembly of 43 human Y chromosomes reveals extensive complexity and variation".

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen