23 research outputs found
Human centromere repositioning activates transcription and opens chromatin fibre structure
Human centromeres appear as constrictions on mitotic chromosomes and form a platform for kinetochore assembly in mitosis. Biophysical experiments led to a suggestion that repetitive DNA at centromeric regions form a compact scaffold necessary for function, but this was revised when neocentromeres were discovered on non-repetitive DNA. To test whether centromeres have a special chromatin structure we have analysed the architecture of a neocentromere. Centromere repositioning is accompanied by RNA polymerase II recruitment and active transcription to form a decompacted, negatively supercoiled domain enriched in ‘open’ chromatin fibres. In contrast, centromerisation causes a spreading of repressive epigenetic marks to surrounding regions, delimited by H3K27me3 polycomb boundaries and divergent genes. This flanking domain is transcriptionally silent and partially remodelled to form ‘compact’ chromatin, similar to satellite-containing DNA sequences, and exhibits genomic instability. We suggest transcription disrupts chromatin to provide a foundation for kinetochore formation whilst compact pericentromeric heterochromatin generates mechanical rigidity
Alu insertion polymorphisms shared by Papio baboons and Theropithecus gelada reveal an intertwined common ancestry
© 2019 The Author(s). Background: Baboons (genus Papio) and geladas (Theropithecus gelada) are now generally recognized as close phylogenetic relatives, though morphologically quite distinct and generally classified in separate genera. Primate specific Alu retrotransposons are well-established genomic markers for the study of phylogenetic and population genetic relationships. We previously reported a computational reconstruction of Papio phylogeny using large-scale whole genome sequence (WGS) analysis of Alu insertion polymorphisms. Recently, high coverage WGS was generated for Theropithecus gelada. The objective of this study was to apply the high-Throughput poly-Detect method to computationally determine the number of Alu insertion polymorphisms shared by T. gelada and Papio, and vice versa, by each individual Papio species and T. gelada. Secondly, we performed locus-specific polymerase chain reaction (PCR) assays on a diverse DNA panel to complement the computational data. Results: We identified 27,700 Alu insertions from T. gelada WGS that were also present among six Papio species, with nearly half (12,956) remaining unfixed among 12 Papio individuals. Similarly, each of the six Papio species had species-indicative Alu insertions that were also present in T. gelada. In general, P. kindae shared more insertion polymorphisms with T. gelada than did any of the other five Papio species. PCR-based genotype data provided additional support for the computational findings. Conclusions: Our discovery that several thousand Alu insertion polymorphisms are shared by T. gelada and Papio baboons suggests a much more permeable reproductive barrier between the two genera then previously suspected. Their intertwined evolution likely involves a long history of admixture, gene flow and incomplete lineage sorting
A high-quality bonobo genome refines the analysis of hominid evolution
The divergence of chimpanzee and bonobo provides one of the few examples of recent hominid speciation1,2. Here we describe a fully annotated, high-quality bonobo genome assembly, which was constructed without guidance from reference genomes by applying a multiplatform genomics approach. We generate a bonobo genome assembly in which more than 98% of genes are completely annotated and 99% of the gaps are closed, including the resolution of about half of the segmental duplications and almost all of the full-length mobile elements. We compare the bonobo genome to those of other great apes1,3,4,5 and identify more than 5,569 fixed structural variants that specifically distinguish the bonobo and chimpanzee lineages. We focus on genes that have been lost, changed in structure or expanded in the last few million years of bonobo evolution. We produce a high-resolution map of incomplete lineage sorting and estimate that around 5.1% of the human genome is genetically closer to chimpanzee or bonobo and that more than 36.5% of the genome shows incomplete lineage sorting if we consider a deeper phylogeny including gorilla and orangutan. We also show that 26% of the segments of incomplete lineage sorting between human and chimpanzee or human and bonobo are non-randomly distributed and that genes within these clustered segments show significant excess of amino acid replacement compared to the rest of the genome
The evolution of African great ape subtelomeric heterochromatin and the fusion of human chromosome 2
Chimpanzee and gorilla chromosomes differ from human chromosomes by the presence of large blocks of subterminal heterochromatin thought to be composed primarily of arrays of tandem satellite sequence. We explore their sequence composition and organization and show a complex organization composed of specific sets of segmental duplications that have hyperexpanded in concert with the formation of subterminal satellites. These regions are highly copy number polymorphic between and within species, and copy number differences involving hundreds of copies can be accurately estimated by assaying read-depth of next-generation sequencing data sets. Phylogenetic and comparative genomic analyses suggest that the structures have arisen largely independently in the two lineages with the exception of a few seed sequences present in the common ancestor of humans and African apes. We propose a model where an ancestral human-chimpanzee pericentric inversion and the ancestral chromosome 2 fusion both predisposed and protected the chimpanzee and human genomes, respectively, to the formation of subtelomeric heterochromatin. Our findings highlight the complex interplay between duplicated sequences and chromosomal rearrangements that rapidly alter the cytogenetic landscape in a short period of evolutionary time.This work was supported, in part, by NIH grants HG002385 and GM058815 to E.E.E. and NIH grant U54 HG003079 to R.K.W. P.H.S. is supported by a Howard Hughes Medical Institute International Student Fellowship. E.E.E. is an investigator of the Howard Hughes Medical Institut
The evolution of African great ape subtelomeric heterochromatin and the fusion of human chromosome 2
Chimpanzee and gorilla chromosomes differ from human chromosomes by the presence of large blocks of subterminal heterochromatin thought to be composed primarily of arrays of tandem satellite sequence. We explore their sequence composition and organization and show a complex organization composed of specific sets of segmental duplications that have hyperexpanded in concert with the formation of subterminal satellites. These regions are highly copy number polymorphic between and within species, and copy number differences involving hundreds of copies can be accurately estimated by assaying read-depth of next-generation sequencing data sets. Phylogenetic and comparative genomic analyses suggest that the structures have arisen largely independently in the two lineages with the exception of a few seed sequences present in the common ancestor of humans and African apes. We propose a model where an ancestral human-chimpanzee pericentric inversion and the ancestral chromosome 2 fusion both predisposed and protected the chimpanzee and human genomes, respectively, to the formation of subtelomeric heterochromatin. Our findings highlight the complex interplay between duplicated sequences and chromosomal rearrangements that rapidly alter the cytogenetic landscape in a short period of evolutionary time.This work was supported, in part, by NIH grants HG002385 and GM058815 to E.E.E. and NIH grant U54 HG003079 to R.K.W. P.H.S. is supported by a Howard Hughes Medical Institute International Student Fellowship. E.E.E. is an investigator of the Howard Hughes Medical Institut
Epigenetic origin of evolutionary novel centromeres
Most evolutionary new centromeres (ENC) are composed of large arrays of satellite DNA and surrounded by segmental duplications. However, the hypothesis is that ENCs are seeded in an anonymous sequence and only over time have acquired the complexity of "normal" centromeres. Up to now evidence to test this hypothesis was lacking. We recently discovered that the well-known polymorphism of orangutan chromosome 12 was due to the presence of an ENC. We sequenced the genome of an orangutan homozygous for the ENC, and we focused our analysis on the comparison of the ENC domain with respect to its wild type counterpart. No significant variations were found. This finding is the first clear evidence that ENC seedings are epigenetic in nature. The compaction of the ENC domain was found significantly higher than the corresponding WT region and, interestingly, the expression of the only gene embedded in the region was significantly repressed
Gorilla genome structural variation reveals evolutionary parallelisms with chimpanzee
Structural variation has played an important role in the evolutionary restructuring of human and great ape genomes. Recent analyses have suggested that the genomes of chimpanzee and human have been particularly enriched for this form of genetic variation. Here, we set out to assess the extent of structural variation in the gorilla lineage by generating 10-fold genomic sequence coverage from a western lowland gorilla and integrating these data into a physical and cytogenetic framework of structural variation. We discovered and validated over 7665 structural changes within the gorilla lineage, including sequence resolution of inversions, deletions, duplications, and mobile element insertions. A comparison with human and other ape genomes shows that the gorilla genome has been subjected to the highest rate of segmental duplication. We show that both the gorilla and chimpanzee genomes have experienced independent yet convergent patterns of structural mutation that have not occurred in humans, including the formation of subtelomeric heterochromatic caps, the hyperexpansion of segmental duplications, and bursts of retroviral integrations. Our analysis suggests that the chimpanzee and gorilla genomes are structurally more derived than either orangutan or human genomes
Evolution and diversity of copy number variation in the great ape lineage
Copy number variation (CNV) contributes to disease and has restructured the genomes of great apes. The diversity and rate of this process, however, have not been extensively explored among great ape lineages. We analyzed 97 deeply sequenced great ape and human genomes and estimate 16% (469 Mb) of the hominid genome has been affected by recent CNV. We identify a comprehensive set of fixed gene deletions (n = 340) and duplications (n = 405) as well as >13.5 Mb of sequence that has been specifically lost on the human lineage. We compared the diversity and rates of copy number and single nucleotide variation across the hominid phylogeny. We find that CNV diversity partially correlates with single nucleotide diversity (r2 = 0.5) and recapitulates the phylogeny of apes with few exceptions. Duplications significantly outpace deletions (2.8-fold). The load of segregating duplications remains significantly higher in bonobos, Western chimpanzees, and Sumatran orangutans—populations that have experienced recent genetic bottlenecks (P = 0.0014, 0.02, and 0.0088, respectively). The rate of fixed deletion has been more clocklike with the exception of the chimpanzee lineage, where we observe a twofold increase in the chimpanzee–bonobo ancestor (P = 4.79 × 10−9) and increased deletion load among Western chimpanzees (P = 0.002). The latter includes the first genomic disorder in a chimpanzee with features resembling Smith-Magenis syndrome mediated by a chimpanzee-specific increase in segmental duplication complexity. We hypothesize that demographic effects, such as bottlenecks, have contributed to larger and more gene-rich segments being deleted in the chimpanzee lineage and that this effect, more generally, may account for episodic bursts in CNV during hominid evolution.P.H.S. is supported by a Howard Hughes International Student Fellowship.T.M.B. is supported by an ERC Starting Grant (260372). T.M.B. is an ICREA Research Investigator (Institut Catala d’Estudis i Recerca Avancats de la Generalitat de Catalunya). This work was supported, in part, by U.S. National Institutes of Health (NIH) grant HG002385 to E.E.E., BFU2009-13409-C02-02 to J.P.M., and MICINN (Spain) BFU2011-28549 to T.M.B. E.E.E. is an investigator of the Howard Hughes Medical Institut
Computational detection and experimental validation of segmental duplications and associated copy number variations in water buffalo (Bubalus bubalis)
Duplicated sequences are an important source of gene evolution and structural variation within mammalian genomes. Using a read depth approach based on next-generation sequencing, we performed a genome-wide analysis of segmental duplications (SDs) and associated copy number variations (CNVs) in the water buffalo (Bubalus bubalis). By aligning short reads of Olimpia (the reference water buffalo) to the UMD3.1 cattle genome, we identified 1,038 segmental duplications comprising 44.6 Mb (equivalent to ~1.73% of the cattle genome) of the autosomal and X chromosomal sequence in the buffalo genome. We experimentally validated 70.3% (71/101) of these duplications using fluorescent in situ hybridization. We also detected a total of 1,344 CNV regions across 14 additional water buffaloes, amounting to 59.8 Mb of variable sequence or the equivalent of 2.2% of the cattle genome. The CNV regions overlap 1,245 genes that are significantly enriched for specific biological functions including immune response, oxygen transport, sensory system and signal transduction. Additionally, we performed array Comparative Genomic Hybridization (aCGH) experiments using the 14 water buffaloes as test samples and Olimpia as the reference. Using a linear regression model, a high Pearson correlation (r = 0.781) was observed between the log 2 ratios between copy number estimates and the log 2 ratios of aCGH probes. We further designed Quantitative PCR assays to confirm CNV regions within or near annotated genes and found 74.2% agreement with our CNV predictions. These results confirm sub-chromosome-scale structural rearrangements present in the cattle and water buffalo. The information on genome variation that will be of value for evolutionary and phenotypic studies, and may be useful for selective breeding of both species