81 research outputs found

    The complex pattern of epigenomic variation between natural yeast strains at single-nucleosome resolution

    Get PDF
    International audienceBackground: Epigenomic studies on humans and model species have revealed substantial inter‑individual variation in histone modification profiles. However, the pattern of this variation has not been precisely characterized, particularly regarding which genomic features are enriched for variability and whether distinct histone marks co‑vary synergistically. Yeast allows us to investigate intra‑species variation at high resolution while avoiding other sources of variation, such as cell type or subtype. Results: We profiled histone marks H3K4me3, H3K9ac, H3K14ac, H4K12ac and H3K4me1 in three unrelated wild strains of Saccharomyces cerevisiae at single‑nucleosome resolution and analyzed inter‑strain differences statistically. All five marks varied significantly at specific loci, but to different extents. The number of nucleosomes varying for a given mark between two strains ranged from 20 to several thousands; +1 nucleosomes were significantly less subject to variation. Genes with highly evolvable or responsive expression showed higher variability; however, the variation pattern could not be explained by known transcriptional differences between the strains. Synergistic variation of distinct marks was not systematic, with surprising differences between functionally related H3K9ac and H3K14ac. Interestingly, H3K14ac differences that persisted through transient hyperacetylation were supported by H3K4me3 differences, suggesting stabilization via cross talk. Conclusions: Quantitative variation of histone marks among S. cerevisiae strains is abundant and complex. Its relation to functional characteristics is modular and seems modest, with partial association with gene expression divergences, differences between functionally related marks and partial co‑variation between marks that may confer stability. Thus, the specific context of studies, such as which precise marks, individuals and genomic loci are investigated, is primor‑ dial in population epigenomics studies. The complexity found in this pilot survey in yeast suggests that high complexity can be anticipated among higher eukaryotes, including humans

    Evidence That Replication-Associated Mutation Alone Does Not Explain Between-Chromosome Differences In Substitution Rates

    Get PDF
    Since Haldane first noticed an excess of paternally derived mutations, it has been considered that most mutations derive from errors during germ line replication. Miyata et al. (1987) proposed that differences in the rate of neutral evolution on X, Y, and autosome can be employed to measure the extent of this male bias. This commonly applied method assumes replication to be the sole source of between-chromosome variation in substitution rates. We propose a simple test of this assumption: If true, estimates of the male bias should be independent of which two chromosomal classes are compared. Prior evidence from rodents suggested that this might not be true, but conclusions were limited by a lack of rat Y-linked sequence. We therefore sequenced two rat Y-linked bacterial artificial chromosomes and determined evolutionary rate by comparison with mouse. For estimation of rates we consider both introns and synonymous rates. Surprisingly, for both data sets the prediction of congruent estimates of α is strongly rejected. Indeed, some comparisons suggest a female bias with autosomes evolving faster than Y-linked sequence. We conclude that the method of Miyata et al. (1987) has the potential to provide incorrect estimates. Correcting the method requires understanding of the other causes of substitution that might differ between chromosomal classes. One possible cause is recombination-associated substitution bias for which we find some evidence. We note that if, as some suggest, this association is dominantly owing to male recombination, the high estimates of α seen in birds is to be expected as Z chromosomes recombine in males

    Chromatin structure and evolution in the human genome

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Evolutionary rates are not constant across the human genome but genes in close proximity have been shown to experience similar levels of divergence and selection. The higher-order organisation of chromosomes has often been invoked to explain such phenomena but previously there has been insufficient data on chromosome structure to investigate this rigorously. Using the results of a recent genome-wide analysis of open and closed human chromatin structures we have investigated the global association between divergence, selection and chromatin structure for the first time.</p> <p>Results</p> <p>In this study we have shown that, paradoxically, synonymous site divergence (dS) at non-CpG sites is highest in regions of open chromatin, primarily as a result of an increased number of transitions, while the rates of other traditional measures of mutation (intergenic, intronic and ancient repeat divergence as well as SNP density) are highest in closed regions of the genome. Analysis of human-chimpanzee divergence across intron-exon boundaries indicates that although genes in relatively open chromatin generally display little selection at their synonymous sites, those in closed regions show markedly lower divergence at their fourfold degenerate sites than in neighbouring introns and intergenic regions. Exclusion of known Exonic Splice Enhancer hexamers has little affect on the divergence observed at fourfold degenerate sites across chromatin categories; however, we show that closed chromatin is enriched with certain classes of ncRNA genes whose RNA secondary structure may be particularly important.</p> <p>Conclusion</p> <p>We conclude that, overall, non-CpG mutation rates are lowest in open regions of the genome and that regions of the genome with a closed chromatin structure have the highest background mutation rate. This might reflect lower rates of DNA damage or enhanced DNA repair processes in regions of open chromatin. Our results also indicate that dS is a poor measure of mutation rates, particularly when used in closed regions of the genome, as genes in closed regions generally display relatively strong levels of selection at their synonymous sites.</p

    Coordinated evolution of co-expressed gene clusters in the Drosophila transcriptome

    Get PDF
    Abstract Background Co-expression of genes that physically cluster together is a common characteristic of eukaryotic transcriptomes. This organization of transcriptomes suggests that coordinated evolution of gene expression for clustered genes may also be common. Clusters where expression evolution of each gene is not independent of their neighbors are important units for understanding transcriptome evolution. Results We used a common microarray platform to measure gene expression in seven closely related species in the Drosophila melanogaster subgroup, accounting for confounding effects of sequence divergence. To summarize the correlation structure among genes in a chromosomal region, we analyzed the fraction of variation along the first principal component of the correlation matrix. We analyzed the correlation for blocks of consecutive genes to assess patterns of correlation that may be manifest at different scales of coordinated expression. We find that expression of physically clustered genes does evolve in a coordinated manner in many locations throughout the genome. Our analysis shows that relatively few of these clusters are near heterochromatin regions and that these clusters tend to be over-dispersed relative to the rest of the genome. This suggests that these clusters are not the byproduct of local gene clustering. We also analyzed the pattern of co-expression among neighboring genes within a single Drosophila species: D. simulans. For the co-expression clusters identified within this species, we find an under-representation of genes displaying a signature of recurrent adaptive amino acid evolution consistent with previous findings. However, clusters displaying co-evolution of expression among species are enriched for adaptively evolving genes. This finding points to a tie between adaptive sequence evolution and evolution of the transcriptome. Conclusion Our results demonstrate that co-evolution of expression in gene clusters is relatively common among species in the D. melanogaster subgroup. We consider the possibility that local regulation of expression in gene clusters may drive the connection between adaptive sequence and coordinated gene expression evolution

    Gender Specific Impact of Replication and Recombination on Rodent Intron Evolution

    Get PDF
    Mutation rate variability has been widely observed across mammalian genomes but the underlying causes are not yet fully understood. This thesis attempts to explain this variation, as assayed by the substitution rate of putatively neutral sites, across rodent genomes at three scales: genic, inter-autosomal and between chromosome types. It was shown that the method commonly employed to estimate the extent of male-bias in the mutation rate is flawed, suggesting that inter-chromosomal variation in mutation rates is not solely due to differences in the number of replications they undergo in each germ-line. Two novel models were proposed that incorporated an additional recombination-associated parameter to explain why, contrary to the theory of male-driven evolution, the autosomes evolve faster than the Y-chromosome. As number of replications could not fully account for mutational variability at any scale, the impact of the time during S-phase when replication occurs was explored. Differential timing of replication was shown to explain both inter-genic and some inter-autosomal variation in intronic substitution rates, with later replicating sequences evolving faster. However, controlling for different replication times failed to account for why number of replications could not explain differences in chromosomal divergence. Further, GC rich sequences were found to evolve slowly because they tend to replicate early. Finally, late replicating genes were found to have high recombination rates in females but low recombination rates in males. These previously unidentified relationships could explain why, owing to sex-specific covariance with replication timing, the strength of covariance between recombination rate and divergence was underestimated in males and overestimated in females. It might also explain why female recombination rates, unlike those in males, do not covary with GC content.EThOS - Electronic Theses Online ServiceGBUnited Kingdo

    Patchy Interspecific Sequence Similarities Efficiently Identify Positive cis-Regulatory Elements in the Sea Urchin

    Get PDF
    We demonstrate that interspecific sequence conservation can provide a systematic guide to the identification of functional cis-regulatory elements within a large expanse of genomic DNA. The test was carried out on the otx gene of Strongylocentrotus purpuratus. This gene plays a major role in the gene regulatory network that underlies endomesoderm specification in the embryo. The cis-regulatory organization of the otx gene is expected to be complex, because the gene has three different start sites (X. Li, C.-K. Chuang, C.-A. Mao, L. M. Angerer, and W. H. Klein, 1997, Dev. Biol. 187, 253–266), and it is expressed in many different spatial domains of the embryo. BAC recombinants containing the otx gene were isolated from Strongylocentrotus purpuratus and Lytechinus variegatus libraries, and the ordered sequence of these BACs was obtained and annotated. Sixty kilobases of DNA flanking the gene, and included in the BAC sequence from both species, were scanned computationally for short conserved sequence elements. For this purpose, we used a newly constructed software package assembled in our laboratory, “FamilyRelations.” This tool allows detection of sequence similarities above a chosen criterion within sliding windows set at 20–50 bp. Seventeen partially conserved regions, most a few hundred base pairs long, were amplified from the S. purpuratus BAC DNA by PCR, inserted in an expression vector driving a CAT reporter, and tested for cis-regulatory activity by injection into fertilized S. purpuratus eggs. The regulatory activity of these constructs was assessed by whole-mount in situ hybridization (WMISH) using a probe against CAT mRNA. Of the 17 constructs, 11 constructs displayed spatially restricted regulatory activity, and 6 were inactive in this test. The domains within which the cis-regulatory constructs were expressed are approximately consistent with results from a WMISH study on otx expression in the embryo, in which we used probes specific for the mRNAs generated from each of the three transcription start sites. Four separate cis-regulatory elements that specifically produce endomesodermal expression were identified, as well as ubiquitously active elements, and ectoderm-specific elements. We confirm predictions from other work with respect to target sites for specific transcription factors within the elements that express in the endoderm

    Multiple Mechanisms Promote the Retained Expression of Gene Duplicates in the Tetraploid Frog Xenopus laevis

    Get PDF
    Gene duplication provides a window of opportunity for biological variants to persist under the protection of a co-expressed copy with similar or redundant function. Duplication catalyzes innovation (neofunctionalization), subfunction degeneration (subfunctionalization), and genetic buffering (redundancy), and the genetic survival of each paralog is triggered by mechanisms that add, compromise, or do not alter protein function. We tested the applicability of three types of mechanisms for promoting the retained expression of duplicated genes in 290 expressed paralogs of the tetraploid clawed frog, Xenopus laevis. Tests were based on explicit expectations concerning the ka/ks ratio, and the number and location of nonsynonymous substitutions after duplication. Functional constraints on the majority of paralogs are not significantly different from a singleton ortholog. However, we recover strong support that some of them have an asymmetric rate of nonsynonymous substitution: 6% match predictions of the neofunctionalization hypothesis in that (1) each paralog accumulated nonsynonymous substitutions at a significantly different rate and (2) the one that evolves faster has a higher ka/ks ratio than the other paralog and than a singleton ortholog. Fewer paralogs (3%) exhibit a complementary pattern of substitution at the protein level that is predicted by enhancement or degradation of different functional domains, and the remaining 13% have a higher average ka/ks ratio in both paralogs that is consistent with altered functional constraints, diversifying selection, or activity-reducing mutations after duplication. We estimate that these paralogs have been retained since they originated by genome duplication between 21 and 41 million years ago. Multiple mechanisms operate to promote the retained expression of duplicates in the same genome, in genes in the same functional class, over the same period of time following duplication, and sometimes in the same pair of paralogs. None of these paralogs are superfluous; degradation or enhancement of different protein subfunctions and neofunctionalization are plausible hypotheses for the retained expression of some of them. Evolution of most X. laevis paralogs, however, is consistent with retained expression via mechanisms that do not radically alter functional constraints, such as selection to preserve post-duplication stoichiometry or temporal, quantitative, or spatial subfunctionalization

    Cell states and transcriptional programs of the healthy human heart

    Get PDF
    Das Herz ist das zentrale Kreislauforgan in unserem Körper und jede Abweichung seiner Funktion wirkt sich negativ auf die Homöostase des gesamten Körpers aus. Die Herzfunktion beruht auf der Synergie der Zellen, die das Organ bilden. Die detaillierte zelluläre Zusammensetzung sowie die Funktionalität der einzelnen Zellen müssen noch ermittelt werden, und diese Arbeit ist eine wichtige Ergänzung dieser Bemühungen. Dank der jüngsten Entwicklungen in den Einzelzelltechnologien sind wir nun in der Lage, Transkriptome einzelner Zellen aus komplexem Gewebe in beispiellosem Umfang zu charakterisieren. Im ersten Schritt eines solchen Experiments müssen die Zellen und Zellkerne aus dem Gewebe befreit und vereinzelt werden. Herzgewebe wirft in dieser Hinsicht einzigartige Herausforderungen auf, darunter die Knappheit des gesunden menschlichen Herzgewebes für die Forschung, das Vorhandensein von Kardiomyozyten, die aufgrund ihrer Größe nicht durch Microfluid-basierte Standardinstrumente passen und deren Multinukleation, sowie mögliche Voreingenommenheit verschiedener Methoden zur Gewebedissoziation. Hier präsentiere ich den umfassenden Zellatlas des gesunden erwachsenen menschlichen Herzens. Ich beginne mit der Methodenentwicklung zur Isolierung von einzelnen Zellen und Zellkernen aus Mausherzen. Um den Zellatlas des menschlichen Herzens zu erstellen, analysiere ich einen Datensatz von fast einer halben Million Einzelzellen und Zellkerne aus sechs Herzregionen von vierzehn gesunden Menschen. In diesem Atlas definieren wir 11 Hauptzelltypen und 62 Zellzustände des menschlichen Herzens. Ein tieferer Fokus wird auf das Herzgefäßsystem gelegt und die Zellen der arterio-venösen Achse sowie deren Wechselwirkungen und potenzielle Funktionalität werden definiert. Insgesamt präsentiert diese Dissertation einen komplex Datensatz aus menschlichem Herzgewebe und liefert neue Einblicke in die Biologie des gesunden Herzens mit Implikationen für kardiovaskuläre Erkrankungen.The heart is the central circulatory organ in our bodies and any discrepancies of its function relative to healthy homeostasis negatively impact the whole body. Cardiac function relies on the synergy of all the cells that constitute the organ. The detailed cellular composition as well as the heterogeneity and functionality of the individual cells is yet to be established and this work is a major advance in this effort. Thanks to the recent developments in single cell genomics technologies, we are now able to profile transcriptomes from individual cells of complex tissues at unprecedented scale. In the first step of such an experiment, the single cells and nuclei need to be liberated from the tissue. Heart tissue presents a unique set of challenges in this regard, including the scarcity of healthy human cardiac tissue for research, large cardiomyocytes that do not fit into the standard droplet-based instruments, multinucleation of cardiomyocytes that might skew the proportions of the recovered nuclei as well as potential bias of tissue dissociation methods. Here I present a cell atlas of the free walls, apex and septum of the healthy adult human heart. I start with methods development for the isolation of single cells and single nuclei from mouse heart. Next, I move to the building of the atlas of the human cells and nuclei, where I describe the dataset of close to half a million single cells and nuclei sampled from 14 organ donors, defining 11 major cell types and 62 cell states of the heart. A deeper focus on the cardiac vasculature defined the cells of the arterio-venous axis as well as their interactions and potential functionality. Overall, this thesis presents a joined dataset of single cells and single nuclei from human cardiac tissues and provides new insights into cardiac biology in heath with implications for cardiovascular disease

    Exploring the Marine Virosphere: From Genome Context to Content

    Get PDF
    corecore