52 research outputs found
VEZF1 elements mediate protection from DNA methylation
There is growing consensus that genome organization and long-range gene regulation involves partitioning of the genome into domains of distinct epigenetic chromatin states. Chromatin insulator or barrier elements are key components of these processes as they can establish boundaries between chromatin states. The ability of elements such as the paradigm β-globin HS4 insulator to block the range of enhancers or the spread of repressive histone modifications is well established. Here we have addressed the hypothesis that a barrier element in vertebrates should be capable of defending a gene from silencing by DNA methylation. Using an established stable reporter gene system, we find that HS4 acts specifically to protect a gene promoter from de novo DNA methylation. Notably, protection from methylation can occur in the absence of histone acetylation or transcription. There is a division of labor at HS4; the sequences that mediate protection from methylation are separable from those that mediate CTCF-dependent enhancer blocking and USF-dependent histone modification recruitment. The zinc finger protein VEZF1 was purified as the factor that specifically interacts with the methylation protection elements. VEZF1 is a candidate CpG island protection factor as the G-rich sequences bound by VEZF1 are frequently found at CpG island promoters. Indeed, we show that VEZF1 elements are sufficient to mediate demethylation and protection of the APRT CpG island promoter from DNA methylation. We propose that many barrier elements in vertebrates will prevent DNA methylation in addition to blocking the propagation of repressive histone modifications, as either process is sufficient to direct the establishment of an epigenetically stable silent chromatin stat
Variant Histone H2A.Z Is Globally Localized to the Promoters of Inactive Yeast Genes and Regulates Nucleosome Positioning
H2A.Z is an evolutionary conserved histone variant involved in transcriptional regulation, antisilencing, silencing, and genome stability. The mechanism(s) by which H2A.Z regulates these various biological functions remains poorly defined, in part due to the lack of knowledge regarding its physical location along chromosomes and the bearing it has in regulating chromatin structure. Here we mapped H2A.Z across the yeast genome at an approximately 300-bp resolution, using chromatin immunoprecipitation combined with tiling microarrays. We have identified 4,862 small regions—typically one or two nucleosomes wide—decorated with H2A.Z. Those “Z loci” are predominantly found within specific nucleosomes in the promoter of inactive genes all across the genome. Furthermore, we have shown that H2A.Z can regulate nucleosome positioning at the GAL1 promoter. Within HZAD domains, the regions where H2A.Z shows an antisilencing function, H2A.Z is localized in a wider pattern, suggesting that the variant histone regulates a silencing and transcriptional activation via different mechanisms. Our data suggest that the incorporation of H2A.Z into specific promoter-bound nucleosomes configures chromatin structure to poise genes for transcriptional activation. The relevance of these findings to higher eukaryotes is discussed
Recommended from our members
A draft human pangenome reference
Here the Human Pangenome Reference Consortium presents a first draft of the human pangenome reference. The pangenome contains 47 phased, diploid assemblies from a cohort of genetically diverse individuals1. These assemblies cover more than 99% of the expected sequence in each genome and are more than 99% accurate at the structural and base pair levels. Based on alignments of the assemblies, we generate a draft pangenome that captures known variants and haplotypes and reveals new alleles at structurally complex loci. We also add 119 million base pairs of euchromatic polymorphic sequences and 1,115 gene duplications relative to the existing reference GRCh38. Roughly 90 million of the additional base pairs are derived from structural variation. Using our draft pangenome to analyse short-read data reduced small variant discovery errors by 34% and increased the number of structural variants detected per haplotype by 104% compared with GRCh38-based workflows, which enabled the typing of the vast majority of structural variant alleles per sample
Recommended from our members
Gaps and complex structurally variant loci in phased genome assemblies
There has been tremendous progress in phased genome assembly production by combining long-read data with parental information or linked-read data. Nevertheless, a typical phased genome assembly generated by trio-hifiasm still generates more than 140 gaps. We perform a detailed analysis of gaps, assembly breaks, and misorientations from 182 haploid assemblies obtained from a diversity panel of 77 unique human samples. Although trio-based approaches using HiFi are the current gold standard, chromosome-wide phasing accuracy is comparable when using Strand-seq instead of parental data. Importantly, the majority of assembly gaps cluster near the largest and most identical repeats (including segmental duplications [35.4%], satellite DNA [22.3%], or regions enriched in GA/AT-rich DNA [27.4%]). Consequently, 1513 protein-coding genes overlap assembly gaps in at least one haplotype, and 231 are recurrently disrupted or missing from five or more haplotypes. Furthermore, we estimate that 6-7 Mbp of DNA are misorientated per haplotype irrespective of whether trio-free or trio-based approaches are used. Of these misorientations, 81% correspond to bona fide large inversion polymorphisms in the human species, most of which are flanked by large segmental duplications. We also identify large-scale alignment discontinuities consistent with 11.9 Mbp of deletions and 161.4 Mbp of insertions per haploid genome. Although 99% of this variation corresponds to satellite DNA, we identify 230 regions of euchromatic DNA with frequent expansions and contractions, nearly half of which overlap with 197 protein-coding genes. Such variable and incompletely assembled regions are important targets for future algorithmic development and pangenome representation
Finishing the euchromatic sequence of the human genome
The sequence of the human genome encodes the genetic instructions for human physiology, as well as rich information about human evolution. In 2001, the International Human Genome Sequencing Consortium reported a draft sequence of the euchromatic portion of the human genome. Since then, the international collaboration has worked to convert this draft into a genome sequence with high accuracy and nearly complete coverage. Here, we report the result of this finishing process. The current genome sequence (Build 35) contains 2.85 billion nucleotides interrupted by only 341 gaps. It covers ∼99% of the euchromatic genome and is accurate to an error rate of ∼1 event per 100,000 bases. Many of the remaining euchromatic gaps are associated with segmental duplications and will require focused work with new methods. The near-complete sequence, the first for a vertebrate, greatly improves the precision of biological analyses of the human genome including studies of gene number, birth and death. Notably, the human enome seems to encode only 20,000-25,000 protein-coding genes. The genome sequence reported here should serve as a firm foundation for biomedical research in the decades ahead
Pan-conserved segment tags identify ultra-conserved sequences across assemblies in the human pangenome
The human pangenome, a new reference sequence, addresses many limitations of the current GRCh38 reference. The first release is based on 94 high-quality haploid assemblies from individuals with diverse backgrounds. We employed a k-mer indexing strategy for comparative analysis across multiple assemblies, including the pangenome reference, GRCh38, and CHM13, a telomere-to-telomere reference assembly. Our k-mer indexing approach enabled us to identify a valuable collection of universally conserved sequences across all assemblies, referred to as “pan-conserved segment tags” (PSTs). By examining intervals between these segments, we discerned highly conserved genomic segments and those with structurally related polymorphisms. We found 60,764 polymorphic intervals with unique geo-ethnic features in the pangenome reference. In this study, we utilized ultra-conserved sequences (PSTs) to forge a link between human pangenome assemblies and reference genomes. This methodology enables the examination of any sequence of interest within the pangenome, using the reference genome as a comparative framework
The Protein CTCF Is Required for the Enhancer Blocking Activity of Vertebrate Insulators
AbstractAn insulator is a DNA sequence that can act as a barrier to the influences of neighboring cis-acting elements, preventing gene activation, for example, when located between an enhancer and a promoter. We have identified a 42 bp fragment of the chicken β-globin insulator that is both necessary and sufficient for enhancer blocking activity in human cells. We show that this sequence is the binding site for CTCF, a previously identified eleven–zinc finger DNA-binding protein that is highly conserved in vertebrates. CTCF sites are present in all of the vertebrate enhancer-blocking elements we have examined. We suggest that directional enhancer blocking by CTCF is a conserved component of gene regulation in vertebrates
Positional enhancer-blocking activity of the chicken β-globin insulator in transiently transfected cells
It is thought that insulators demarcate transcriptionally and structurally independent chromatin domains. Insulators are detected by their ability to block enhancer–promoter interactions in a directional manner, and protect a transgene from position effects. Most studies are performed in stably transformed cells or organisms. Here we analyze the enhancer-blocking activity of the chicken β-globin insulator in transient transfection experiments in both erythroid and nonerythroid cell lines. We show that four tandem copies of a 90-bp fragment of this insulator were able to block an enhancer in these experiments. In circular plasmids, placement on either side of the enhancer reduced activity, but when the plasmid was linearized, the enhancer-blocking activity was observed only when the insulator was placed between the promoter and the enhancer. These observations are consistent with the position-dependent enhancer-blocking activity of the insulator observed in stable transformation experiments.</jats:p
- …
