92 research outputs found
High-throughput sequencing reveals a simple model of nucleosome energetics
We use nucleosome maps obtained by high-throughput sequencing to study
sequence specificity of intrinsic histone-DNA interactions. In contrast with
previous approaches, we employ an analogy between a classical one-dimensional
fluid of finite-size particles in an arbitrary external potential and arrays of
DNA-bound histone octamers. We derive an analytical solution to infer free
energies of nucleosome formation directly from nucleosome occupancies measured
in high-throughput experiments. The sequence-specific part of free energies is
then captured by fitting them to a sum of energies assigned to individual
nucleotide motifs. We have developed hierarchical models of increasing
complexity and spatial resolution, establishing that nucleosome occupancies can
be explained by systematic differences in mono- and dinucleotide content
between nucleosomal and linker DNA sequences, with periodic dinucleotide
distributions and longer sequence motifs playing a secondary role. Furthermore,
similar sequence signatures are exhibited by control experiments in which
genomic DNA is either sonicated or digested with micrococcal nuclease in the
absence of nucleosomes, making it possible that current predictions based on
high-throughput nucleosome positioning maps are biased by experimental
artifacts.Comment: 36 pages, 13 figure
Improved annotation of 3' untranslated regions and complex loci by combination of strand-specific direct RNA sequencing, RNA-seq and ESTs
The reference annotations made for a genome sequence provide the framework
for all subsequent analyses of the genome. Correct annotation is particularly
important when interpreting the results of RNA-seq experiments where short
sequence reads are mapped against the genome and assigned to genes according to
the annotation. Inconsistencies in annotations between the reference and the
experimental system can lead to incorrect interpretation of the effect on RNA
expression of an experimental treatment or mutation in the system under study.
Until recently, the genome-wide annotation of 3-prime untranslated regions
received less attention than coding regions and the delineation of intron/exon
boundaries. In this paper, data produced for samples in Human, Chicken and A.
thaliana by the novel single-molecule, strand-specific, Direct RNA Sequencing
technology from Helicos Biosciences which locates 3-prime polyadenylation sites
to within +/- 2 nt, were combined with archival EST and RNA-Seq data. Nine
examples are illustrated where this combination of data allowed: (1) gene and
3-prime UTR re-annotation (including extension of one 3-prime UTR by 5.9 kb);
(2) disentangling of gene expression in complex regions; (3) clearer
interpretation of small RNA expression and (4) identification of novel genes.
While the specific examples displayed here may become obsolete as genome
sequences and their annotations are refined, the principles laid out in this
paper will be of general use both to those annotating genomes and those seeking
to interpret existing publically available annotations in the context of their
own experimental dataComment: 44 pages, 9 figure
Nucleolar Association and Transcriptional Inhibition through 5S rDNA in Mammals
Changes in the spatial positioning of genes within the mammalian nucleus have been associated with transcriptional differences and thus have been hypothesized as a mode of regulation. In particular, the localization of genes to the nuclear and nucleolar peripheries is associated with transcriptional repression. However, the mechanistic basis, including the pertinent cis- elements, for such associations remains largely unknown. Here, we provide evidence that demonstrates a 119 bp 5S rDNA can influence nucleolar association in mammals. We found that integration of transgenes with 5S rDNA significantly increases the association of the host region with the nucleolus, and their degree of association correlates strongly with repression of a linked reporter gene. We further show that this mechanism may be functional in endogenous contexts: pseudogenes derived from 5S rDNA show biased conservation of their internal transcription factor binding sites and, in some cases, are frequently associated with the nucleolus. These results demonstrate that 5S rDNA sequence can significantly contribute to the positioning of a locus and suggest a novel, endogenous mechanism for nuclear organization in mammals
Silent chromatin at the middle and ends: lessons from yeasts
Eukaryotic centromeres and telomeres are specialized chromosomal regions that share one common characteristic: their underlying DNA sequences are assembled into heritably repressed chromatin. Silent chromatin in budding and fission yeast is composed of fundamentally divergent proteins tat assemble very different chromatin structures. However, the ultimate behaviour of silent chromatin and the pathways that assemble it seem strikingly similar among Saccharomyces cerevisiae (S. cerevisiae), Schizosaccharomyces pombe (S. pombe) and other eukaryotes. Thus, studies in both yeasts have been instrumental in dissecting the mechanisms that establish and maintain silent chromatin in eukaryotes, contributing substantially to our understanding of epigenetic processes. In this review, we discuss current models for the generation of heterochromatic domains at centromeres and telomeres in the two yeast species
Structural basis of RNA polymerase III transcription initiation.
RNA polymerase (Pol) III transcribes essential non-coding RNAs, including the entire pool of transfer RNAs, the 5S ribosomal RNA and the U6 spliceosomal RNA, and is often deregulated in cancer cells. The initiation of gene transcription by Pol III requires the activity of the transcription factor TFIIIB to form a transcriptionally active Pol III preinitiation complex (PIC). Here we present electron microscopy reconstructions of Pol III PICs at 3.4-4.0 Å and a reconstruction of unbound apo-Pol III at 3.1 Å. TFIIIB fully encircles the DNA and restructures Pol III. In particular, binding of the TFIIIB subunit Bdp1 rearranges the Pol III-specific subunits C37 and C34, thereby promoting DNA opening. The unwound DNA directly contacts both sides of the Pol III cleft. Topologically, the Pol III PIC resembles the Pol II PIC, whereas the Pol I PIC is more divergent. The structures presented unravel the molecular mechanisms underlying the first steps of Pol III transcription and also the general conserved mechanisms of gene transcription initiation
A user's guide to the Encyclopedia of DNA elements (ENCODE)
The mission of the Encyclopedia of DNA Elements (ENCODE) Project is to enable the scientific and medical communities to interpret the human genome sequence and apply it to understand human biology and improve health. The ENCODE Consortium is integrating multiple technologies and approaches in a collective effort to discover and define the functional elements encoded in the human genome, including genes, transcripts, and transcriptional regulatory regions, together with their attendant chromatin states and DNA methylation patterns. In the process, standards to ensure high-quality data have been implemented, and novel algorithms have been developed to facilitate analysis. Data and derived results are made available through a freely accessible database. Here we provide an overview of the project and the resources it is generating and illustrate the application of ENCODE data to interpret the human genome
- …