Coexistence of different base periodicities in prokaryotic genomes as related to DNA curvature, supercoiling, and transcription
We analyzed the periodic patterns in E. coli promoters and compared the
distributions of the corresponding patterns in promoters and in the complete
genome to elucidate their function. Except for the three-base periodicity,
coincident with that in the coding regions and growing stronger in the region
downstream from the transcription start (TS), all other salient periodicities
peak upstream of TS. We found that helical periodicities with lengths close to
the B-helix pitch (~10.2-10.5 bp) and the A-helix pitch (~10.8-11.1 bp) coexist
in the genomic sequences. We mapped the distributions of stretches with A-, B-,
and Z-like DNA periodicities onto the E. coli genome. All three periodicities tend
to concentrate within non-coding regions when their intensity becomes stronger
and prevail in the promoter sequences. The comparison with available
experimental data indicates that promoters with the most pronounced
periodicities may be related to the supercoiling-sensitive genes. Comment: 23 pages, 6 figures, 2 tables
Structural attributes of nucleotide sequences in promoter regions of supercoiling-sensitive genes: how to relate microarray expression data with genomic sequences
The level of supercoiling in the chromosome can affect gene expression. To
clarify the basis of supercoiling sensitivity, we analyzed the structural
features of nucleotide sequences in the vicinity of promoters for genes whose
expression is enhanced or decreased in response to loss of chromosomal
supercoiling in E. coli. Fourier analysis of promoter sequences for
supercoiling-sensitive genes reveals a tendency to select sequences
with helical periodicities close to 10 nt for relaxation-induced genes and to
11 nt for relaxation-repressed genes. The helical periodicities in the subsets
of promoters recognized by RNA polymerase with different sigma factors were
also studied. A special procedure was developed to study correlations
between the intensities of periodicities in promoter sequences and the
expression levels of the corresponding genes. Significant correlations of
expression with the AT content and with AT periodicities of about 10, 11, and 50
nt indicate their role in the regulation of supercoiling-sensitive genes. Comment: 38 pages, 12 figures
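The kind of Fourier analysis mentioned in this abstract can be illustrated with a minimal sketch. It is not the authors' procedure: it assumes a simple binary AT indicator and a plain discrete Fourier sum evaluated at chosen periods, and the function names (`indicator`, `power_at_period`, `periodicity_scan`) and the toy sequence are hypothetical.

```python
import cmath
import math


def indicator(seq, bases="AT"):
    """Binary signal: 1.0 where the base belongs to `bases`, else 0.0."""
    return [1.0 if b in bases else 0.0 for b in seq.upper()]


def power_at_period(signal, period):
    """Fourier power of the mean-centred signal at one period (in nt),
    divided by the signal length."""
    n = len(signal)
    mean = sum(signal) / n
    freq = 1.0 / period
    amp = sum((x - mean) * cmath.exp(-2j * math.pi * freq * k)
              for k, x in enumerate(signal))
    return abs(amp) ** 2 / n


def periodicity_scan(seq, periods):
    """Power of the AT indicator at each requested period."""
    sig = indicator(seq)
    return {p: power_at_period(sig, p) for p in periods}


# Toy sequence (an assumption for illustration): an A-tract every 10 nt.
toy = "AAAAACGCGC" * 20
scan = periodicity_scan(toy, [3.0, 10.0, 11.0])
best = max(scan, key=scan.get)
print(best, scan)
```

On real promoter sets one would presumably average such spectra over many aligned sequences and compare them against shuffled controls before reading anything into a 10 nt versus 11 nt peak.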
Oligonucleotide Sequence Motifs as Nucleosome Positioning Signals
To gain a better understanding of the sequence patterns that characterize positioned nucleosomes, we first performed an analysis of the periodicities of the 256 tetranucleotides in a yeast genome-wide library of nucleosomal DNA sequences that was prepared by in vitro reconstitution. The approach entailed the identification and analysis of 24 unique tetranucleotides that were defined by 8 consensus sequences. These consensus sequences were shown to be responsible for most if not all of the tetranucleotide and dinucleotide periodicities displayed by the entire library, demonstrating that the periodicities of dinucleotides that characterize the yeast genome are, in actuality, due primarily to the 8 consensus sequences. A novel combination of experimental and bioinformatic approaches was then used to show that these tetranucleotides are important for preferred formation of nucleosomes at specific sites along DNA in vitro. These results were then compared to tetranucleotide patterns in genome-wide in vivo libraries from yeast and C. elegans in order to assess the contributions of DNA sequence in the control of nucleosome residency in the cell. These comparisons revealed striking similarities in the tetranucleotide occurrence profiles that are likely to be involved in nucleosome positioning in both in vitro and in vivo libraries, suggesting that DNA sequence is an important factor in the control of nucleosome placement in vivo. However, the strengths of the tetranucleotide periodicities were 3–4 fold higher in the in vitro as compared to the in vivo libraries, which implies that DNA sequence plays less of a role in dictating nucleosome positions in vivo. 
The results of this study have important implications for models of sequence-dependent positioning since they suggest that a defined subset of tetranucleotides is involved in preferred nucleosome occupancy and that these tetranucleotides are the major source of the dinucleotide periodicities that are characteristic of positioned nucleosomes.
An Unusual 500,000 Bases Long Oscillation of Guanine and Cytosine Content in Human Chromosome 21
An oscillation with a period of around 500 kb in guanine and cytosine content
(GC%) is observed in the DNA sequence of human chromosome 21. This oscillation
is localized in the rightmost one-eighth region of the chromosome, from 43.5 Mb
to 46.5 Mb. Five cycles of oscillation are observed in this region with six
GC-rich peaks and five GC-poor valleys. The GC-poor valleys comprise regions
with low density of CpG islands and, alternating between the two DNA strands,
low gene density regions. Consequently, the long-range oscillation of GC%
results in spacing patterns of both CpG island density and, to a lesser extent,
gene density. Comment: 15 pages (figures included), 5 figures
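A long-range GC% oscillation of this kind is usually made visible with a sliding-window profile. The sketch below is a generic illustration, not the method used in the paper; the helper names (`gc_percent`, `windowed_gc`), the window and step sizes, and the toy sequence are assumptions.

```python
def gc_percent(seq):
    """GC content in percent, ignoring characters other than A, C, G, T."""
    seq = seq.upper()
    counted = sum(seq.count(b) for b in "ACGT")
    if counted == 0:
        return float("nan")
    return 100.0 * (seq.count("G") + seq.count("C")) / counted


def windowed_gc(seq, window, step):
    """List of (window start, GC%) pairs for windows of fixed length."""
    return [(start, gc_percent(seq[start:start + window]))
            for start in range(0, len(seq) - window + 1, step)]


# Toy sequence (an assumption): alternating GC-rich and AT-rich blocks.
toy = ("GC" * 50 + "AT" * 50) * 3
profile = windowed_gc(toy, window=100, step=100)
for start, gc in profile:
    print(start, gc)
```

For a chromosome-scale signal with a period near 500 kb, the window would need to be much shorter than the period (tens of kilobases, for example) so that peaks and valleys are not averaged away.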
Nucleosome DNA sequence structure of isochores
Background: Significant differences in G+C content between different isochore types suggest that the nucleosome positioning patterns in DNA of the isochores should be different as well. Results: Extraction of the patterns from the isochore DNA sequences by Shannon N-gram extension reveals that while the general motif YRRRRRYYYYYR is characteristic for all isochore types, the dominant positioning patterns of the isochores vary between TAAAAATTTTTA and CGGGGGCCCCCG due to the large differences in G+C composition. This is observed in human, mouse and chicken isochores, demonstrating that the variations of the positioning patterns are largely G+C dependent rather than species-specific. The species-specificity of nucleosome positioning patterns is revealed by dinucleotide periodicity analyses in isochore sequences. While human sequences are showing CG periodicity, chicken isochores display AG (CT) periodicity. Mouse isochores show very weak CG periodicity only. Conclusions: Nucleosome positioning pattern as revealed by Shannon N-gram extension is strongly dependent on G+C content and different in different isochores. Species-specificity of the pattern is subtle. It is reflected in the choice of preferentially periodical dinucleotides.
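The abstract does not say how dinucleotide periodicity was measured. One common approach is a positional autocorrelation: count how often the same dinucleotide recurs at each distance. The sketch below follows that approach as an assumption; the function names and the toy sequence are hypothetical.

```python
def dinucleotide_positions(seq, dinuc):
    """Start positions of every occurrence of `dinuc` in `seq`."""
    seq = seq.upper()
    dinuc = dinuc.upper()
    return [i for i in range(len(seq) - 1) if seq[i:i + 2] == dinuc]


def distance_histogram(seq, dinuc, max_dist):
    """For each distance d in 1..max_dist, count the pairs of `dinuc`
    occurrences whose start positions are exactly d nt apart."""
    positions = set(dinucleotide_positions(seq, dinuc))
    return {d: sum(1 for p in positions if p + d in positions)
            for d in range(1, max_dist + 1)}


# Toy sequence (an assumption): one CG every 10 nt.
toy = "CGATATATAT" * 15
hist = distance_histogram(toy, "CG", max_dist=25)
peaks = [d for d, count in hist.items() if count > 0]
print(peaks)
```

A periodic dinucleotide shows up as counts concentrated at multiples of the period; in genomic data the peaks are much weaker and would need comparison with a shuffled-sequence baseline.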
Sequence-dependent histone variant positioning signatures
Background: The nucleosome, the fundamental unit of chromatin, is formed by wrapping nearly 147 bp of DNA around an octamer of histone proteins. This histone core has many variants that differ from each other in their biochemical compositions as well as their biological functions. Although the deposition of histone variants onto chromatin has been implicated in many important biological processes, such as transcription and replication, the mechanisms by which they are deposited on target sites are still obscure. Results: By analyzing genomic sequences of nucleosomes bearing different histone variants from human, including H2A.Z, H3.3 and both (H3.3/H2A.Z, so-called double variant histones), we found that genomic sequence contributes in part to determining target sites for different histone variants. Moreover, the dinucleotides CA/TG are remarkably important in distinguishing target sites of H2A.Z-only nucleosomes from those of H3.3-containing (both H3.3-only and double variant) nucleosomes. Conclusions: There exists a DNA-related mechanism regulating the deposition of different histone variants onto chromatin and the biological outcomes thereof. This provides additional insights into epigenetic regulatory mechanisms of many important cellular processes.
In the search for the low-complexity sequences in prokaryotic and eukaryotic genomes: how to derive a coherent picture from global and local entropy measures
We investigate a possible way to connect the presence of Low-Complexity
Sequences (LCS) in DNA genomes and the nonstationary properties of base
correlations. Under the hypothesis that these variations signal a change in the
DNA function, we use a new technique, called Non-Stationarity Entropic Index
(NSEI) method, and we prove that this technique is an efficient way to detect
functional changes with respect to a random baseline. The remarkable aspect is
that NSEI does not require any training data or fitting parameters, the only
arbitrary element being the choice of a marker in the sequence. We make this choice
on the basis of biological information about LCS distributions in genomes. We
show that there exists a correlation between changes in the amount of LCS and the
ratio of long- to short-range correlation.