333 research outputs found
Superhelical Destabilization in Regulatory Regions of Stress Response Genes
Stress-induced DNA duplex destabilization (SIDD) analysis exploits the known structural and energetic properties of DNA to predict sites that are susceptible to strand separation under negative superhelical stress. When this approach was used to calculate the SIDD profile of the entire Escherichia coli K12 genome, it was found that strongly destabilized sites occur preferentially in intergenic regions that are either known or inferred to contain promoters, but rarely occur in coding regions. Here, we investigate whether the genes grouped in different functional categories have characteristic SIDD properties in their upstream flanks. We report that strong SIDD sites in the E. coli K12 genome are statistically significantly overrepresented in the upstream regions of genes encoding transcriptional regulators. In particular, the upstream regions of genes that directly respond to physiological and environmental stimuli are more destabilized than are those regions of genes that are not involved in these responses. Moreover, if a pathway is controlled by a transcriptional regulator whose gene has a destabilized 5′ flank, then the genes (operons) in that pathway also usually contain strongly destabilized SIDD sites in their 5′ flanks. We observe this statistically significant association of SIDD sites with upstream regions of genes functioning in transcription in 38 of 43 genomes of free-living bacteria, but in only four of 18 genomes of endosymbionts or obligate parasitic bacteria. These results suggest that strong SIDD sites 5′ to participating genes may be involved in transcriptional responses to environmental changes, which are known to transiently alter superhelicity. We propose that these SIDD sites are active and necessary participants in superhelically mediated regulatory mechanisms governing changes in the global pattern of gene expression in prokaryotes in response to physiological or environmental changes
SIDDBASE: a database containing the stress-induced DNA duplex destabilization (SIDD) profiles of complete microbial genomes
Prokaryotic genomic DNA is generally negatively supercoiled in vivo. Many regulatory processes, including the initiation of transcription, are known to depend on the superhelical state of the DNA substrate. The stresses induced within DNA by negative superhelicity can destabilize the DNA duplex at specific sites. Various experiments have either shown or suggested that stress-induced DNA duplex destabilization (SIDD) is involved in specific regulatory mechanisms governing a variety of biological processes. We have developed methods to evaluate the SIDD properties of DNA sequences, including complete chromosomes. This analysis predicts the locations where the duplex becomes destabilized under superhelical stress. Previous studies have shown that the SIDD-susceptible sites predicted in this way occur at rates much higher than expected at random in transcriptional regulatory regions, and much lower than expected in coding regions. Analysis of the SIDD profiles of 42 bacterial genomes chosen for their diversity confirms this pattern. Predictions of SIDD sites have been used to identify potential genomic regulatory regions, and suggest both possible regulatory mechanisms involving stress-induced destabilization and experimental tests of these mechanisms. Here we describe the SIDDBASE database which enables users to retrieve and visualize the results of SIDD analyses of completely sequenced prokaryotic and archaeal genomes, together with their annotations. SIDDBASE is available at
Theoretical Analysis of the Stress Induced B-Z Transition in Superhelical DNA
We present a method to calculate the propensities of regions within a DNA molecule to transition from B-form to Z-form under negative superhelical stresses. We use statistical mechanics to analyze the competition that occurs among all susceptible Z-forming regions at thermodynamic equilibrium in a superhelically stressed DNA of specified sequence. This method, which we call SIBZ, is similar to the SIDD algorithm that was previously developed to analyze superhelical duplex destabilization. A state of the system is determined by assigning to each base pair either the B- or the Z-conformation, accounting for the dinucleotide repeat unit of Z-DNA. The free energy of a state is comprised of the nucleation energy, the sequence-dependent B-Z transition energy, and the energy associated with the residual superhelicity remaining after the change of twist due to transition. Using this information, SIBZ calculates the equilibrium B-Z transition probability of each base pair in the sequence. This can be done at any physiologically reasonable level of negative superhelicity. We use SIBZ to analyze a variety of representative genomic DNA sequences. We show that the dominant Z-DNA forming regions in a sequence can compete in highly complex ways as the superhelicity level changes. Despite having no tunable parameters, the predictions of SIBZ agree precisely with experimental results, both for the onset of transition in plasmids containing introduced Z-forming sequences and for the locations of Z-forming regions in genomic sequences. We calculate the transition profiles of 5 kb regions taken from each of 12,841 mouse genes and centered on the transcription start site (TSS). We find a substantial increase in the frequency of Z-forming regions immediately upstream from the TSS. The approach developed here has the potential to illuminate the occurrence of Z-form regions in vivo, and the possible roles this transition may play in biological processes
Theoretical Analysis of Competing Conformational Transitions in Superhelical DNA
We develop a statistical mechanical model to analyze the competitive behavior of transitions to multiple alternate conformations in a negatively supercoiled DNA molecule of kilobase length and specified base sequence. Since DNA superhelicity topologically couples together the transition behaviors of all base pairs, a unified model is required to analyze all the transitions to which the DNA sequence is susceptible. Here we present a first model of this type. Our numerical approach generalizes the strategy of previously developed algorithms, which studied superhelical transitions to a single alternate conformation. We apply our multi-state model to study the competition between strand separation and B-Z transitions in superhelical DNA. We show this competition to be highly sensitive to temperature and to the imposed level of supercoiling. Comparison of our results with experimental data shows that, when the energetics appropriate to the experimental conditions are used, the competition between these two transitions is accurately captured by our algorithm. We analyze the superhelical competition between B-Z transitions and denaturation around the c-myc oncogene, where both transitions are known to occur when this gene is transcribing. We apply our model to explore the correlation between stress-induced transitions and transcriptional activity in various organisms. In higher eukaryotes we find a strong enhancement of Z-forming regions immediately 5′ to their transcription start sites (TSS), and a depletion of strand separating sites in a broad region around the TSS. The opposite patterns occur around transcript end locations. We also show that susceptibility to each type of transition is different in eukaryotes and prokaryotes. By analyzing a set of untranscribed pseudogenes we show that the Z-susceptibility just downstream of the TSS is not preserved, suggesting it may be under selection pressure
Superhelical Duplex Destabilization and the Recombination Position Effect
The susceptibility to recombination of a plasmid inserted into a chromosome
varies with its genomic position. This recombination position effect is known to
correlate with the average G+C content of the flanking sequences. Here we
propose that this effect could be mediated by changes in the susceptibility to
superhelical duplex destabilization that would occur. We use standard
nonparametric statistical tests, regression analysis and principal component
analysis to identify statistically significant differences in the
destabilization profiles calculated for the plasmid in different contexts, and
correlate the results with their measured recombination rates. We show that the
flanking sequences significantly affect the free energy of denaturation at
specific sites interior to the plasmid. These changes correlate well with
experimentally measured variations of the recombination rates within the
plasmid. This correlation of recombination rate with superhelical
destabilization properties of the inserted plasmid DNA is stronger than that
with average G+C content of the flanking sequences. This model suggests a
possible mechanism by which flanking sequence base composition, which is not
itself a context-dependent attribute, can affect recombination rates at
positions within the plasmid
A Fundamental Regulatory Mechanism Operating through OmpR and DNA Topology Controls Expression of Salmonella Pathogenicity Islands SPI-1 and SPI-2
DNA topology has fundamental control over the ability of transcription factors to access their target DNA sites at gene promoters. However, the influence of DNA topology on protein–DNA and protein–protein interactions is poorly understood. For example, relaxation of DNA supercoiling strongly induces the well-studied pathogenicity gene ssrA (also called spiR) in Salmonella enterica, but neither the mechanism nor the proteins involved are known. We have found that relaxation of DNA supercoiling induces expression of the Salmonella pathogenicity island (SPI)-2 regulator ssrA as well as the SPI-1 regulator hilC through a mechanism that requires the two-component regulator OmpR-EnvZ. Additionally, the ompR promoter is autoregulated in the same fashion. Conversely, the SPI-1 regulator hilD is induced by DNA relaxation but is repressed by OmpR. Relaxation of DNA supercoiling caused an increase in OmpR binding to DNA and a concomitant decrease in binding by the nucleoid-associated protein FIS. The reciprocal occupancy of DNA by OmpR and FIS was not due to antagonism between these transcription factors, but was instead a more intrinsic response to altered DNA topology. Surprisingly, DNA relaxation had no detectable effect on the binding of the global repressor H-NS. These results reveal the underlying molecular mechanism that primes SPI genes for rapid induction at the onset of host invasion. Additionally, our results reveal novel features of the archetypal two-component regulator OmpR. OmpR binding to relaxed DNA appears to generate a locally supercoiled state, which may assist promoter activation by relocating supercoiling stress-induced destabilization of DNA strands. Much has been made of the mechanisms that have evolved to regulate horizontally-acquired genes such as SPIs, but parallels among the ssrA, hilC, and ompR promoters illustrate that a fundamental form of regulation based on DNA topology coordinates the expression of these genes regardless of their origins
Recommended from our members
Interplay between DNA sequence and negative superhelicity drives R-loop structures.
R-loops are abundant three-stranded nucleic-acid structures that form in cis during transcription. Experimental evidence suggests that R-loop formation is affected by DNA sequence and topology. However, the exact manner by which these factors interact to determine R-loop susceptibility is unclear. To investigate this, we developed a statistical mechanical equilibrium model of R-loop formation in superhelical DNA. In this model, the energy involved in forming an R-loop includes four terms-junctional and base-pairing energies and energies associated with superhelicity and with the torsional winding of the displaced DNA single strand around the RNA:DNA hybrid. This model shows that the significant energy barrier imposed by the formation of junctions can be overcome in two ways. First, base-pairing energy can favor RNA:DNA over DNA:DNA duplexes in favorable sequences. Second, R-loops, by absorbing negative superhelicity, partially or fully relax the rest of the DNA domain, thereby returning it to a lower energy state. In vitro transcription assays confirmed that R-loops cause plasmid relaxation and that negative superhelicity is required for R-loops to form, even in a favorable region. Single-molecule R-loop footprinting following in vitro transcription showed a strong agreement between theoretical predictions and experimental mapping of stable R-loop positions and further revealed the impact of DNA topology on the R-loop distribution landscape. Our results clarify the interplay between base sequence and DNA superhelicity in controlling R-loop stability. They also reveal R-loops as powerful and reversible topology sinks that cells may use to nonenzymatically relieve superhelical stress during transcription
Salerno's model of DNA reanalysed: could solitons have biological significance?
We investigate the sequence-dependent behaviour of localised excitations in a
toy, nonlinear model of DNA base-pair opening originally proposed by Salerno.
Specifically we ask whether ``breather'' solitons could play a role in the
facilitated location of promoters by RNA polymerase. In an effective potential
formalism, we find excellent correlation between potential minima and {\em
Escherichia coli} promoter recognition sites in the T7 bacteriophage genome.
Evidence for a similar relationship between phage promoters and downstream
coding regions is found and alternative reasons for links between AT richness
and transcriptionally-significant sites are discussed. Consideration of the
soliton energy of translocation provides a novel dynamical picture of sliding:
steep potential gradients correspond to deterministic motion, while ``flat''
regions, corresponding to homogeneous AT or GC content, are governed by random,
thermal motion. Finally we demonstrate an interesting equivalence between
planar, breather solitons and the helical motion of a sliding protein
``particle'' about a bent DNA axis.Comment: Latex file 20 pages, 5 figures. Manuscript of paper to appear in J.
Biol. Phys., accepted 02/09/0
Long-range correlations in the mechanics of small DNA circles under topological stress revealed by multi-scale simulation
It is well established that gene regulation can be achieved through activator and repressor proteins that bind to DNA and switch particular genes on or off, and that complex metabolic networks deter- mine the levels of transcription of a given gene at a given time. Using three complementary computa- tional techniques to study the sequence-dependence of DNA denaturation within DNA minicircles, we have observed that whenever the ends of the DNA are con- strained, information can be transferred over long distances directly by the transmission of mechanical stress through the DNA itself, without any require- ment for external signalling factors. Our models com- bine atomistic molecular dynamics (MD) with coarse- grained simulations and statistical mechanical calcu- lations to span three distinct spatial resolutions and timescale regimes. While they give a consensus view of the non-locality of sequence-dependent denatura- tion in highly bent and supercoiled DNA loops, each also reveals a unique aspect of long-range informa- tional transfer that occurs as a result of restraining the DNA within the closed loop of the minicircles
The genome-wide distribution of non-B DNA motifs is shaped by operon structure and suggests the transcriptional importance of non-B DNA structures in Escherichia coli
Although the right-handed double helical B-form DNA is most common under physiological conditions, DNA is dynamic and can adopt a number of alternative structures, such as the four-stranded G-quadruplex, left-handed Z-DNA, cruciform and others. Active transcription necessitates strand separation and can induce such non-canonical forms at susceptible genomic sequences. Therefore, it has been speculated that these non-B DNA motifs can play regulatory roles in gene transcription. Such conjecture has been supported in higher eukaryotes by direct studies of several individual genes, as well as a number of large-scale analyses. However, the role of non-B DNA structures in many lower organisms, in particular proteobacteria, remains poorly understood and incompletely documented. In this study, we performed the first comprehensive study of the occurrence of B DNA–non-B DNA transition-susceptible sites (non-B DNA motifs) within the context of the operon structure of the Escherichia coli genome. We compared the distributions of non-B DNA motifs in the regulatory regions of operons with those from internal regions. We found an enrichment of some non-B DNA motifs in regulatory regions, and we show that this enrichment cannot be simply explained by base composition bias in these regions. We also showed that the distribution of several non-B DNA motifs within intergenic regions separating divergently oriented operons differs from the distribution found between convergent ones. In particular, we found a strong enrichment of cruciforms in the termination region of operons; this enrichment was observed for operons with Rho-dependent, as well as Rho-independent terminators. Finally, a preference for some non-B DNA motifs was observed near transcription factor-binding sites. Overall, the conspicuous enrichment of transition-susceptible sites in these specific regulatory regions suggests that non-B DNA structures may have roles in the transcriptional regulation of specific operons within the E. coli genome
- …