7 research outputs found

    Identification of multiple genomic DNA sequences which form i-motif structures at neutral pH

    Get PDF
    i-Motifs are alternative DNA secondary structures formed in cytosine-rich sequences. Particular examples of these structures, traditionally assumed to be stable only at acidic pH, have been found to form under near-physiological conditions. To determine the potential impact of these structures on physiological processes, investigation of sequences with the capacity to fold under physiological conditions is required. Here we describe a systematic study of cytosine-rich DNA sequences, with varying numbers of consecutive cytosines, to gain insights into i-motif DNA sequence and structure stability. i-Motif formation was assessed using ultraviolet spectroscopy, circular dichroism and native gel electrophoresis. We found that increasing cytosine tract lengths resulted in increased thermal stability; sequences with at least five cytosines per tract folded into i-motif at room temperature and neutral pH. Using these results, we postulated a folding rule for i-motif formation, analogous to (but different from) that for G-quadruplexes. This indicated that thousands of cytosine-rich sequences in the human genome may fold into i-motif structures under physiological conditions. Many of these were found in locations where structure formation is likely to influence gene expression. Characterization of a selection of these identified i-motif forming sequences uncovered 17 genomic i-motif forming sequence examples which were stable at neutral pH

    Identification of multiple genomic DNA sequences which form i-motif structures at neutral pH

    Get PDF
    i-Motifs are alternative DNA secondary structures formed in cytosine-rich sequences. Particular examples of these structures, traditionally assumed to be stable only at acidic pH, have been found to form under near-physiological conditions. To determine the potential impact of these structures on physiological processes, investigation of sequences with the capacity to fold under physiological conditions is required. Here we describe a systematic study of cytosine-rich DNA sequences, with varying numbers of consecutive cytosines, to gain insights into i-motif DNA sequence and structure stability. i-Motif formation was assessed using ultraviolet spectroscopy, circular dichroism and native gel electrophoresis. We found that increasing cytosine tract lengths resulted in increased thermal stability; sequences with at least five cytosines per tract folded into i-motif at room temperature and neutral pH. Using these results, we postulated a folding rule for i-motif formation, analogous to (but different from) that for G-quadruplexes. This indicated that thousands of cytosine-rich sequences in the human genome may fold into i-motif structures under physiological conditions. Many of these were found in locations where structure formation is likely to influence gene expression. Characterization of a selection of these identified i-motif forming sequences uncovered 17 genomic i-motif forming sequence examples which were stable at neutral pH

    Substitution of Cytosine with Guanylurea Decreases the Stability of i-Motif DNA

    Get PDF
    Both 5-aza-2′-deoxycytidine (decitabine) and its primary breakdown product, 2′-deoxyriboguanylurea (GuaUre-dR), have been shown to act as mutagens and epimutagens that cause replication stress and alter both DNA methylation and gene expression patterns. As cytosine analogues, both are expected to be preferentially incorporated into regions of GC skew where runs of cytosine residues are sequestered on one strand and guanine residues on the other. Given that such regions have been identified as sites with the potential for effects on gene expression and replication stress linked to formation of alternative DNA secondary structures, it is of interest to determine the influence that these base analogues might have on the stability of structures of this kind. Here we report that incorporation of GuaUre-dR into an i-motif-forming sequence decreases both the thermal and pH stability of an i-motif despite the apparent ability of GuaUre-dR to base pair with cytosine

    Prediction of DNA i-motifs via machine learning

    Get PDF
    i-Motifs (iMs), are secondary structures formed in cytosine-rich DNA sequences and are involved in multiple functions in the genome. Although putative iM forming sequences are widely distributed in the human genome, the folding status and strength of putative iMs vary dramatically. Much previous research on iM has focused on assessing the iM folding properties using biophysical experiments. However, there are no dedicated computational tools for predicting the folding status and strength of iM structures. Here, we introduce a machine learning pipeline, iM-Seeker, to predict both folding status and structural stability of DNA iMs. The programme iM-Seeker incorporates a Balanced Random Forest classifier trained on genome-wide iMab antibody-based CUT&Tag sequencing data to predict the folding status and an Extreme Gradient Boosting regressor to estimate the folding strength according to both literature biophysical data and our in-house biophysical experiments. iM-Seeker predicts DNA iM folding status with a classification accuracy of 81% and estimates the folding strength with coefficient of determination (R2) of 0.642 on the test set. Model interpretation confirms that the nucleotide composition of the C-rich sequence significantly affects iM stability, with a positive correlation with sequences containing cytosine and thymine and a negative correlation with guanine and adenine

    Epigenetic modification of cytosines fine tunes the stability of i-motif DNA

    Get PDF
    i-Motifs are widely used in nanotechnology, play a part in gene regulation and have been detected in human nuclei. As these structures are composed of cytosine, they are potential sites for epigenetic modification. In addition to 5-methyl- and 5-hydroxymethylcytosine modifications, recent evidence has suggested biological roles for 5-formylcytosine and 5-carboxylcytosine. Herein the human telomeric i-motif sequence was used to examine how these four epigenetic modifications alter the thermal and pH stability of i-motifs. Changes in melting temperature and transitional pH depended on both the type of modification and its position within the i-motif forming sequence. The cytosines most sensitive to modification were next to the first and third loops within the structure. Using previously described i-motif forming sequences, we screened the MCF-7 and MCF-10A methylomes to map 5-methylcytosine and found the majority of sequences were differentially methylated in MCF7 (cancerous) and MCF10A (non-cancerous) cell lines. Furthermore, i-motif forming sequences stable at neutral pH were significantly more likely to be epigenetically modified than traditional acidic i-motif forming sequences. This work has implications not only in the epigenetic regulation of DNA, but also allows discreet tunability of i-motif stability for nanotechnological applications
    corecore