26 research outputs found

    Average prediction profiles around occurrences of the pattern GGGGCGGGG/CCCCGCCCC.

    No full text
    <p>Each curve corresponds to the average prediction profiles around the occurrences of the pattern in GC-rich remodeled LNFRs of resting T cells. The different curves correspond to different choices of .</p

    Occurrences of certain patterns in remodeled vs. non-remodeled GC-rich LNFRs of resting T cells.

    No full text
    <p>Columns 2 and 3 provide the percentages of GC-rich resting LNFRs outside promoters in which the patterns occur. Column 4 provides the -value of Fisher's exact test for enrichment of the patterns in remodeled LNFRs. Columns 5–7 are analogous to columns 2–4, but the percentages and -values are computed for all GC-rich resting LNFRs, both inside and outside promoters. The percentages in the first row are not the exact sums of percentages in rows 2 and 3 because there are LNFR sequences that contain both patterns GGGGCGGGG/CCCCGCCCC and GGGGTGGGG/CCCCACCCC.</p

    Genome-Wide Chromatin Remodeling Identified at GC-Rich Long Nucleosome-Free Regions

    Get PDF
    <div><p>To gain deeper insights into principles of cell biology, it is essential to understand how cells reorganize their genomes by chromatin remodeling. We analyzed chromatin remodeling on next generation sequencing data from resting and activated T cells to determine a whole-genome chromatin remodeling landscape. We consider chromatin remodeling in terms of nucleosome repositioning which can be observed most robustly in long nucleosome-free regions (LNFRs) that are occupied by nucleosomes in another cell state. We found that LNFR sequences are either AT-rich or GC-rich, where nucleosome repositioning was observed much more prominently in GC-rich LNFRs — a considerable proportion of them outside promoter regions. Using support vector machines with string kernels, we identified a GC-rich DNA sequence pattern indicating loci of nucleosome repositioning in resting T cells. This pattern appears to be also typical for CpG islands. We found out that nucleosome repositioning in GC-rich LNFRs is indeed associated with CpG islands and with binding sites of the CpG-island-binding ZF-CXXC proteins KDM2A and CFP1. That this association occurs prominently inside and also prominently outside of promoter regions hints at a mechanism governing nucleosome repositioning that acts on a whole-genome scale.</p> </div

    Classification performance.

    No full text
    <p>The percentages are two-fold cross validation accuracies for the classification of LNFRs showing remodeling versus randomly selected non-LNFR DNA sequences from the human genome. Altogether four sets of remodeled LNFRs were considered: GC-rich and AT-rich remodeled LNFRs, both for resting and activated T cells. Each row corresponds to one choice of , the sub-sequence length parameter of the spectrum kernel. GC-rich remodeling LNFRs (third and fifth column) can be classified with higher accuracy than AT-rich remodeling LNFRs (second and fourth column). The highest accuracy for GC-rich remodeling LNFRs in resting T cells is 68.6% and is achieved for .</p

    Sequence logo of remodeling pattern for both DNA strands.

    No full text
    <p>The pattern was identified by applying MEME to the regions of interest obtained from the prediction profiles computed by the SVM with the spectrum kernel with .</p

    Maximum nucleosome occupancy scores over masked LNFRs.

    No full text
    <p>Panel <b>A</b> shows a histogram of maximum nucleosome occupancy scores in activated T cells over masked (i.e. without the first and the last 25 bp) LNFRs in resting state. Panel <b>B</b> shows a histogram of maximum nucleosome occupancy scores in resting T cells over masked LNFRs in activated state. AT-rich LNFRs generally exhibit lower nucleosome coverage than GC-rich LNFRs, which indicates that GC-rich LNFRs show a stronger remodeling tendency than AT-rich LNFRs.</p

    GC content distributions of LNFRs versus random fragments from the human genome.

    No full text
    <p>GC content distributions of LNFRs in resting T cells (<b>A</b>) and activated T cells (<b>B</b>) compared to the GC content distribution of fragments drawn randomly from the human genome (<b>C</b>). For both resting and activated T cells, the LNFRs are divided into two groups, an AT-rich and a GC-rich one. For resting T cells, the two groups are very clearly separated by a GC content threshold of about 60%, while this threshold is at 50% for activated T cells.</p

    Proportion of LNFRs overlapping with CpG islands plotted versus the LNFRs' GC content.

    No full text
    <p>Each curve plots the proportion of LNFRs overlapping with CpG islands in relation to the GC content of the considered LNFRs. The plot in panel <b>A</b> shows data for LNFRs of resting T cells, while the plot in panel <b>B</b> shows data for LNFRs of activated T cells. For resting T cells, a clear difference between remodeled and non-remodeled LNFRs is visible, both if we consider all LNFRs and if we restrict to non-promoter LNFRs. The proportions of overlaps of LNFRs of activated T cells with CpG islands are generally lower and no clear difference is visible between remodeled and non-remodeled LNFRs.</p

    Classification performance versus sub-sequence length.

    No full text
    <p>Two-fold cross validation accuracies for the classification of GC-rich LNFRs showing remodeling versus randomly selected non-LNFR sequences for different choices of , the sub-sequence length parameter of the spectrum kernel. For resting T cells, the accuracy peaks at , while 's between 4 and 6 are best for classifying GC-rich remodeling LNFRs in activated T cells.</p

    Overlaps with CpG islands and binding site peaks of remodeled vs. non-remodeled GC-rich LNFRs of resting T cells.

    No full text
    <p>Columns 2 and 3 provide the percentages of GC-rich resting LNFRs outside promoters that overlap with CpG islands or the binding site peaks under consideration. Column 4 provides the -value of Fisher's exact test for enrichment of overlaps in remodeled LNFRs, except for value typeset in italics, which correspond to -values of Fisher's exact test for enrichment of overlaps in non-remodeled sequences. Columns 5–7 are analogous to columns 2–4, but the all percentages and -values are computed for all GC-rich LNFRs, no matter whether inside or outside promoters. The row “Pol II (A)” refers to the Pol II ChIP-seq data set published by Barski et al. <a href="http://www.plosone.org/article/info:doi/10.1371/journal.pone.0047924#pone.0047924-Barski1" target="_blank">[46]</a>, while “Pol II (B)” refers to the Pol II ChIP-seq data published by Schones et al. <a href="http://www.plosone.org/article/info:doi/10.1371/journal.pone.0047924#pone.0047924-Schones1" target="_blank">[28]</a>.</p
    corecore