21 research outputs found

    Number of CISs per Scale Parameter

    No full text
    <p>Number of CISs for Various Scale Parameters (Corrected and Uncorrected), the csCISs, the Background-Corrected csCISs, and the CISs from the RTCGD. Background correction only has effect at larger scales.</p

    The <i>Myc</i> Locus on Chromosome 15

    No full text
    <div><p>(A) The blue line represents the estimated number of insertions as a function of position for a certain region. The red line depicts the threshold associated with an <i>α</i>-level of 0.05.</p><p>(B) CISs are depicted by means of vertical lines. From top to bottom these represent: the CISs for the current scale (30k), the csCISs, the CISs from the RTCGD, the insertions, and the genes (top and bottom strand separated).</p><p>(C) Scale space diagram. The vertical axes of the scale space has a logarithmic scale and indicates the scale for which the CIS was detected (only a subset of scales was actually evaluated: [50 100 250 500 1 k 2.5 k 5 k 10 k 30 k 50 k 100 k 150 k] bp).</p><p>(D) Evaluation of the insertion distribution over four small scale CISs, identified by scale space analysis. Per screen we list the number of insertions that fall within the small scale CIS. The screens are labeled consistent with RTCGD nomenclature.</p></div

    Results from Simulation Experiments—True and False Positives

    No full text
    <div><p>(A,B) Results for the GKC applied to artificial data.</p><p>(C,D) Results for the TKC.</p><p>(E,F) Results for the RKC.</p><p>The horizontal solid lines in (A), (C), and (E) show the 5% significance level, the dotted lines show the average number of csFPs. The legend shows the different simulated CISs, stating the number of insertions <i>N</i><sub>CIS</sub> that fall within the CIS of width <i>W</i><sub>CIS</sub>.</p></div

    Schematic Depiction of the Significance Analysis of the Density Estimate of the Insertion Data

    No full text
    <div><p>(A) The position of the <i>N</i> insertions is permuted.</p><p>(B) The convolution method is applied to the resulting permuted insertion profile. The heights of all peaks are recorded.</p><p>(C) Step A and B are repeated. A distribution of the peaks in random data results.</p><p>(D) The threshold is computed by determining the <i>α</i>-level in the empirical CDF of the null-distribution. This threshold is applied to the insertion estimate of the real insertion data, resulting in a series of significant peaks.</p></div

    Schematic Depiction of the Kernel Convolution Framework

    No full text
    <p>The insertions are convolved with a kernel function with a width determined by the scale parameter. In principle any kernel function can be used, but the Gaussian kernel function is depicted. The significance of the peaks is evaluated using a null-distribution computed by means of a random permutation of the data. This is done for a range of scale parameters to obtain the CISs in the scale space.</p

    Results from Simulation Experiments—Cross Scale True Positives and Deviations from CIS Center

    No full text
    <div><p>(A) Average number of csTPs per artificially generated CIS. Significant errors are made for the borderline cases: the narrow CISs (500 bp), or broad CISs (80k bp) with relatively few insertions. The GKC outperforms the RKC and TKC for all simulated CISs.</p><p>(B) Average deviation of the detected CIS center from the actual simulated CIS center normalized on the simulated CIS width plus the scale parameter under consideration.</p></div

    Schematic Depiction of the Computation of a Background-Corrected Threshold

    No full text
    <div><p>(A) The density of TSSs (the 5′ ends of the genes) is computed using a fixed kernel width <i>h</i><sub>bg</sub>.</p><p>(B) A new realization of insertions is generated using the density from step A.</p><p>(C) The GKC method is applied to the resulting insertion profile, yielding one realization of the background density estimate. Steps (A) and (B) and applying the GKC are repeated N times to yield a distribution of background realizations. For every position on the genome, a CDF of these realizations is computed and the threshold is determined based on the <i>α</i>-level.</p><p>(D) The location-dependent threshold is combined with the threshold based on uniform background. Finally, the smoothed insertion estimate of the real data is thresholded with the resulting threshold.</p></div

    Example of Novel CIS and Background Corrected CIS

    No full text
    <div><p>(A) Venn diagram comparing the csCISs and the CISs in the RTCGD. For reasons explained in <a href="http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.0020166#pcbi-0020166-g007" target="_blank">Figure 7</a>, the intersection shows two counts.</p><p>(B) An example of a CIS that consists of three insertions from three independent screens, and therefore is only detected when integrally analyzing the data.</p><p>(C) Venn diagram comparing the csCISs with and without applying background correction.</p><p>(D) An example of a csCIS, that was also included in the RTCGD, and is rejected based on the background-corrected threshold. The small vertical bars (red) in the genes denote the 5′ ends of genes, and a star denotes a corrected CIS. Since we are only interested in correcting regions that are putative CISs, a background-corrected threshold is only computed for peaks in the estimated number of insertions. The corrected threshold is given by the horizontal dotted line above the peak.</p></div

    Comparison with Previous CIS Definition

    No full text
    <div><p>(A) Plot of the increase of the error as a function of the screen size, when using the definition from [<a href="http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.0020166#pcbi-0020166-b008" target="_blank">8</a>], computed using the Poisson distribution, or a permutation approach. Also the results from the two individual windows used are given. Since the errors made by the two windows individually are not mutually exclusive, the Poisson estimate is an overestimate of the true error.</p><p>(B) Venn diagram comparing three different CIS definitions: a) the definition from [<a href="http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.0020166#pcbi-0020166-b008" target="_blank">8</a>] applied to the complete dataset, b) the csCISs resulting from the GKC, and c) the published CISs from the RTCGD. The intersection between sets shows three counts (and corresponding percentages), indicating the count for set a, b, and c, respectively. This is because the three sets of CISs used different definitions (at different scales) for a CIS, so that some CISs are split up, and hence are counted twice.</p></div
    corecore