14 research outputs found

    Benchmark soil metagenome data sets for k-mer counting performance, taken from [11].

    No full text

    Low-memory digital normalization.

    No full text
    The results of digitally normalizing a 5 million read E. coli data set (1.4 GB) to C = 20 with k = 20 at several memory usage levels and false positive rates. The false positive rate (column 1) is empirically determined. We measured the reads remaining, the number of “true” k-mers missing from the data at each step, and the total number of k-mers remaining. Note: at high false positive rates, reads are erroneously removed due to inflation of k-mer counts.

    Memory usage of k-mer counting tools when calculating k-mer abundance histograms, with maximum resident program size (y axis, in GB) plotted against the total number of distinct k-mers in the data set (x axis, billions of k-mers).

    No full text

    Iterative low-memory k-mer trimming.

    No full text
    The results of trimming reads at unique (erroneous) k-mers from a 5 million read E. coli data set (1.4 GB) in under 30 MB of RAM. After each iteration, we measured the total number of distinct k-mers in the data set, the total number of unique (and likely erroneous) k-mers remaining, and the number of unique k-mers present at the 3' end of reads.

    E. coli genome assembly after low-memory digital normalization.

    No full text
    A comparison of assemblies built from reads digitally normalized with low memory and high false positive rates. The reads were digitally normalized to C = 20 (see [21], http://www.plosone.org/article/info:doi/10.1371/journal.pone.0101271#pone.0101271-Brown1, for more information) and were assembled using Velvet. Using QUAST, we measured the total length of the assembly, as well as the percentage of the true MG1655 genome covered by the assembly.
