8 research outputs found

    CRISPR Recognition Tool (CRT): a tool for automatic detection of clustered regularly interspaced palindromic repeats

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Clustered Regularly Interspaced Palindromic Repeats (CRISPRs) are a novel type of direct repeat found in a wide range of bacteria and archaea. CRISPRs are beginning to attract attention because of their proposed mechanism; that is, defending their hosts against invading extrachromosomal elements such as viruses. Existing repeat detection tools do a poor job of identifying CRISPRs due to the presence of unique spacer sequences separating the repeats. In this study, a new tool, CRT, is introduced that rapidly and accurately identifies CRISPRs in large DNA strings, such as genomes and metagenomes.</p> <p>Results</p> <p>CRT was compared to CRISPR detection tools, Patscan and Pilercr. In terms of correctness, CRT was shown to be very reliable, demonstrating significant improvements over Patscan for measures precision, recall and quality. When compared to Pilercr, CRT showed improved performance for recall and quality. In terms of speed, CRT proved to be a huge improvement over Patscan. Both CRT and Pilercr were comparable in speed, however CRT was faster for genomes containing large numbers of repeats.</p> <p>Conclusion</p> <p>In this paper a new tool was introduced for the automatic detection of CRISPR elements. This tool, CRT, showed some important improvements over current techniques for CRISPR identification. CRT's approach to detecting repetitive sequences is straightforward. It uses a simple sequential scan of a DNA sequence and detects repeats directly without any major conversion or preprocessing of the input. This leads to a program that is easy to describe and understand; yet it is very accurate, fast and memory efficient, being O(<it>n</it>) in space and O(<it>nm</it>/<it>l</it>) in time.</p

    CRISPR Recognition Tool (CRT): a tool for automatic detection of clustered regularly interspaced palindromic repeats-3

    No full text
    <p><b>Copyright information:</b></p><p>Taken from "CRISPR Recognition Tool (CRT): a tool for automatic detection of clustered regularly interspaced palindromic repeats"</p><p>http://www.biomedcentral.com/1471-2105/8/209</p><p>BMC Bioinformatics 2007;8():209-209.</p><p>Published online 18 Jun 2007</p><p>PMCID:PMC1924867.</p><p></p>r windows size 6 and once for window size 8. The y-axis represents time in seconds. The x-axis lists the genome accession numbers, followed by the number of repeats detected in the genome. As the size of the genomes increase, it can bee seen that running times of the search tools increase at different rates. Below, the corresponding organism names and the number of CRISPR loci are given. All genomes are close in size (2.7 – 3.8 Mbp). [IMG:BA000031] Vibrio parahaemolyticus (serovar O3:K6, strain RIMD 2210633) loci: 0 [IMG:CR628337] Legionella pneumophila (strain Lens) loci: 2 [IMG:AP006840] Symbiobacterium thermophilum (strain IAM 14863/T) loci: 3 [IMG:AE017180] Geobacter sulfurreducens (strain ATCC 51573/PCA) loci: 2 [IMG:AE008691] Thermoanaerobacter tengcongensis (strain MB4/JCM 11007) loci: 3 [IMG:AE006641] Sulfolobus solfataricus P2 loci: 7 [IMG:BA000023] Sulfolobus tokodaii str. 7 DNA loci: 7

    CRISPR Recognition Tool (CRT): a tool for automatic detection of clustered regularly interspaced palindromic repeats-1

    No full text
    <p><b>Copyright information:</b></p><p>Taken from "CRISPR Recognition Tool (CRT): a tool for automatic detection of clustered regularly interspaced palindromic repeats"</p><p>http://www.biomedcentral.com/1471-2105/8/209</p><p>BMC Bioinformatics 2007;8():209-209.</p><p>Published online 18 Jun 2007</p><p>PMCID:PMC1924867.</p><p></p>ure 2. However, by removing Patscan, a better comparison of the execution speeds of PilerCR and CRT can be achieved

    CRISPR Recognition Tool (CRT): a tool for automatic detection of clustered regularly interspaced palindromic repeats-6

    No full text
    <p><b>Copyright information:</b></p><p>Taken from "CRISPR Recognition Tool (CRT): a tool for automatic detection of clustered regularly interspaced palindromic repeats"</p><p>http://www.biomedcentral.com/1471-2105/8/209</p><p>BMC Bioinformatics 2007;8():209-209.</p><p>Published online 18 Jun 2007</p><p>PMCID:PMC1924867.</p><p></p>6 and once for window size 8). This figure is the same as Figure 3, except the ranges of the repeat length and spacer length to be detected are increased

    CRISPR Recognition Tool (CRT): a tool for automatic detection of clustered regularly interspaced palindromic repeats-4

    No full text
    <p><b>Copyright information:</b></p><p>Taken from "CRISPR Recognition Tool (CRT): a tool for automatic detection of clustered regularly interspaced palindromic repeats"</p><p>http://www.biomedcentral.com/1471-2105/8/209</p><p>BMC Bioinformatics 2007;8():209-209.</p><p>Published online 18 Jun 2007</p><p>PMCID:PMC1924867.</p><p></p> and once for window size 8). The y-axis represents time in seconds. The x-axis lists the genome accession numbers, followed by their sizes in million base pairs (Mbp). As the size of the genomes increase, it can be seen that running times of the search tools increase at different rates. Below, the corresponding organism names are given. [IMG:AE015450] Mycoplasma gallisepticum (strain R(low)) [IMG:AE004439] Pasteurella multocida (strain Pm70) [IMG:AE017282] Methylococcus capsulatus (strain Bath/NCIMB 11132) [IMG:AP006627] Bacillus clausii (strain KSM-K16) [IMG:BX470251] Photorhabdus luminescens (subsp. laumondii, strain TT01)

    CRISPR Recognition Tool (CRT): a tool for automatic detection of clustered regularly interspaced palindromic repeats-0

    No full text
    <p><b>Copyright information:</b></p><p>Taken from "CRISPR Recognition Tool (CRT): a tool for automatic detection of clustered regularly interspaced palindromic repeats"</p><p>http://www.biomedcentral.com/1471-2105/8/209</p><p>BMC Bioinformatics 2007;8():209-209.</p><p>Published online 18 Jun 2007</p><p>PMCID:PMC1924867.</p><p></p> and once for window size 8). The y-axis represents time in seconds. The x-axis lists the genome accession numbers, followed by their sizes in million base pairs (Mbp). As the size of the genomes increase, it can be seen that running times of the search tools increase at different rates. Below, the corresponding organism names are given. [IMG:AE015450] Mycoplasma gallisepticum (strain R(low)) [IMG:AE004439] Pasteurella multocida (strain Pm70) [IMG:AE017282] Methylococcus capsulatus (strain Bath/NCIMB 11132) [IMG:AP006627] Bacillus clausii (strain KSM-K16) [IMG:BX470251] Photorhabdus luminescens (subsp. laumondii, strain TT01)

    CRISPR Recognition Tool (CRT): a tool for automatic detection of clustered regularly interspaced palindromic repeats-5

    No full text
    <p><b>Copyright information:</b></p><p>Taken from "CRISPR Recognition Tool (CRT): a tool for automatic detection of clustered regularly interspaced palindromic repeats"</p><p>http://www.biomedcentral.com/1471-2105/8/209</p><p>BMC Bioinformatics 2007;8():209-209.</p><p>Published online 18 Jun 2007</p><p>PMCID:PMC1924867.</p><p></p>ure 2. However, by removing Patscan, a better comparison of the execution speeds of PilerCR and CRT can be achieved

    CRISPR Recognition Tool (CRT): a tool for automatic detection of clustered regularly interspaced palindromic repeats-7

    No full text
    <p><b>Copyright information:</b></p><p>Taken from "CRISPR Recognition Tool (CRT): a tool for automatic detection of clustered regularly interspaced palindromic repeats"</p><p>http://www.biomedcentral.com/1471-2105/8/209</p><p>BMC Bioinformatics 2007;8():209-209.</p><p>Published online 18 Jun 2007</p><p>PMCID:PMC1924867.</p><p></p>r windows size 6 and once for window size 8. The y-axis represents time in seconds. The x-axis lists the genome accession numbers, followed by the number of repeats detected in the genome. As the size of the genomes increase, it can bee seen that running times of the search tools increase at different rates. Below, the corresponding organism names and the number of CRISPR loci are given. All genomes are close in size (2.7 – 3.8 Mbp). [IMG:BA000031] Vibrio parahaemolyticus (serovar O3:K6, strain RIMD 2210633) loci: 0 [IMG:CR628337] Legionella pneumophila (strain Lens) loci: 2 [IMG:AP006840] Symbiobacterium thermophilum (strain IAM 14863/T) loci: 3 [IMG:AE017180] Geobacter sulfurreducens (strain ATCC 51573/PCA) loci: 2 [IMG:AE008691] Thermoanaerobacter tengcongensis (strain MB4/JCM 11007) loci: 3 [IMG:AE006641] Sulfolobus solfataricus P2 loci: 7 [IMG:BA000023] Sulfolobus tokodaii str. 7 DNA loci: 7
    corecore