168,674 research outputs found

    Using multiple GPUs to accelerate string searching for digital forensic analysis

    String searching within a large corpus of data is an important component of digital forensic (DF) analysis techniques such as file carving. The continuing increase in capacity of consumer storage devices requires corresponding improvements to the performance of string searching techniques. As string searching is a trivially-parallelisable problem, GPGPU approaches are a natural fit – but previous studies have found that local storage presents an insurmountable performance bottleneck. We show that this need not be the case with modern hardware, and demonstrate substantial performance improvements from the use of single and multiple GPUs when searching for strings within a typical forensic disk image.
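
The "trivially-parallelisable" claim in the abstract above can be illustrated with a minimal sketch. This is not the authors' GPU implementation: it uses CPU threads as a stand-in for GPU workers, and the names `find_all`, `parallel_search`, `chunk_size`, and `workers` are hypothetical. The one subtlety it shows is that each chunk's search window must extend `len(pattern) - 1` bytes past the chunk boundary so that matches straddling two chunks are not lost.

```python
from concurrent.futures import ThreadPoolExecutor


def find_all(data: bytes, pattern: bytes, start: int, end: int) -> list[int]:
    """Return every offset i with start <= i < end where pattern begins."""
    hits = []
    # Let the window run len(pattern) - 1 bytes past `end` so that a match
    # starting inside this chunk but finishing in the next one is still seen.
    limit = min(len(data), end + len(pattern) - 1)
    i = data.find(pattern, start, limit)
    while i != -1:
        hits.append(i)
        i = data.find(pattern, i + 1, limit)
    return hits


def parallel_search(data: bytes, pattern: bytes,
                    chunk_size: int = 1 << 20, workers: int = 4) -> list[int]:
    """Search fixed-size chunks independently and merge the offsets in order."""
    if not pattern:
        raise ValueError("pattern must be non-empty")
    bounds = [(s, min(s + chunk_size, len(data)))
              for s in range(0, len(data), chunk_size)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        parts = pool.map(lambda b: find_all(data, pattern, b[0], b[1]), bounds)
        # Chunks are processed in ascending order, so the merged list is sorted.
        return [offset for part in parts for offset in part]
```

Because each match is attributed to the chunk in which it starts, no offset is reported twice even though neighbouring search windows overlap.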

    siEDM: an efficient string index and search algorithm for edit distance with moves

    Although several self-indexes for highly repetitive text collections exist, developing an index and search algorithm with editing operations remains a challenge. Edit distance with moves (EDM) is a string-to-string distance measure that includes substring moves in addition to ordinal editing operations to turn one string into another. Although the problem of computing EDM is intractable, it has a wide range of potential applications, especially in approximate string retrieval. Despite the importance of computing EDM, there has been no efficient method for indexing and searching large text collections based on the EDM measure. We propose the first algorithm, named string index for edit distance with moves (siEDM), for indexing and searching strings with EDM. The siEDM algorithm builds an index structure by leveraging the idea behind the edit sensitive parsing (ESP), an efficient algorithm enabling approximately computing EDM with guarantees of upper and lower bounds for the exact EDM. siEDM efficiently prunes the space for searching query strings by the proposed method, which enables fast query searches with the same guarantee as ESP. We experimentally tested the ability of siEDM to index and search strings on benchmark datasets, and we showed siEDM's efficiency. Comment: 23 pages
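
For orientation, the ordinary editing operations that EDM extends (insertion, deletion, substitution) define the classic edit distance, which can be sketched as below. This is only the standard dynamic-programming baseline, not siEDM or ESP, and it does not support substring moves; the function name `edit_distance` is hypothetical.

```python
def edit_distance(a: str, b: str) -> int:
    """Classic edit distance (insert, delete, substitute), without moves."""
    # prev[j] holds the distance between the processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]
```

Under this measure, relocating a substring has to be paid for with several single-character edits, which is the gap that a move operation is meant to close.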

    A new problem in string searching

    We describe a substring search problem that arises in group presentation simplification processes. We suggest a two-level searching model: skip and match levels. We give two timestamp algorithms which skip searching parts of the text where there are no matches at all and prove their correctness. At the match level, we consider Harrison signature, Karp-Rabin fingerprint, Bloom filter and automata based matching algorithms and present experimental performance figures. Comment: To appear in Proceedings Fifth Annual International Symposium on Algorithms and Computation (ISAAC'94), Lecture Notes in Computer Science
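
Of the match-level techniques named in the abstract above, the Karp-Rabin fingerprint is the simplest to sketch. The version below is the generic textbook rolling-hash matcher, not the paper's two-level skip/match scheme; the function name `karp_rabin` and the `base` and `mod` defaults are assumptions chosen for illustration.

```python
def karp_rabin(text: str, pattern: str,
               base: int = 256, mod: int = 1_000_000_007) -> list[int]:
    """Return all start offsets of pattern in text using a rolling hash."""
    n, m = len(text), len(pattern)
    if m == 0 or m > n:
        return []
    high = pow(base, m - 1, mod)  # weight of the window's leading character
    p_hash = t_hash = 0
    for i in range(m):
        p_hash = (p_hash * base + ord(pattern[i])) % mod
        t_hash = (t_hash * base + ord(text[i])) % mod
    hits = []
    for i in range(n - m + 1):
        # Equal fingerprints can be a collision, so confirm with a comparison.
        if t_hash == p_hash and text[i:i + m] == pattern:
            hits.append(i)
        if i < n - m:
            # Slide the window: drop text[i], append text[i + m].
            t_hash = ((t_hash - ord(text[i]) * high) * base
                      + ord(text[i + m])) % mod
    return hits
```

The fingerprint comparison filters out most windows in constant time each; the explicit string comparison runs only on fingerprint hits, which keeps the result exact.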

    Tree-Level Stability Without Spacetime Fermions: Novel Examples in String Theory

    Is perturbative stability intimately tied with the existence of spacetime fermions in string theory in more than two dimensions? Type 0'B string theory in ten-dimensional flat space is a rare example of a non-tachyonic, non-supersymmetric string theory with a purely bosonic closed string spectrum. However, all known type 0' constructions exhibit massless NSNS tadpoles signaling the fact that we are not expanding around a true vacuum of the theory. In this note, we are searching for perturbatively stable examples of type 0' string theory without massless tadpoles in backgrounds with a spatially varying dilaton. We present two examples with this property in non-critical string theories that exhibit four- and six-dimensional Poincare invariance. We discuss the D-branes that can be embedded in this context and the type of gauge theories that can be constructed in this manner. We also comment on the embedding of these non-critical models in critical string theories and their holographic (Little String Theory) interpretation and propose a general conjecture for the role of asymptotic supersymmetry in perturbative string theory. Comment: harvmac, 29 pages; v2 minor changes, version to appear in JHEP

    Analysis of string-searching algorithms on biological sequence databases

    String-searching algorithms are used to find the occurrences of a search string in a given text. The advent of digital computers has stimulated the development of string-searching algorithms for various applications. Here, we report the performance of all string-searching algorithms on widely used biological sequence databases containing the building blocks of nucleotides (in the case of nucleic acid sequence database) and amino acids (in the case of protein sequence database). The biological sequence databases used in the present study are Protein Information Resource (PIR), SWISSPROT, and amino acid and nucleotide sequences of all genomes available in the genome database. The average time taken for different search-string lengths considered for study has been taken as an indicator of performance for comparison between various methods.