168,674 research outputs found
Using multiple GPUs to accelerate string searching for digital forensic analysis
String searching within a large corpus of data is an important component of digital forensic (DF) analysis techniques such as file carving. The continuing increase in capacity of consumer storage devices requires corresponding im-provements to the performance of string searching techniques. As string search-ing is a trivially-parallelisable problem, GPGPU approaches are a natural fit – but previous studies have found that local storage presents an insurmountable performance bottleneck. We show that this need not be the case with modern hardware, and demonstrate substantial performance improvements from the use of single and multiple GPUs when searching for strings within a typical forensic disk image
siEDM: an efficient string index and search algorithm for edit distance with moves
Although several self-indexes for highly repetitive text collections exist,
developing an index and search algorithm with editing operations remains a
challenge. Edit distance with moves (EDM) is a string-to-string distance
measure that includes substring moves in addition to ordinal editing operations
to turn one string into another. Although the problem of computing EDM is
intractable, it has a wide range of potential applications, especially in
approximate string retrieval. Despite the importance of computing EDM, there
has been no efficient method for indexing and searching large text collections
based on the EDM measure. We propose the first algorithm, named string index
for edit distance with moves (siEDM), for indexing and searching strings with
EDM. The siEDM algorithm builds an index structure by leveraging the idea
behind the edit sensitive parsing (ESP), an efficient algorithm enabling
approximately computing EDM with guarantees of upper and lower bounds for the
exact EDM. siEDM efficiently prunes the space for searching query strings by
the proposed method, which enables fast query searches with the same guarantee
as ESP. We experimentally tested the ability of siEDM to index and search
strings on benchmark datasets, and we showed siEDM's efficiency.Comment: 23 page
A new problem in string searching
We describe a substring search problem that arises in group presentation
simplification processes. We suggest a two-level searching model: skip and
match levels. We give two timestamp algorithms which skip searching parts of
the text where there are no matches at all and prove their correctness. At the
match level, we consider Harrison signature, Karp-Rabin fingerprint, Bloom
filter and automata based matching algorithms and present experimental
performance figures.Comment: To appear in Proceedings Fifth Annual International Symposium on
Algorithms and Computation (ISAAC'94), Lecture Notes in Computer Scienc
Manifestations of String Theory in Astrophysical Data and at the LHC
With the advent of the LHC and the continuing influx of cosmological data,
phenomenological aspects of string theory have received renewed attention in
recent years and many problems have been properly incorporated in this
framework. In this Dissertation, we suggest new string-related candidates for
the origin of dark energy and dark matter and propose ways of searching for
string resonances at the LHC.Comment: Ph. D. Thesis, University of Wisconsin-Milwaukee, August 200
Tree-Level Stability Without Spacetime Fermions: Novel Examples in String Theory
Is perturbative stability intimately tied with the existence of spacetime
fermions in string theory in more than two dimensions? Type 0'B string theory
in ten-dimensional flat space is a rare example of a non-tachyonic,
non-supersymmetric string theory with a purely bosonic closed string spectrum.
However, all known type 0' constructions exhibit massless NSNS tadpoles
signaling the fact that we are not expanding around a true vacuum of the
theory. In this note, we are searching for perturbatively stable examples of
type 0' string theory without massless tadpoles in backgrounds with a spatially
varying dilaton. We present two examples with this property in non-critical
string theories that exhibit four- and six-dimensional Poincare invariance. We
discuss the D-branes that can be embedded in this context and the type of gauge
theories that can be constructed in this manner. We also comment on the
embedding of these non-critical models in critical string theories and their
holographic (Little String Theory) interpretation and propose a general
conjecture for the role of asymptotic supersymmetry in perturbative string
theory.Comment: harvmac, 29 pages; v2 minor changes, version to appear in JHE
Analysis of string-searching algorithms on biological sequence databases
String-searching algorithms are used to find the occurrences of a search string in a given text. The advent of digital computers has stimulated the development of string-searching algorithms for various applications. Here, we report the performance of all string-searching algorithms on widely used biological sequence databases containing the building blocks of nucleotides (in the case of nucleic acid sequence database) and amino acids (in the case of protein sequence database). The biological sequence databases used in the present study are Protein Information Resource (PIR), SWISSPROT, and amino acid and nucleotide sequences of all genomes available in the genome database. The average time taken for different search-string lengths considered for study has been taken as an indicator of performance for comparison between various methods
- …