5,698 research outputs found
Incoherent dictionaries and the statistical restricted isometry property
In this article we present a statistical version of the Candes-Tao restricted
isometry property (SRIP for short) which holds in general for any incoherent
dictionary which is a disjoint union of orthonormal bases. In addition, under
appropriate normalization, the eigenvalues of the associated Gram matrix
fluctuate around 1 according to the Wigner semicircle distribution. The result
is then applied to various dictionaries that arise naturally in the setting of
finite harmonic analysis, giving, in particular, a better understanding on a
remark of Applebaum-Howard-Searle-Calderbank concerning RIP for the Heisenberg
dictionary of chirp like functions.Comment: Key words: Incoherent dictionaries, statistical version of Candes -
Tao RIP, Semi-Circle law, deterministic constructions, Heisenberg-Weil
representatio
Compressed Sensing and Redundant Dictionaries
This article extends the concept of compressed sensing to signals that are
not sparse in an orthonormal basis but rather in a redundant dictionary. It is
shown that a matrix, which is a composition of a random matrix of certain type
and a deterministic dictionary, has small restricted isometry constants. Thus,
signals that are sparse with respect to the dictionary can be recovered via
Basis Pursuit from a small number of random measurements. Further, thresholding
is investigated as recovery algorithm for compressed sensing and conditions are
provided that guarantee reconstruction with high probability. The different
schemes are compared by numerical experiments.Comment: error in a constant correcte
c-trie++: A Dynamic Trie Tailored for Fast Prefix Searches
Given a dynamic set of strings of total length whose characters
are drawn from an alphabet of size , a keyword dictionary is a data
structure built on that provides locate, prefix search, and update
operations on . Under the assumption that
characters fit into a single machine word , we propose a keyword dictionary
that represents in bits of space,
supporting all operations in expected time on an
input string of length in the word RAM model. This data structure is
underlined with an exhaustive practical evaluation, highlighting the practical
usefulness of the proposed data structure, especially for prefix searches - one
of the most elementary keyword dictionary operations
Dynamic Ordered Sets with Exponential Search Trees
We introduce exponential search trees as a novel technique for converting
static polynomial space search structures for ordered sets into fully-dynamic
linear space data structures.
This leads to an optimal bound of O(sqrt(log n/loglog n)) for searching and
updating a dynamic set of n integer keys in linear space. Here searching an
integer y means finding the maximum key in the set which is smaller than or
equal to y. This problem is equivalent to the standard text book problem of
maintaining an ordered set (see, e.g., Cormen, Leiserson, Rivest, and Stein:
Introduction to Algorithms, 2nd ed., MIT Press, 2001).
The best previous deterministic linear space bound was O(log n/loglog n) due
Fredman and Willard from STOC 1990. No better deterministic search bound was
known using polynomial space.
We also get the following worst-case linear space trade-offs between the
number n, the word length w, and the maximal key U < 2^w: O(min{loglog n+log
n/log w, (loglog n)(loglog U)/(logloglog U)}). These trade-offs are, however,
not likely to be optimal.
Our results are generalized to finger searching and string searching,
providing optimal results for both in terms of n.Comment: Revision corrects some typoes and state things better for
applications in subsequent paper
A high-performance data structure for mobile information systems
Mobile information systems can now be provided on small form-factor computers. Dictionary-based data compression extends the capabilities of systems with limited processing and memory to enable data intensive applications to be supported in such environments. The nature of judicial sentencing decisions requires that a support system provides accurate and up-to-date data and is compatible with the professional working experience of a judge. The difficulties caused by mobility and the data dependence of the decision-making process are addressed by an Internet-based architecture for collecting and distributing system data.We describe an approach to dictionary-based data compression and the structure of an information system that makes use of this technology
Universal Compressed Text Indexing
The rise of repetitive datasets has lately generated a lot of interest in
compressed self-indexes based on dictionary compression, a rich and
heterogeneous family that exploits text repetitions in different ways. For each
such compression scheme, several different indexing solutions have been
proposed in the last two decades. To date, the fastest indexes for repetitive
texts are based on the run-length compressed Burrows-Wheeler transform and on
the Compact Directed Acyclic Word Graph. The most space-efficient indexes, on
the other hand, are based on the Lempel-Ziv parsing and on grammar compression.
Indexes for more universal schemes such as collage systems and macro schemes
have not yet been proposed. Very recently, Kempa and Prezza [STOC 2018] showed
that all dictionary compressors can be interpreted as approximation algorithms
for the smallest string attractor, that is, a set of text positions capturing
all distinct substrings. Starting from this observation, in this paper we
develop the first universal compressed self-index, that is, the first indexing
data structure based on string attractors, which can therefore be built on top
of any dictionary-compressed text representation. Let be the size of a
string attractor for a text of length . Our index takes
words of space and supports locating the
occurrences of any pattern of length in
time, for any constant . This is, in particular, the first index
for general macro schemes and collage systems. Our result shows that the
relation between indexing and compression is much deeper than what was
previously thought: the simple property standing at the core of all dictionary
compressors is sufficient to support fast indexed queries.Comment: Fixed with reviewer's comment
- ā¦