1,470 research outputs found
About Adaptive Coding on Countable Alphabets: Max-Stable Envelope Classes
In this paper, we study the problem of lossless universal source coding for
stationary memoryless sources on countably infinite alphabets. This task is
generally not achievable without restricting the class of sources over which
universality is desired. Building on our prior work, we propose natural
families of sources characterized by a common dominating envelope. We
particularly emphasize the notion of adaptivity, which is the ability to
perform as well as an oracle knowing the envelope, without actually knowing it.
This is closely related to the notion of hierarchical universal source coding,
but with the important difference that families of envelope classes are not
discretely indexed and not necessarily nested.
Our contribution is to extend the classes of envelopes over which adaptive
universal source coding is possible, namely by including max-stable
(heavy-tailed) envelopes which are excellent models in many applications, such
as natural language modeling. We derive a minimax lower bound on the redundancy
of any code on such envelope classes, including an oracle that knows the
envelope. We then propose a constructive code that does not use knowledge of
the envelope. The code is computationally efficient and is structured to use an
{E}xpanding {T}hreshold for {A}uto-{C}ensoring, and we therefore dub it the
\textsc{ETAC}-code. We prove that the \textsc{ETAC}-code achieves the lower
bound on the minimax redundancy within a factor logarithmic in the sequence
length, and can be therefore qualified as a near-adaptive code over families of
heavy-tailed envelopes. For finite and light-tailed envelopes the penalty is
even less, and the same code follows closely previous results that explicitly
made the light-tailed assumption. Our technical results are founded on methods
from regular variation theory and concentration of measure
Change Point Methods on a Sequence of Graphs
Given a finite sequence of graphs, e.g., coming from technological,
biological, and social networks, the paper proposes a methodology to identify
possible changes in stationarity in the stochastic process generating the
graphs. In order to cover a large class of applications, we consider the
general family of attributed graphs where both topology (number of vertexes and
edge configuration) and related attributes are allowed to change also in the
stationary case. Novel Change Point Methods (CPMs) are proposed, that (i) map
graphs into a vector domain; (ii) apply a suitable statistical test in the
vector space; (iii) detect the change --if any-- according to a confidence
level and provide an estimate for its time occurrence. Two specific
multivariate CPMs have been designed: one that detects shifts in the
distribution mean, the other addressing generic changes affecting the
distribution. We ground our proposal with theoretical results showing how to
relate the inference attained in the numerical vector space to the graph
domain, and vice versa. We also show how to extend the methodology for handling
multiple change points in the same sequence. Finally, the proposed CPMs have
been validated on real data sets coming from epileptic-seizure detection
problems and on labeled data sets for graph classification. Results show the
effectiveness of what proposed in relevant application scenarios
Asymptotic Hyperfunctions, Tempered Hyperfunctions, and Asymptotic Expansions
We introduce new subclasses of Fourier hyperfunctions of mixed type,
satisfying polynomial growth conditions at infinity, and develop their sheaf
and duality theory. We use Fourier transformation and duality to examine
relations of these 'asymptotic' and 'tempered' hyperfunctions to known classes
of test functions and distributions, especially the Gelfand-Shilov-Spaces.
Further it is shown that the asymptotic hyperfunctions, which decay faster than
any negative power, are precisely the class that allow asymptotic expansions at
infinity. These asymptotic expansions are carried over to the
higher-dimensional case by applying the Radon transformation for
hyperfunctions.Comment: 31 pages, 1 figure, typos corrected, references adde
Computing the Similarity Between Moving Curves
In this paper we study similarity measures for moving curves which can, for
example, model changing coastlines or retreating glacier termini. Points on a
moving curve have two parameters, namely the position along the curve as well
as time. We therefore focus on similarity measures for surfaces, specifically
the Fr\'echet distance between surfaces. While the Fr\'echet distance between
surfaces is not even known to be computable, we show for variants arising in
the context of moving curves that they are polynomial-time solvable or
NP-complete depending on the restrictions imposed on how the moving curves are
matched. We achieve the polynomial-time solutions by a novel approach for
computing a surface in the so-called free-space diagram based on max-flow
min-cut duality
- …