Search CORE

47,281 research outputs found

Génération aléatoire uniforme de mots de langages rationnels

Author: Denise Alain
Publication venue: Published by Elsevier B.V.
Publication date: 28/05/1996
Field of study

RésuméNous donnons deux algorithmes de génération aléatoire et uniforme de mots, qui s'appliquent à des classes particulières de langages rationnels. Leur efficacité est mesurée en termes de complexité logarithmique, en fonction de la longueur n des mots engendrés. Le premier algorithme est dédié aux langages dont les séries génératrices possèdent un unique pôle, éventuellement multiple; sa complexité en temps est de l'ordre de n log n, et l'espace mémoire occupé est en log n. Le second algorithme est réservé aux langages dont les séries génératrices possèdent la propriété suivante: il existe un unique pôle de plus petit module, et ce pôle est simple. Après un pré-traitement en temps polynomial en n, le tirage aléatoire de tout mot s'effectue en temps moyen et espace linéaires.AbstractThe problem of generating uniformly at random words of a given language has been the subject of extensive study in the last few years. An important part of that work is devoted to the generation of words of context-free languages (see, e.g., [6, 8, 9, 12]). For a given integer n > 0, the words of length n > 0 of any unambiguous context-free language can be generated uniformly at random by using algorithms derived from the general method which was introduced by Wilf [14, 15] and systematized by Flajolet et al. [7]. Clearly, this can be applied to the set of rational languages, which constitute an important special case of context-free languages.Most authors use the uniform measure of complexity (see [1]) in order to compute the complexity of the algorithms of generation. This measure is based on the following hypotheses: any simple arithmetic operation (addition, multiplication) has time cost 0(1), and a constant amount of memory space is taken by any number. Thus, we know that words of any rational language can be generated by using an algorithm which, with respect to the uniform measure of complexity, runs in linear time (in terms of the length of the words) and constant space [9]. This measure is realistic only if there is a reasonable bound on the numbers involved in the operations. However, the classical random generation algorithms involve operations on numbers which grow exponentially in terms of the length of the words to be generated. Moreover, the programs which make use of these algorithms are generally used to generate very large words, for example for the purpose of studying the asymptotic behavior of some parameters. Therefore, the uniform measure does not reflect the real behavior of such programs. It turns out that the logarithmic measure of complexity is much more realistic: one assumes that the space taken by a number k is O(log k), and that any simple arithmetic operation can be done in time O(log k). It is with respect to this measure that we will evaluate the performance of algorithms in this paper.Our goal is to design efficient algorithms (in terms of logarithmic complexity) to generate uniformly at random words from certain classes of rational languages. We consider rational languages defined by their minimal finite deterministic automata. When computing complexity, neither the size of the automaton nor the cardinality of the alphabet are taken in account.In Section 2 we present some background on rational languages and their generating series. We describe briefly the classical method for generating words of such languages and we study its logarithmic complexity. We show that it is at best quadratic for most languages. This is due mainly to computations on numbers which grow exponentially with the length of the words to be generated. In order to improve significantly the efficiency of the algorithms, we must avoid handling of large numbers, or at least decrease substantially the frequency of computations on such numbers. Another alternative, briefly discussed in [7] and [12], is to compute with floating point numbers instead of integers. In this case, the logarithmic complexity is time-linear. However, using floating point numbers leads inevitably to approximations which prevent the exact uniformity of the generation.In Sections 3 and 4 we show that, in some cases, we can avoid computations on large numbers entirely or almost entirely, while keeping the exact uniformity of the generation. We determine two classes of rational languages for which this is the case.Section 3 concerns languages whose associated generating series have a unique singularity. We present a simple version of the classical algorithm, which totally avoids handling of large numbers. The logarithmic complexity of the method is O(n log n) in time and O(log n) in memory space.Section 4 focuses on languages whose associated generating series have the following property: there exists a unique singularity of minimum modulus, and this singularity is simple. For such languages we give a probabilistic version of the classical algorithm which generates words randomly while avoiding most computations on large numbers. This method needs a preprocessing stage, which can be done in polynomial time and linear space in terms of the length n of the words. Following preprocessing, any word of length n can be generated in average linear time and space

Elsevier - Publisher Connector

Multi-dimensional Boltzmann Sampling of Languages

Author: Bodini Olivier
Ponty Yann
Publication venue
Publication date: 01/01/2010
Field of study

This paper addresses the uniform random generation of words from a context-free language (over an alphabet of size

k

), while constraining every letter to a targeted frequency of occurrence. Our approach consists in a multidimensional extension of Boltzmann samplers \cite{Duchon2004}. We show that, under mostly \emph{strong-connectivity} hypotheses, our samplers return a word of size in

[(1-\varepsilon)n, (1+\varepsilon)n]

and exact frequency in

\mathcal{O}(n^{1+k/2})

expected time. Moreover, if we accept tolerance intervals of width in

\Omega(\sqrt{n})

for the number of occurrences of each letters, our samplers perform an approximate-size generation of words in expected

\mathcal{O}(n)

time. We illustrate these techniques on the generation of Tetris tessellations with uniform statistics in the different types of tetraminoes.Comment: 12p

arXiv.org e-Print Archive

HAL-CentraleSupelec

CiteSeerX

INRIA a CCSD electronic archive server

HAL-Polytechnique

HAL-Rennes 1

Controlled non uniform random generation of decomposable structures

Author: A. Denise
Berghen
Bertoni
Bostan
Brlek
Denise
Denise
Dershowitz
Drmota
Duchon
Dutour
Faugère
Flajolet
Flajolet
Flajolet
Flajolet
Fontana
Goldwurm
Greene
Hofacker
Hofacker
Jin
Lipshitz
M. Termier
Mathews
Mathews
Nebel
Nebel
Nicodème
Nijenhuis
Ponty
Salvy
Schönhage
van der Hoeven
Vauchaussade de Chaumont
Waterman
Y. Ponty
Publication venue: 'Elsevier BV'
Publication date: 01/01/2010
Field of study

Consider a class of decomposable combinatorial structures, using different types of atoms \Atoms = \{\At_1,\ldots ,\At_{|{\Atoms}|}\}. We address the random generation of such structures with respect to a size

n

and a targeted distribution in

k

of its \emph{distinguished} atoms. We consider two variations on this problem. In the first alternative, the targeted distribution is given by

k

real numbers \TargFreq_1, \ldots, \TargFreq_k such that 0 < \TargFreq_i < 1 for all

i

and \TargFreq_1+\cdots+\TargFreq_k \leq 1. We aim to generate random structures among the whole set of structures of a given size

n

, in such a way that the {\em expected} frequency of any distinguished atom \At_i equals \TargFreq_i. We address this problem by weighting the atoms with a

k

-tuple \Weights of real-valued weights, inducing a weighted distribution over the set of structures of size

n

. We first adapt the classical recursive random generation scheme into an algorithm taking \bigO{n^{1+o(1)}+mn\log{n}} arithmetic operations to draw

m

structures from the \Weights-weighted distribution. Secondly, we address the analytical computation of weights such that the targeted frequencies are achieved asymptotically, i. e. for large values of

n

. We derive systems of functional equations whose resolution gives an explicit relationship between \Weights and \TargFreq_1, \ldots, \TargFreq_k. Lastly, we give an algorithm in \bigO{k n^4} for the inverse problem, {\it i.e.} computing the frequencies associated with a given

k

-tuple \Weights of weights, and an optimized version in \bigO{k n^2} in the case of context-free languages. This allows for a heuristic resolution of the weights/frequencies relationship suitable for complex specifications. In the second alternative, the targeted distribution is given by a

k

natural numbers

n_1, \ldots, n_k

such that

n_1+\cdots+n_k+r=n

where

r \geq 0

is the number of undistinguished atoms. The structures must be generated uniformly among the set of structures of size

n

that contain {\em exactly}

n_i

atoms \At_i (

1 \leq i \leq k

). We give a \bigO{r^2\prod_{i=1}^k n_i^2 +m n k \log n} algorithm for generating

m

structures, which simplifies into a \bigO{r\prod_{i=1}^k n_i +m n} for regular specifications

arXiv.org e-Print Archive

HAL-CentraleSupelec

Elsevier - Publisher Connector

Crossref

INRIA a CCSD electronic archive server

HAL-Polytechnique

HAL-Rennes 1

Uniform Random Sampling of Traces in Very Large Models

Author: Collaboration the RaST
Denise Alain
Gaudel Marie-Claude
Gouraud Sandrine-Dominique
Lasseigne Richard
Peyronnet Sylvain
Publication venue
Publication date: 01/01/2006
Field of study

This paper presents some first results on how to perform uniform random walks (where every trace has the same probability to occur) in very large models. The models considered here are described in a succinct way as a set of communicating reactive modules. The method relies upon techniques for counting and drawing uniformly at random words in regular languages. Each module is considered as an automaton defining such a language. It is shown how it is possible to combine local uniform drawings of traces, and to obtain some global uniform random sampling, without construction of the global model

arXiv.org e-Print Archive

HAL-CentraleSupelec

Crossref

Hal-Diderot

Statistics and compression of scl

Author: Bavard
Brooks
Cannon
DANNY CALEGARI
Epstein
Fujiwara
Furman
Ghys
Gromov
Gromov
Hall
JOSEPH MAHER
Kakutani
Rivin
Stroock
Witte-Morris
Publication venue: 'Cambridge University Press (CUP)'
Publication date: 12/02/2013
Field of study

We obtain sharp estimates on the growth rate of stable commutator length on random (geodesic) words, and on random walks, in hyperbolic groups and groups acting nondegenerately on hyperbolic spaces. In either case, we show that with high probability stable commutator length of an element of length

n

is of order

n/\log{n}

. This establishes quantitative refinements of qualitative results of Bestvina-Fujiwara and others on the infinite dimensionality of 2-dimensional bounded cohomology in groups acting suitably on hyperbolic spaces, in the sense that we can control the geometry of the unit balls in these normed vector spaces (or rather, in random subspaces of their normed duals). As a corollary of our methods, we show that an element obtained by random walk of length

n

in a mapping class group cannot be written as a product of fewer than

O(n/\log{n})

reducible elements, with probability going to 1 as

n

goes to infinity. We also show that the translation length on the complex of free factors of a random walk of length

n

on the outer automorphism group of a free group grows linearly in

n

.Comment: Minor edits arising from referee's comments; 45 page

arXiv.org e-Print Archive

CiteSeerX

Crossref