Search CORE

18,946 research outputs found

Formal inverses of the generalized Thue-Morse sequences and variations of the Rudin-Shapiro sequence

Author: Merta Łukasz
Publication date: 01/01/2020
A formal inverse of a given automatic sequence (the sequence of coefficients of the compositional inverse of its associated formal power series) is also automatic. The comparison of properties of the original sequence and its formal inverse is an interesting problem. Such an analysis has been done before for the Thue-Morse sequence. In this paper, we describe arithmetic properties of formal inverses of the generalized Thue-Morse sequences and formal inverses of two modifications of the Rudin-Shapiro sequence. In each case, we give the recurrence relations and the automaton, then we analyze the lengths of strings of consecutive identical letters as well as the frequencies of letters. We also compare the obtained results with the original sequences. Comment: 20 pages

arXiv.org e-Print Archive

Episciences.org

Jagiellonian University Repository

Multidimensional Generalized Automatic Sequences and Shape-symmetric Morphic Words

Author: Charlier Emilie
Karki Tomi
Rigo Michel
Publication date: 01/01/2009
An infinite word is $S$-automatic if, for all $n \ge 0$, its $(n+1)$st letter is the output of a deterministic automaton fed with the representation of $n$ in the considered numeration system $S$. In this extended abstract, we consider an analogous definition in a multidimensional setting and present the connection to the shape-symmetric infinite words introduced by Arnaud Maes. More precisely, for $d \ge 2$, we state that a multidimensional infinite word $x : \mathbb{N}^d \to \Sigma$ over a finite alphabet $\Sigma$ is $S$-automatic for some abstract numeration system $S$ built on a regular language containing the empty word if and only if $x$ is the image by a coding of a shape-symmetric infinite word.

arXiv.org e-Print Archive

CiteSeerX

Elsevier - Publisher Connector

Open Repository and Bibliography - Liège
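
The one-dimensional definition quoted in the abstract above can be illustrated in its simplest special case. The sketch below assumes that the numeration system $S$ is ordinary base 2 (an assumption made only for illustration; the paper itself treats abstract numeration systems) and uses the standard two-state automaton that generates the Thue-Morse word. The names `automatic_letter`, `TM_DELTA`, and `TM_OUTPUT` are hypothetical and do not come from the paper.

```python
def automatic_letter(n, delta, start, output, base=2):
    """Return the letter at index n of an automatic word.

    The automaton reads the base-`base` digits of n, most significant
    digit first, and the output map is applied to the final state.
    The integer 0 is represented by the empty word.
    """
    digits = []
    while n > 0:
        digits.append(n % base)
        n //= base
    state = start
    for d in reversed(digits):  # most significant digit first
        state = delta[(state, d)]
    return output[state]


# Assumed example automaton: the state tracks the parity of the number
# of 1 digits read so far, which yields the Thue-Morse word.
TM_DELTA = {(0, 0): 0, (0, 1): 1, (1, 0): 1, (1, 1): 0}
TM_OUTPUT = {0: 0, 1: 1}

prefix = [automatic_letter(n, TM_DELTA, 0, TM_OUTPUT) for n in range(8)]
print(prefix)
```

Swapping in a different transition table and output map gives other base-2 automatic words; handling a genuinely abstract numeration system would require replacing the digit expansion with the representation function of that system.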

Quasicrystals, model sets, and automatic sequences

Author: Allouche Jean-Paul
Meyer Yves
Publication venue: 'Elsevier BV'
Publication date: 01/01/2014
We survey mathematical properties of quasicrystals, first from the point of view of harmonic analysis, then from the point of view of morphic and automatic sequences.

arXiv.org e-Print Archive

CiteSeerX

Comptes Rendus Physique

Cognitive scale-free networks as a model for intermittency in human natural language

Author: Allegrini Paolo
Grigolini Paolo
Palatella Luigi
Publication venue: 'World Scientific Pub Co Pte Lt'
Publication date: 01/01/2003
We model certain features of human language complexity by means of advanced concepts borrowed from statistical mechanics. Using a time series approach, the diffusion entropy method (DE), we compute the complexity of an Italian corpus of newspapers and magazines. We find that the anomalous scaling index is compatible with a simple dynamical model, a random walk on a complex scale-free network, which is linguistically related to Saussure's paradigms. The model yields the famous Zipf's law in terms of the generalized central limit theorem. Comment: Conference FRACTAL 200

arXiv.org e-Print Archive

CiteSeerX

UNT Digital Library

On Hilberg's Law and Its Links with Guiraud's Law

Author: Dębowski Łukasz
Publication venue: 'Informa UK Limited'
Publication date: 07/07/2005
Hilberg (1990) supposed that finite-order excess entropy of a random human text is proportional to the square root of the text length. Assuming that Hilberg's hypothesis is true, we derive Guiraud's law, which states that the number of word types in a text is greater than proportional to the square root of the text length. Our derivation is based on some mathematical conjecture in coding theory and on several experiments suggesting that words can be defined approximately as the nonterminals of the shortest context-free grammar for the text. Such an operational definition of words can be applied even to texts deprived of spaces, which do not allow for Mandelbrot's "intermittent silence" explanation of Zipf's and Guiraud's laws. In contrast to Mandelbrot's, our model assumes some probabilistic long-memory effects in human narration and might be capable of explaining Menzerath's law. Comment: To appear in Journal of Quantitative Linguistics

arXiv.org e-Print Archive

Crossref

Can simple models explain Zipf’s law for all exponents?

Author: Ferrer Cancho Ramon
Servedio Vito D. P.
Publication venue: RAM-Verlag
Publication date: 01/01/2005
H. Simon proposed a simple stochastic process for explaining Zipf’s law for word frequencies. Here we introduce two similar generalizations of Simon’s model that cover the same range of exponents as the standard Simon model. The mathematical approach followed minimizes the amount of mathematical background needed for deriving the exponent, compared to previous approaches to the standard Simon model. Reviewing what is known from other simple explanations of Zipf’s law, we conclude there is no single radically simple explanation covering the whole range of variation of the exponent of Zipf’s law in humans. The meaningfulness of Zipf’s law for word frequencies remains an open question. Peer Reviewed. Postprint (published version)

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UPCommons. Portal del coneixement obert de la UPC

Automated Detection of Usage Errors in non-native English Writing

Author: Fujishima Satoru
Ishizaki Shun
Publication date: 26/10/2011
In an investigation of the use of a novelty detection algorithm for identifying inappropriate word combinations in a raw English corpus, we employ an unsupervised detection algorithm based on one-class support vector machines (OC-SVMs) and extract sentences containing word sequences whose frequency of appearance is significantly low in native English writing. Combined with n-gram language models and document categorization techniques, the OC-SVM classifier assigns given sentences to two different groups: the sentences containing errors and those without errors. Accuracies are 79.30% with the bigram model, 86.63% with the trigram model, and 34.34% with the four-gram model.

EEPIS Repository

On winning shifts of marked uniform substitutions

Author: Peltomäki Jarkko
Salo Ville
Publication venue: 'EDP Sciences'
Publication date: 05/09/2018
The second author introduced with I. Törmä a two-player word-building game [Playing with Subshifts, Fund. Inform. 132 (2014), 131--152]. The game has a predetermined (possibly finite) choice sequence $\alpha_1$, $\alpha_2$, $\ldots$ of integers such that on round $n$ the player $A$ chooses a subset $S_n$ of size $\alpha_n$ of some fixed finite alphabet and the player $B$ picks a letter from the set $S_n$. The outcome is determined by whether the word obtained by concatenating the letters $B$ picked lies in a prescribed target set $X$ (a win for player $A$) or not (a win for player $B$). Typically, we consider $X$ to be a subshift. The winning shift $W(X)$ of a subshift $X$ is defined as the set of choice sequences for which $A$ has a winning strategy when the target set is the language of $X$. The winning shift $W(X)$ mirrors some properties of $X$. For instance, $W(X)$ and $X$ have the same entropy. Virtually nothing is known about the structure of the winning shifts of subshifts common in combinatorics on words. In this paper, we study the winning shifts of subshifts generated by marked uniform substitutions, and show that these winning shifts, viewed as subshifts, also have a substitutive structure. Particularly, we give an explicit description of the winning shift for the generalized Thue-Morse substitutions. It is known that $W(X)$ and $X$ have the same factor complexity. As an example application, we exploit this connection to give a simple derivation of the first difference and factor complexity functions of subshifts generated by marked substitutions. We describe these functions in particular detail for the generalized Thue-Morse substitutions. Comment: Extended version of a paper presented at RuFiDiM I

arXiv.org e-Print Archive

Crossref

EDP Sciences OAI-PMH repository (1.2.0)
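
The game described in the abstract above can be explored by brute force for finite choice sequences. The following is a minimal sketch, not the authors' method: it assumes the target is the language of the golden mean shift (binary words with no factor `11`), chosen only because it is easy to test, and the names `in_language`, `a_wins`, and `count_winning` are hypothetical. Player $A$ can win from a given position exactly when at least $\alpha_n$ letters lead to positions that are again winning for $A$, because $A$ may then offer a set made up only of such letters.

```python
from itertools import product


def in_language(word):
    # Assumed target set: the language of the golden mean shift,
    # i.e. binary words that do not contain "11".
    return "11" not in word


def a_wins(choices, word="", alphabet="01"):
    """Decide whether player A has a winning strategy.

    `choices` is the finite choice sequence (alpha_1, ..., alpha_k) and
    `word` is the word built so far from B's picks.
    """
    if not choices:
        return in_language(word)
    # Count the letters from which A can still force a win.
    good = sum(a_wins(choices[1:], word + a, alphabet) for a in alphabet)
    # A needs a set of size choices[0] consisting only of good letters.
    return good >= choices[0]


def count_winning(k, alphabet="01"):
    """Number of winning choice sequences of length k with entries >= 1."""
    sizes = range(1, len(alphabet) + 1)
    return sum(a_wins(c, "", alphabet) for c in product(sizes, repeat=k))


print(a_wins((2, 1)), a_wins((2, 2)))
print([count_winning(k) for k in range(1, 5)])
```

For this small example the number of winning choice sequences of each length can be compared with the number of words of that length in the language, in line with the statement in the abstract that $W(X)$ and $X$ have the same factor complexity. The search is exponential in the length of the choice sequence, so it is suitable only for short experiments.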