Search CORE

7,777 research outputs found

A Note on Computing Time for Recognition of Languages Generated by Linear Grammars

Author: Kasami Tadao
Publication venue: Coordinated Science Laboratory, University of Illinois at Urbana-Champaign
Publication date: 01/04/1966
Field of study

Coordinated Science Laboratory was formerly known as Control Systems LaboratoryJoint Services Electronics Program / DA 28 043 AMC 00073(E)National Science Foundation / NSF GK-69

Elsevier - Publisher Connector

Illinois Digital Environment for Access to Learning and Scholarship Repository

Computation of distances for regular and context-free probabilistic languages

Author: Nederhof Mark Jan
Satta Giorgio
Publication venue
Publication date: 01/01/2008
Field of study

Several mathematical distances between probabilistic languages have been investigated in the literature, motivated by applications in language modeling, computational biology, syntactic pattern matching and machine learning. In most cases, only pairs of probabilistic regular languages were considered. In this paper we extend the previous results to pairs of languages generated by a probabilistic context-free grammar and a probabilistic finite automaton.PostprintPeer reviewe

Elsevier - Publisher Connector

Crossref

Archivio istituzionale della ricerca - Università di Padova

University of St. Andrews - Pure

St Andrews Research Repository

Coding-theorem Like Behaviour and Emergence of the Universal Distribution from Resource-bounded Algorithmic Probability

Author: Badillo Liliana
Hernández-Orozco Santiago
Hernández-Quiroz Francisco
Zenil Hector
Publication venue: 'Informa UK Limited'
Publication date: 13/04/2018
Field of study

Previously referred to as `miraculous' in the scientific literature because of its powerful properties and its wide application as optimal solution to the problem of induction/inference, (approximations to) Algorithmic Probability (AP) and the associated Universal Distribution are (or should be) of the greatest importance in science. Here we investigate the emergence, the rates of emergence and convergence, and the Coding-theorem like behaviour of AP in Turing-subuniversal models of computation. We investigate empirical distributions of computing models in the Chomsky hierarchy. We introduce measures of algorithmic probability and algorithmic complexity based upon resource-bounded computation, in contrast to previously thoroughly investigated distributions produced from the output distribution of Turing machines. This approach allows for numerical approximations to algorithmic (Kolmogorov-Chaitin) complexity-based estimations at each of the levels of a computational hierarchy. We demonstrate that all these estimations are correlated in rank and that they converge both in rank and values as a function of computational power, despite fundamental differences between computational models. In the context of natural processes that operate below the Turing universal level because of finite resources and physical degradation, the investigation of natural biases stemming from algorithmic rules may shed light on the distribution of outcomes. We show that up to 60\% of the simplicity/complexity bias in distributions produced even by the weakest of the computational models can be accounted for by Algorithmic Probability in its approximation to the Universal Distribution.Comment: 27 pages main text, 39 pages including supplement. Online complexity calculator: http://complexitycalculator.com

arXiv.org e-Print Archive

FigShare

Probabilistic parsing

Author: Nederhof Mark Jan
Satta Giorgio
Publication venue: Springer
Publication date: 06/01/2011
Field of study

Postprin

St Andrews Research Repository

Precise n-gram Probabilities from Stochastic Context-free Grammars

Author: Segal Jonathan
Stolcke Andreas
Publication venue
Publication date: 01/01/1994
Field of study

We present an algorithm for computing n-gram probabilities from stochastic context-free grammars, a procedure that can alleviate some of the standard problems associated with n-grams (estimation from sparse data, lack of linguistic structure, among others). The method operates via the computation of substring expectations, which in turn is accomplished by solving systems of linear equations derived from the grammar. We discuss efficient implementation of the algorithm and report our practical experience with it.Comment: 12 pages, to appear in ACL-9

arXiv.org e-Print Archive

CiteSeerX

Crossref

Streaming algorithms for language recognition problems

Author: Babu Ajesh
Limaye Nutan
Radhakrishnan Jaikumar
Varma Girish
Publication venue
Publication date: 01/01/2011
Field of study

We study the complexity of the following problems in the streaming model. Membership testing for \DLIN We show that every language in \DLIN\ can be recognised by a randomized one-pass

O(\log n)

space algorithm with inverse polynomial one-sided error, and by a deterministic p-pass

O(n/p)

space algorithm. We show that these algorithms are optimal. Membership testing for \LL

(k)

For languages generated by \LL

(k)

grammars with a bound of

r

on the number of nonterminals at any stage in the left-most derivation, we show that membership can be tested by a randomized one-pass

O(r\log n)

space algorithm with inverse polynomial (in

n

) one-sided error. Membership testing for \DCFL We show that randomized algorithms as efficient as the ones described above for \DLIN\ and \LL(k) (which are subclasses of \DCFL) cannot exist for all of \DCFL: there is a language in \VPL\ (a subclass of \DCFL) for which any randomized p-pass algorithm with error bounded by

\epsilon < 1/2

must use

\Omega(n/p)

space. Degree sequence problem We study the problem of determining, given a sequence

d_1, d_2,..., d_n

and a graph

G

, whether the degree sequence of

G

is precisely

d_1, d_2,..., d_n

. We give a randomized one-pass

O(\log n)

space algorithm with inverse polynomial one-sided error probability. We show that our algorithms are optimal. Our randomized algorithms are based on the recent work of Magniez et al. \cite{MMN09}; our lower bounds are obtained by considering related communication complexity problems

arXiv.org e-Print Archive

A Tractable Extension of Linear Indexed Grammars

Author: Keller Bill
Weir David
Publication venue
Publication date: 01/01/1995
Field of study

It has been shown that Linear Indexed Grammars can be processed in polynomial time by exploiting constraints which make possible the extensive use of structure-sharing. This paper describes a formalism that is more powerful than Linear Indexed Grammar, but which can also be processed in polynomial time using similar techniques. The formalism, which we refer to as Partially Linear PATR manipulates feature structures rather than stacks.Comment: 8 pages LaTeX, uses eaclap.sty, to appear in EACL-9

arXiv.org e-Print Archive

CiteSeerX

Crossref

Sussex Research Online

Descriptional complexity of cellular automata and decidability questions

Author: Malcher Andreas
Publication venue
Publication date: 01/01/2001
Field of study

We study the descriptional complexity of cellular automata (CA), a parallel model of computation. We show that between one of the simplest cellular models, the realtime-OCA. and "classical" models like deterministic finite automata (DFA) or pushdown automata (PDA), there will be savings concerning the size of description not bounded by any recursive function, a so-called nonrecursive trade-off. Furthermore, nonrecursive trade-offs are shown between some restricted classes of cellular automata. The set of valid computations of a Turing machine can be recognized by a realtime-OCA. This implies that many decidability questions are not even semi decidable for cellular automata. There is no pumping lemma and no minimization algorithm for cellular automata

Hochschulschriftenserver - Universität Frankfurt am Main