Search CORE

17 research outputs found

Random Generation of Nondeterministic Finite-State Tree Automata

Author: Hanneforth Thomas
Maletti Andreas
Quernheim Daniel
Publication venue: 'Open Publishing Association'
Publication date: 21/11/2013
Field of study

Algorithms for (nondeterministic) finite-state tree automata (FTAs) are often tested on random FTAs, in which all internal transitions are equiprobable. The run-time results obtained in this manner are usually overly optimistic as most such generated random FTAs are trivial in the sense that the number of states of an equivalent minimal deterministic FTA is extremely small. It is demonstrated that nontrivial random FTAs are obtained only for a narrow band of transition probabilities. Moreover, an analytic analysis yields a formula to approximate the transition probability that yields the most complex random FTAs, which should be used in experiments.Comment: In Proceedings TTATT 2013, arXiv:1311.5058. Andreas Maletti and Daniel Quernheim were financially supported by the German Research Foundation (DFG) grant MA/4959/1-

arXiv.org e-Print Archive

Directory of Open Access Journals

Statistical language models within the algebra of weighted rational languages

Author: Hanneforth Thomas
Würzner Kay-Michael
Publication venue
Publication date: 01/01/2009
Field of study

Statistical language models are an important tool in natural language processing. They represent prior knowledge about a certain language which is usually gained from a set of samples called a corpus. In this paper, we present a novel way of creating N-gram language models using weighted finite automata. The construction of these models is formalised within the algebra underlying weighted finite automata and expressed in terms of weighted rational languages and transductions. Besides the algebra we make use of five special constant weighted transductions which rely only on the alphabet and the model parameter N. In addition, we discuss efficient implementations of these transductions in terms of virtual constructions

University of Szeged

Recommended from our members

The Acquisition of Programming Skills from Textbooks

Author: Hanneforth Thomas
Klenner Manfred
Publication venue: eScholarship, University of California
Publication date: 01/01/1998
Field of study

We present a computer model for the acquistion of programming languages from textbooks. Starting from a verbal description of the notational conventions that are used to describe the syntactic form of programming commands, a meta grammar is generated that parses concrete command descriptions and builds up grammar rules for that commands. These rules are realized as definite clause grammar rules that captures the syntax of these commands. They can be used to parse and generate syntactically correct examples of a command. However, to solve real programming problems also the semantics of a command and of its parameters needs to be acquired. This is accomplished by the natural language parsing of the explanations given in the text and the augmentation of the definite clause command grammars with semantic structures

eScholarship - University of California

The Acquisition of Programming Skills from Textbooks

Author: Manfred Klenner
Thomas Hanneforth
Publication venue
Publication date
Field of study

CiteSeerX

Weaving the Semantic Web: Extracting and Representing the Content of Pathology Reports

Author: Hanneforth Thomas
Schlangen David
Stede Manfred
Publication venue
Publication date: 01/01/2005
Field of study

Schlangen D, Hanneforth T, Stede M. Weaving the Semantic Web: Extracting and Representing the Content of Pathology Reports. In: Proceedings of the GLDV Conference 2005 (GLDV05). Bonn, Germany; 2005

Publications at Bielefeld University

Pushing for weighted tree automata

Author: Andreas Maletti
Daniel Quernheim
Thomas Hanneforth
Publication venue: Logical Methods in Computer Science e.V.
Publication date: 01/01/2018
Field of study

A weight normalization procedure, commonly called pushing, is introduced for weighted tree automata (wta) over commutative semifields. The normalization preserves the recognized weighted tree language even for nondeterministic wta, but it is most useful for bottom-up deterministic wta, where it can be used for minimization and equivalence testing. In both applications a careful selection of the weights to be redistributed followed by normalization allows a reduction of the general problem to the corresponding problem for bottom-up deterministic unweighted tree automata. This approach was already successfully used by Mohri and Eisner for the minimization of deterministic weighted string automata. Moreover, the new equivalence test for two wta

M

and

M'

runs in time

\mathcal O((\lvert M \rvert + \lvert M'\rvert) \cdot \log {(\lvert Q\rvert + \lvert Q'\rvert)})

, where

Q

and

Q'

are the states of

M

and

M'

, respectively, which improves the previously best run-time

\mathcal O(\lvert M \rvert \cdot \lvert M'\rvert)

Directory of Open Access Journals