16,274 research outputs found
Complete Axiomatizations of Fragments of Monadic Second-Order Logic on Finite Trees
We consider a specific class of tree structures that can represent basic
structures in linguistics and computer science such as XML documents, parse
trees, and treebanks, namely, finite node-labeled sibling-ordered trees. We
present axiomatizations of the monadic second-order logic (MSO), monadic
transitive closure logic (FO(TC1)) and monadic least fixed-point logic
(FO(LFP1)) theories of this class of structures. These logics can express
important properties such as reachability. Using model-theoretic techniques, we
show by a uniform argument that these axiomatizations are complete, i.e., each
formula that is valid on all finite trees is provable using our axioms. As a
backdrop to our positive results, on arbitrary structures, the logics that we
study are known to be non-recursively axiomatizable
Logics for Unranked Trees: An Overview
Labeled unranked trees are used as a model of XML documents, and logical
languages for them have been studied actively over the past several years. Such
logics have different purposes: some are better suited for extracting data,
some for expressing navigational properties, and some make it easy to relate
complex properties of trees to the existence of tree automata for those
properties. Furthermore, logics differ significantly in their model-checking
properties, their automata models, and their behavior on ordered and unordered
trees. In this paper we present a survey of logics for unranked trees
On Descriptive Complexity, Language Complexity, and GB
We introduce , a monadic second-order language for reasoning about
trees which characterizes the strongly Context-Free Languages in the sense that
a set of finite trees is definable in iff it is (modulo a
projection) a Local Set---the set of derivation trees generated by a CFG. This
provides a flexible approach to establishing language-theoretic complexity
results for formalisms that are based on systems of well-formedness constraints
on trees. We demonstrate this technique by sketching two such results for
Government and Binding Theory. First, we show that {\em free-indexation\/}, the
mechanism assumed to mediate a variety of agreement and binding relationships
in GB, is not definable in and therefore not enforcible by CFGs.
Second, we show how, in spite of this limitation, a reasonably complete GB
account of English can be defined in . Consequently, the language
licensed by that account is strongly context-free. We illustrate some of the
issues involved in establishing this result by looking at the definition, in
, of chains. The limitations of this definition provide some insight
into the types of natural linguistic principles that correspond to higher
levels of language complexity. We close with some speculation on the possible
significance of these results for generative linguistics.Comment: To appear in Specifying Syntactic Structures, papers from the Logic,
Structures, and Syntax workshop, Amsterdam, Sept. 1994. LaTeX source with
nine included postscript figure
A Generalised Quantifier Theory of Natural Language in Categorical Compositional Distributional Semantics with Bialgebras
Categorical compositional distributional semantics is a model of natural
language; it combines the statistical vector space models of words with the
compositional models of grammar. We formalise in this model the generalised
quantifier theory of natural language, due to Barwise and Cooper. The
underlying setting is a compact closed category with bialgebras. We start from
a generative grammar formalisation and develop an abstract categorical
compositional semantics for it, then instantiate the abstract setting to sets
and relations and to finite dimensional vector spaces and linear maps. We prove
the equivalence of the relational instantiation to the truth theoretic
semantics of generalised quantifiers. The vector space instantiation formalises
the statistical usages of words and enables us to, for the first time, reason
about quantified phrases and sentences compositionally in distributional
semantics
A Corpus-based Toy Model for DisCoCat
The categorical compositional distributional (DisCoCat) model of meaning
rigorously connects distributional semantics and pregroup grammars, and has
found a variety of applications in computational linguistics. From a more
abstract standpoint, the DisCoCat paradigm predicates the construction of a
mapping from syntax to categorical semantics. In this work we present a
concrete construction of one such mapping, from a toy model of syntax for
corpora annotated with constituent structure trees, to categorical semantics
taking place in a category of free R-semimodules over an involutive commutative
semiring R.Comment: In Proceedings SLPCS 2016, arXiv:1608.0101
- …