6 research outputs found
An Abstract Machine for Unification Grammars
This work describes the design and implementation of an abstract machine,
Amalia, for the linguistic formalism ALE, which is based on typed feature
structures. This formalism is one of the most widely accepted in computational
linguistics and has been used for designing grammars in various linguistic
theories, most notably HPSG. Amalia is composed of data structures and a set of
instructions, augmented by a compiler from the grammatical formalism to the
abstract instructions, and a (portable) interpreter of the abstract
instructions. The effect of each instruction is defined using a low-level
language that can be executed on ordinary hardware.
The advantages of the abstract machine approach are twofold. From a
theoretical point of view, the abstract machine gives a well-defined
operational semantics to the grammatical formalism. This ensures that grammars
specified using our system are endowed with well defined meaning. It enables,
for example, to formally verify the correctness of a compiler for HPSG, given
an independent definition. From a practical point of view, Amalia is the first
system that employs a direct compilation scheme for unification grammars that
are based on typed feature structures. The use of amalia results in a much
improved performance over existing systems.
In order to test the machine on a realistic application, we have developed a
small-scale, HPSG-based grammar for a fragment of the Hebrew language, using
Amalia as the development platform. This is the first application of HPSG to a
Semitic language.Comment: Doctoral Thesis, 96 pages, many postscript figures, uses pstricks,
pst-node, psfig, fullname and a macros fil
Inheritance hierarchies: Semantics and unification
Inheritance hierarchies are introduced as a means of representing taxonomicallyorganized data. The hierarchies are built up from so-called feature types that are ordered by subtyping and whose elements are records. Every feature type comes with a set of features prescribing fields of its record elements. So-called feature terms are available to denote subsets of feature types. Feature unification is introduced as an operation that decides whether two feature terms have a nonempty intersection and computes a feature term denoting the intersection.We model our inheritance hierarchies as algebraic specifications in ordersortedequational logic using initial algebra semantics. Our framework integrates feature types whose elements are obtained as records with constructor types whose elements are obtained by constructor application. Unification in these hierarchies combines record unification with order-sorted term unification and is presented as constraint solving. We specify a unitary unification algorithm by a set of simplification rules and prove its soundness and completeness with respect to the model-theoretic semantics
Recommended from our members
On the Formal Flexibility of Syntactic Categories
This dissertation explores the formal flexibility of syntactic categories. The main proposal is that Universal Grammar (UG) only provides templatic guidance for syntactic category formation and organization but leaves many other issues open, including issues internal to a single category and issues at the intercategorial, system level: these points that UG "does not care about" turn out to enrich the categorial ontology of human language in important ways.
The dissertation consists of seven chapters. After a general introduction in Chapter 1, I lay out some foundational issues regarding features and categories in Chapter 2 and delineate a featural metalanguage comprising four components: specification, valuation, typing, and granularity. Based on that I put forward a templatic definition for syntactic categories, which unifies the combinatorial and taxonomic perspectives under the notion mergeme. Then, a detailed overview of the "categorial universe" I work with is presented, which shows that the syntactic category system (SCS) is an intricate web structured by five layers of abstraction divided into three broad levels of concern: the individual level (layers 1–2), the global level (layers 3–4), and the supraglobal level (layer 5). In the subsequent chapters I explore the template-flexibility pairs at each abstraction layer, with Chapters 3–4 focusing on the first layer, Chapter 5 on the second layer, and Chapter 6 on the third and fourth layers; the fifth layer is not in the scope of this dissertation.
Chapter 3 examines a special type of category defined by an underspecified mergeme, the defective category, which behaves like a "chameleon" in that it gets assimilated into whatever nondefective category it merges with. This characteristic makes it potentially useful in analyzing certain adjunction structures, and I explore this potential by two case studies, one focusing on modifier-head compounds and the other on sentence-final particles. Chapter 4 examines another special type of category defined by the absence of a mergeme, the Root category. Deductive reasoning leads me to propose a generalized root syntax, according to which roots are not confined to lexical categorial environments but may legally merge with and hence "support" any non-Root category. I demonstrate the empirical consequences of this theory by a comprehensive study of the half-lexical–half-functional vocabulary items in Chinese.
Chapter 5 ascends to the second abstraction layer and raises the question of whether the categorial sequences (or projection hierarchies) in human language are necessarily totally ordered, as certain analytical devices (e.g., "flavored" categories) can only be theoretically maintained if we also allow categorial sequences to be partially ordered. After a diachronic study of the flavored verbalizer (stative) in Chinese resultative compounds, I conclude that while "flavoring" is indeed a possible type of flexibility in the SCS, it is the deviation rather than the norm due to non-UG or "third" factors and hence should be cautiously used in syntactic analyses.
Chapter 6 ascends even higher on the ladder of abstraction and examines the global interconnection in the SCS ontology with the aid of mathematical Category theory. I formalize the functional parallelism across major parts of speech and the inheritance-based relations across granularity levels as Category-theoretic structures, which reveal further and more abstract templates and flexibility types in the SCS. A crucial mathematical concept in the formalization is epi-Adjunction. Finally, in Chapter 7 I summarize the main results of this dissertation and briefly discuss some potential directions of future research.My PhD is funded by Cambridge Trust and China Scholarship Council. I have also received travel grants and financial aids from Gonville and Caius College and the Faculty of Modern and Medieval Languages