71 research outputs found

    Two characterisation results of multiple context-free grammars and their application to parsing

    Get PDF
    In the first part of this thesis, a Chomsky-SchĂŒtzenberger characterisation and an automaton characterisation of multiple context-free grammars are proved. Furthermore, a framework for approximation of automata with storage is described. The second part develops each of the three theoretical results into a parsing algorithm

    Synoptic introduction

    Get PDF
    Noam Chomsky is justly famous for his revolutionary contributions to linguistics, psychology and philosophy. He is presently in his 92nd year, and we thought it high time to provide an overview of the major achievements of his now more than sixty-year-old research program and its prospects for the future. This is particularly pressing in the light of persistent rumors, encouraged by a number of authors1, that his program has proven bankrupt, “completely wrong” and has been replaced by various sorts of proposals in general statistical learning and “functionalist/constructionist” linguistic theories (which we return to below)

    Chomsky-SchĂŒtzenberger parsing for weighted multiple context-free languages

    Full text link

    On Languages Generated by Signed Grammars

    Full text link
    We consider languages defined by signed grammars which are similar to context-free grammars except productions with signs associated to them are allowed. As a consequence, the words generated also have signs. We use the structure of the formal series of yields of all derivation trees over such a grammar as a method of specifying a formal language and study properties of the resulting family of languages.Comment: In Proceedings NCMA 2023, arXiv:2309.0733

    Context-Free Graph Properties via Definable Decompositions

    Get PDF

    Self-Attention Networks Can Process Bounded Hierarchical Languages

    Full text link
    Despite their impressive performance in NLP, self-attention networks were recently proved to be limited for processing formal languages with hierarchical structure, such as Dyckk\mathsf{Dyck}_k, the language consisting of well-nested parentheses of kk types. This suggested that natural language can be approximated well with models that are too weak for formal languages, or that the role of hierarchy and recursion in natural language might be limited. We qualify this implication by proving that self-attention networks can process Dyckk,D\mathsf{Dyck}_{k, D}, the subset of Dyckk\mathsf{Dyck}_{k} with depth bounded by DD, which arguably better captures the bounded hierarchical structure of natural language. Specifically, we construct a hard-attention network with D+1D+1 layers and O(log⁥k)O(\log k) memory size (per token per layer) that recognizes Dyckk,D\mathsf{Dyck}_{k, D}, and a soft-attention network with two layers and O(log⁥k)O(\log k) memory size that generates Dyckk,D\mathsf{Dyck}_{k, D}. Experiments show that self-attention networks trained on Dyckk,D\mathsf{Dyck}_{k, D} generalize to longer inputs with near-perfect accuracy, and also verify the theoretical memory advantage of self-attention networks over recurrent networks.Comment: ACL 2021. 19 pages with extended appendix. Fixed a small typo in the formula at the end of page 5 (thank to Gabriel Faria). Code: https://github.com/princeton-nlp/dyck-transforme

    Topics in Programming Languages, a Philosophical Analysis through the case of Prolog

    Get PDF
    [EN]Programming languages seldom find proper anchorage in philosophy of logic, language and science. is more, philosophy of language seems to be restricted to natural languages and linguistics, and even philosophy of logic is rarely framed into programming languages topics. The logic programming paradigm and Prolog are, thus, the most adequate paradigm and programming language to work on this subject, combining natural language processing and linguistics, logic programming and constriction methodology on both algorithms and procedures, on an overall philosophizing declarative status. Not only this, but the dimension of the Fifth Generation Computer system related to strong Al wherein Prolog took a major role. and its historical frame in the very crucial dialectic between procedural and declarative paradigms, structuralist and empiricist biases, serves, in exemplar form, to treat straight ahead philosophy of logic, language and science in the contemporaneous age as well. In recounting Prolog's philosophical, mechanical and algorithmic harbingers, the opportunity is open to various routes. We herein shall exemplify some: - the mechanical-computational background explored by Pascal, Leibniz, Boole, Jacquard, Babbage, Konrad Zuse, until reaching to the ACE (Alan Turing) and EDVAC (von Neumann), offering the backbone in computer architecture, and the work of Turing, Church, Gödel, Kleene, von Neumann, Shannon, and others on computability, in parallel lines, throughly studied in detail, permit us to interpret ahead the evolving realm of programming languages. The proper line from lambda-calculus, to the Algol-family, the declarative and procedural split with the C language and Prolog, and the ensuing branching and programming languages explosion and further delimitation, are thereupon inspected as to relate them with the proper syntax, semantics and philosophical élan of logic programming and Prolog

    Weighted Operator Precedence Languages

    Get PDF
    In the last years renewed investigation of operator precedence languages (OPL) led to discover important properties thereof: OPL are closed with respect to all major operations, are characterized, besides the original grammar family, in terms of an automata family (OPA) and an MSO logic; furthermore they significantly generalize the well-known visibly pushdown languages (VPL). In another area of research, quantitative models of systems are also greatly in demand. In this paper, we lay the foundation to marry these two research fields. We introduce weighted operator precedence automata and show how they are both strict extensions of OPA and weighted visibly pushdown automata. We prove a Nivat-like result which shows that quantitative OPL can be described by unweighted OPA and very particular weighted OPA. In a BĂŒchi-like theorem, we show that weighted OPA are expressively equivalent to a weighted MSO-logic for OPL

    New Results on Context-Free Tree Languages

    Get PDF
    Context-free tree languages play an important role in algebraic semantics and are applied in mathematical linguistics. In this thesis, we present some new results on context-free tree languages

    Multi-weighted Automata Models and Quantitative Logics

    Get PDF
    Recently, multi-priced timed automata have received much attention for real-time systems. These automata extend priced timed automata by featuring several price parameters. This permits to compute objectives like the optimal ratio between rewards and costs. Arising from the model of timed automata, the multi-weighted setting has also attracted much notice for classical nondeterministic automata. The present thesis develops multi-weighted MSO-logics on finite, infinite and timed words which are expressively equivalent to multi-weighted automata, and studies decision problems for them. In addition, a Nivat-like theorem for weighted timed automata is proved; this theorem establishes a connection between quantitative and qualitative behaviors of timed automata. Moreover, a logical characterization of timed pushdown automata is given
    • 

    corecore