Search CORE

432 research outputs found

Validating LR(1) Parsers

Author: Jourdan Jacques-Henri
Leroy Xavier
Pottier François
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012

An LR(1) parser is a finite-state automaton, equipped with a stack, which uses a combination of its current state and one lookahead symbol in order to determine which action to perform next. We present a validator which, when applied to a context-free grammar G and an automaton A, checks that A and G agree. Validating the parser provides the correctness guarantees required by verified compilers and other high-assurance software that involves parsing. The validation process is independent of which technique was used to construct A. The validator is implemented and proved correct using the Coq proof assistant. As an application, we build a formally-verified parser for the C99 language.
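To make the abstract's description of an LR(1) parser concrete, here is a minimal sketch of the state-plus-lookahead driver loop. The toy grammar, the hand-built `ACTION` and `GOTO` tables, and the function name `parse` are all assumptions made for illustration; none of them come from the paper, and the sketch shows only the automaton being run, not the validator.

```python
# Hypothetical toy grammar (not from the paper):
#   rule 1: S -> "(" S ")"
#   rule 2: S -> "x"
# PRODUCTIONS maps a rule number to (left-hand side, length of right-hand side).
PRODUCTIONS = {1: ("S", 3), 2: ("S", 1)}

# Hand-built tables for the grammar above. The key is (state, lookahead).
ACTION = {
    (0, "("): ("shift", 2), (0, "x"): ("shift", 3),
    (1, "$"): ("accept", None),
    (2, "("): ("shift", 2), (2, "x"): ("shift", 3),
    (3, ")"): ("reduce", 2), (3, "$"): ("reduce", 2),
    (4, ")"): ("shift", 5),
    (5, ")"): ("reduce", 1), (5, "$"): ("reduce", 1),
}
GOTO = {(0, "S"): 1, (2, "S"): 4}


def parse(tokens):
    """Return True if the token sequence is accepted by the toy automaton."""
    stack = [0]                      # stack of automaton states
    stream = list(tokens) + ["$"]    # "$" marks end of input
    pos = 0
    while True:
        state, lookahead = stack[-1], stream[pos]
        action = ACTION.get((state, lookahead))
        if action is None:           # no table entry: syntax error
            return False
        kind, arg = action
        if kind == "shift":
            stack.append(arg)        # push the target state, consume the token
            pos += 1
        elif kind == "reduce":
            lhs, rhs_len = PRODUCTIONS[arg]
            del stack[-rhs_len:]     # pop one state per right-hand-side symbol
            stack.append(GOTO[(stack[-1], lhs)])
        else:                        # accept
            return True
```

With these tables, `parse("((x))")` returns `True`, while `parse("(x")` and `parse("x)")` return `False`. A validator in the sense of the abstract would check tables like `ACTION` and `GOTO` against the grammar rather than trusting whoever built them.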

Crossref

INRIA a CCSD electronic archive server

HAL-Rennes 1

TRX: A Formally Verified Parser Interpreter

Author: Adam Koprowski
Andrew Gordon
Gerald J. Sussman and Guy L. Steele Jr
Graham Hutton
Henri Binsztok
Luís Cruz-Filipe and Pierre Letouzey
Roman R. Redziejowski
Xavier Leroy
Publication venue: 'Logical Methods in Computer Science e.V.'
Publication date: 01/01/2011

Parsing is an important problem in computer science and yet surprisingly little attention has been devoted to its formal verification. In this paper, we present TRX: a parser interpreter formally developed in the proof assistant Coq, capable of producing formally correct parsers. We use parsing expression grammars (PEGs), a formalism essentially representing recursive descent parsing, which we consider an attractive alternative to context-free grammars (CFGs). From this formalization we can extract a parser for an arbitrary PEG grammar with the guarantee of total correctness, i.e., the resulting parser is terminating and correct with respect to its grammar and the semantics of PEGs; both properties are formally proven in Coq. Comment: 26 pages, LMC
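As an illustration of what "PEGs essentially represent recursive descent parsing" means, the following is a minimal sketch of a PEG interpreter with ordered choice. The tuple encoding of expressions and the function name `match` are assumptions made for this example and are unrelated to TRX's actual Coq development. The sketch performs no well-formedness check, so a left-recursive rule would recurse forever; it carries none of the termination or correctness guarantees the abstract describes.

```python
def match(expr, grammar, text, pos):
    """Try to match expr at text[pos:]; return the end position or None.

    Expressions are tuples (a hypothetical encoding):
      ("lit", s)          literal string
      ("seq", e1, e2...)  sequence
      ("choice", e1...)   ordered choice: first success wins
      ("star", e)         greedy repetition, zero or more
      ("not", e)          negative lookahead, consumes nothing
      ("ref", name)       reference to a rule in grammar
    """
    kind = expr[0]
    if kind == "lit":
        s = expr[1]
        return pos + len(s) if text.startswith(s, pos) else None
    if kind == "seq":
        for sub in expr[1:]:
            pos = match(sub, grammar, text, pos)
            if pos is None:
                return None
        return pos
    if kind == "choice":
        for sub in expr[1:]:
            end = match(sub, grammar, text, pos)
            if end is not None:
                return end           # commit to the first alternative that succeeds
        return None
    if kind == "star":
        while True:
            end = match(expr[1], grammar, text, pos)
            if end is None or end == pos:   # stop on failure or no progress
                return pos
            pos = end
    if kind == "not":
        return pos if match(expr[1], grammar, text, pos) is None else None
    if kind == "ref":
        return match(grammar[expr[1]], grammar, text, pos)
    raise ValueError(f"unknown expression kind: {kind}")


# Hypothetical example grammar: S <- "(" S ")" / "x"
GRAMMAR = {
    "S": ("choice",
          ("seq", ("lit", "("), ("ref", "S"), ("lit", ")")),
          ("lit", "x")),
}
```

Ordered choice is what separates PEGs from CFGs: matching `("choice", ("lit", "a"), ("lit", "ab"))` against `"ab"` stops after one character because the first alternative succeeds, whereas swapping the alternatives consumes both characters.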

arXiv.org e-Print Archive

CiteSeerX

Crossref

Episciences.org

Directory of Open Access Journals

Morpheus: Automated Safety Verification of Data-Dependent Parser Combinator Programs

Author: Jagannathan Suresh
Mishra Ashish
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 37th European Conference on Object-Oriented Programming (ECOOP 2023)
Publication date: 01/01/2023

Parser combinators are a well-known mechanism used for the compositional construction of parsers, and have been shown to be particularly useful in writing parsers for rich grammars with data-dependencies and global state. Verifying applications written using them, however, has proven to be challenging in large part because of the inherently effectful nature of the parsers being composed and the difficulty in reasoning about the arbitrarily rich data-dependent semantic actions that can be associated with parsing actions. In this paper, we address these challenges by defining a parser combinator framework called Morpheus equipped with abstractions for defining composable effects tailored for parsing and semantic actions, and a rich specification language used to define safety properties over the constituent parsers comprising a program. Even though its abstractions yield many of the same expressivity benefits as other parser combinator systems, Morpheus is carefully engineered to yield a substantially more tractable automated verification pathway. We demonstrate its utility in verifying a number of realistic, challenging parsing applications, including several cases that involve non-trivial data-dependent relations.
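The phrase "data-dependent" can be illustrated with a small combinator sketch in which the result of one parser decides what the next parser is. Everything below (`char_if`, `pure`, `bind`, `count`, `length_prefixed`) is a hypothetical mini-library written for this example; it is not Morpheus's API and includes no specification or verification layer.

```python
# A parser here is a function (text, pos) -> (value, new_pos), or None on failure.

def char_if(pred):
    """Parser that consumes one character satisfying pred."""
    def p(text, pos):
        if pos < len(text) and pred(text[pos]):
            return text[pos], pos + 1
        return None
    return p


def pure(value):
    """Parser that consumes nothing and yields value."""
    return lambda text, pos: (value, pos)


def bind(parser, f):
    """Run parser, then run the parser f(value) built from its result."""
    def p(text, pos):
        result = parser(text, pos)
        if result is None:
            return None
        value, next_pos = result
        return f(value)(text, next_pos)
    return p


def count(n, parser):
    """Run parser exactly n times and collect the results in a list."""
    def p(text, pos):
        values = []
        for _ in range(n):
            result = parser(text, pos)
            if result is None:
                return None
            value, pos = result
            values.append(value)
        return values, pos
    return p


digit = bind(char_if(lambda c: c in "0123456789"), lambda c: pure(int(c)))
letter = char_if(str.isalpha)

# Data dependency: the digit that was just parsed fixes how many letters follow.
length_prefixed = bind(
    digit,
    lambda n: bind(count(n, letter), lambda cs: pure("".join(cs))),
)
```

Here `length_prefixed("3abcd", 0)` yields `("abc", 4)`, and `length_prefixed("3ab", 0)` fails with `None` because only two letters follow the prefix. The grammar accepted after the digit depends on a runtime value, which is the kind of relation that makes such parsers hard to reason about statically.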

Dagstuhl Research Online Publication Server

Because Syntax does Matter: Improving Predicate-Argument Structures Parsing Using Syntactic Features

Author: Ribeyre Corentin
Seddah Djamé
Villemonte de La Clergerie Éric
Publication venue: HAL CCSD
Publication date: 01/01/2015

Parsing full-fledged predicate-argument structures in a deep syntax framework requires graphs to be predicted. Using the DeepBank (Flickinger et al., 2012) and the Predicate-Argument Structure treebank (Miyao and Tsujii, 2005) as a test field, we show how transition-based parsers, extended to handle connected graphs, benefit from the use of topologically different syntactic features such as dependencies, tree fragments, spines or syntactic paths, bringing a much needed context to the parsing models, improving notably over long distance dependencies and elided coordinate structures. By confirming this positive impact on an accurate 2nd-order graph-based parser (Martins and Almeida, 2014), we establish a new state-of-the-art on these data sets.

Crossref

INRIA a CCSD electronic archive server

Hal-Diderot

A Verified LL(1) Parser Generator

Author: Casinghino Chris
Fisher Kathleen
Lasser Sam
Roux Cody
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 10th International Conference on Interactive Theorem Proving (ITP 2019)
Publication date: 01/01/2019

An LL(1) parser is a recursive descent algorithm that uses a single token of lookahead to build a grammatical derivation for an input sequence. We present an LL(1) parser generator that, when applied to grammar G, produces an LL(1) parser for G if such a parser exists. We use the Coq Proof Assistant to verify that the generator and the parsers that it produces are sound and complete, and that they terminate on all inputs without using fuel parameters. As a case study, we extract the tool's source code and use it to generate a JSON parser. The generated parser runs in linear time; it is two to four times slower than an unverified parser for the same grammar.
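The single-token lookahead described in the abstract can be sketched as a table-driven predictive parser. The balanced-parentheses grammar, the hand-written `TABLE`, and the function name `parse` are assumptions for illustration only; the generator in the paper computes such a table from the grammar, which this sketch does not attempt, and this recognizer only accepts or rejects rather than building a derivation.

```python
# Hypothetical toy grammar (not from the paper): S -> "(" S ")" S | empty
NONTERMINALS = {"S"}

# Hand-written LL(1) table: (nonterminal, lookahead) -> right-hand side to expand.
TABLE = {
    ("S", "("): ["(", "S", ")", "S"],
    ("S", ")"): [],      # expand S to the empty string
    ("S", "$"): [],
}


def parse(tokens):
    """Return True if tokens form a balanced string of parentheses."""
    stream = list(tokens) + ["$"]    # "$" marks end of input
    stack = ["$", "S"]               # symbols still to be matched, top at the end
    pos = 0
    while stack:
        top = stack.pop()
        lookahead = stream[pos]
        if top in NONTERMINALS:
            rhs = TABLE.get((top, lookahead))
            if rhs is None:          # no prediction for this lookahead
                return False
            stack.extend(reversed(rhs))   # leftmost symbol ends up on top
        elif top == lookahead:
            pos += 1                 # terminal matches: consume it
        else:
            return False
    return pos == len(stream)
```

For instance, `parse("(())()")` returns `True` and `parse("(()")` returns `False`. Each token is consumed exactly once and every table lookup is constant time, which gives an intuition for the linear running time mentioned in the abstract.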

Dagstuhl Research Online Publication Server

Arguing security: validating security requirements using structured argumentation

Author: Haley Charles B.
Laney Robin
Moffett Jonathan D.
Nuseibeh Bashar
Publication date: 01/01/2005

This paper proposes using both formal and structured informal arguments to show that an eventual realized system can satisfy its security requirements. These arguments, called 'satisfaction arguments', consist of two parts: a formal argument based upon claims about domain properties, and a set of informal arguments that justify the claims. Building on our earlier work on trust assumptions and security requirements, we show how using satisfaction arguments assists in clarifying how a system satisfies its security requirements, in the process identifying those properties of domains that are critical to the requirements.

CiteSeerX

Open Research Online (The Open University)

Foundations of fast communication via XML

Author: Gaul Thilo
Loewe Welf
Noga Markus L.
Publication date: 02/08/2007

Communication with XML often involves pre-agreed document types. In this paper, we propose an offline parser generation approach to enhance online processing performance for documents conforming to a given DTD. Our examination of DTDs and the languages they define demonstrates the existence of ambiguities. We present an algorithm that maps DTDs to deterministic context-free grammars defining the same languages. We prove the grammars to be LL(1) and LALR(1), making them suitable for standard parser generators. Our experiments show the superior performance of generated optimized parsers. Our results generalize from DTDs to XML Schema specifications with certain restrictions, most notably the absence of namespaces, which exceed the scope of context-free grammars.

KITopen

CLARIN: Common language resources and technology infrastructure

Author: Koskenniemi K.
Krauwer S.
Váradi T.
Wittenburg P.
Wynne M.
Publication date: 01/01/2008

This paper gives an overview of the CLARIN project [1], which aims to create a research infrastructure that makes language resources and technology (LRT) available and readily usable to scholars of all disciplines, in particular the humanities and social sciences (HSS).

MPG.PuRe