
    Meta-F*: Proof Automation with SMT, Tactics, and Metaprograms

    We introduce Meta-F*, a tactics and metaprogramming framework for the F* program verifier. The main novelty of Meta-F* is to allow tactics and metaprogramming to discharge assertions not solvable by SMT, or to simplify them into well-behaved SMT fragments. Additionally, Meta-F* can be used to generate verified code automatically. Meta-F* is implemented as an F* effect, which, given the powerful effect system of F*, heavily increases code reuse and even enables lightweight verification of metaprograms. Metaprograms can be either interpreted, or compiled to efficient native code that can be dynamically loaded into the F* type-checker and can interoperate with interpreted code. Evaluation on realistic case studies shows that Meta-F* provides substantial gains in proof development, efficiency, and robustness. (Full version of the ESOP'19 paper.)

    How functional programming mattered

    In 1989, when functional programming was still considered a niche topic, Hughes wrote a visionary paper arguing convincingly ‘why functional programming matters’. More than two decades have passed. Has functional programming really mattered? Our answer is a resounding ‘Yes!’. Functional programming is now at the forefront of a new generation of programming technologies, enjoying increasing popularity and influence. In this paper, we review the impact of functional programming, focusing on how it has changed the way we may construct programs, the way we may verify programs, and, fundamentally, the way we may think about programs.

    Accelerating parser combinators with macros

    Parser combinators provide an elegant way of writing parsers: parser implementations closely follow the structure of the underlying grammar, while accommodating interleaved host language code for data processing. However, the host language features used for composition introduce substantial overhead, which leads to poor performance. In this paper, we present a technique to systematically eliminate this overhead. We use Scala macros to analyse the grammar specification at compile-time and remove composition, leaving behind an efficient top-down, recursive-descent parser. We compare our macro-based approach to a staging-based approach using the LMS framework, and provide an experience report in which we discuss the advantages and drawbacks of both methods. Our library outperforms Scala's standard parser combinators on a set of benchmarks by an order of magnitude, and is 2x faster than code generated by LMS.
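
    A minimal sketch of the parser-combinator style the paper starts from (hypothetical code, not the library evaluated in the paper): each grammar rule becomes an ordinary Scala value, so the parser mirrors the grammar, but every combinator call allocates closures and tuples, which is exactly the composition overhead the macro-based approach removes at compile time.

        // Hypothetical minimal combinators; names and representation are
        // illustrative only.
        object Combinators {
          type Parser[A] = String => Option[(A, String)]

          def digit: Parser[Char] = in =>
            if (in.nonEmpty && in.head.isDigit) Some((in.head, in.tail)) else None

          // Sequencing: run p, then q on the remaining input.
          def seq[A, B](p: Parser[A], q: Parser[B]): Parser[(A, B)] = in =>
            p(in).flatMap { case (a, rest) =>
              q(rest).map { case (b, out) => ((a, b), out) }
            }

          // Zero-or-more repetition.
          def many[A](p: Parser[A]): Parser[List[A]] = in =>
            p(in) match {
              case Some((a, rest)) =>
                many(p)(rest).map { case (as, out) => (a :: as, out) }
              case None => Some((Nil, in))
            }

          // number ::= digit digit*  -- the code follows the grammar rule.
          val number: Parser[String] = in =>
            seq(digit, many(digit))(in).map { case ((d, ds), out) =>
              ((d :: ds).mkString, out)
            }

          def main(args: Array[String]): Unit =
            println(number("42abc")) // Some((42,abc))
        }

    The closures and tuples built by seq and many on every call are what a compile-time analysis can inline away, leaving straight-line recursive-descent code.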

    Parsing for agile modeling

    Agile modeling refers to a set of methods that allow for the quick initial development of an importer and its further refinement. These requirements are not met simultaneously by current parsing technology. Problems with parsing became a bottleneck in our research on agile modeling. In this thesis we introduce a novel approach to specifying and building parsers. Our approach allows for expressive, tolerant, and composable parsers without sacrificing performance. The approach is based on a context-sensitive extension of parsing expression grammars that allows a grammar engineer to specify complex language restrictions. To ensure high parsing performance, we automatically analyze a grammar definition and choose different parsing strategies for different parts of the grammar. We show that context-sensitive parsing expression grammars allow for highly composable, tolerant, and variable-grained parsers that can be easily refined. Choosing different parsing strategies ensures high performance without sacrificing the expressiveness of the underlying grammars.
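
    A hedged sketch of what a context-sensitive extension of parsing expression grammars can express (all names hypothetical, assuming a parse state threaded through rules): the state carries a context stack, so a rule can impose restrictions, such as indentation nesting, that plain PEGs cannot.

        // Hypothetical context-sensitive PEG core: rules thread a state
        // carrying an indentation context stack.
        object ContextSensitivePeg {
          final case class State(input: String, pos: Int, indents: List[Int])
          type Rule[A] = State => Option[(A, State)]

          // Count leading spaces at the current position.
          private def column(s: State): Int =
            s.input.drop(s.pos).takeWhile(_ == ' ').length

          // push/pop manage the context that a plain PEG cannot express.
          def pushIndent: Rule[Unit] =
            s => Some(((), s.copy(indents = column(s) :: s.indents)))
          def popIndent: Rule[Unit] =
            s => Some(((), s.copy(indents = s.indents.drop(1))))

          // Succeeds only when the current line is indented deeper than
          // the enclosing block -- a context-sensitive restriction.
          def deeper: Rule[Unit] = s =>
            if (column(s) > s.indents.headOption.getOrElse(-1)) Some(((), s))
            else None
        }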

    Specialising Parsers for Queries

    Many software systems consist of data processing components that analyse large datasets to gather information and learn from them. Often, only part of the data is relevant for analysis. Data processing systems therefore contain an initial preprocessing step that filters out the unwanted information. While efficient data analysis techniques and methodologies are accessible to non-expert programmers, data preprocessing seems to be forgotten, or worse, ignored, despite the real performance gains that efficient preprocessing makes possible. Implementations of the data preprocessing step traditionally have to trade modularity for performance: to achieve the former, one separates the parsing of raw data from filtering it, which leads to slow programs because of the intermediate objects created during execution. The efficient version is a low-level implementation that interleaves parsing and querying. In this dissertation we demonstrate a principled and practical technique to convert the modular, maintainable program into its interleaved, efficient counterpart. Key to achieving this objective is the removal, or deforestation, of intermediate objects in a program's execution. We first show that by encoding data types using Böhm-Berarducci encodings (often referred to as Church encodings), and combining these with partial evaluation for function composition, we achieve deforestation. This allows us to implement optimisations themselves as libraries, with minimal dependence on an underlying optimising compiler. Next we illustrate the applicability of this approach to parsing and preprocessing queries. The approach is general enough to cover top-down and bottom-up parsing techniques, and the deforestation of pipelines of operations on lists and streams. We finally present a set of transformation rules that, for a parser on a nested data format and a query on the structure, produce a parser specialised for the query. As a result we preserve the modularity of writing parsers and queries separately while also minimising resource usage. These transformation rules combine deforested implementations of both libraries to yield an efficient, interleaved result.
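
    A small sketch of the Böhm-Berarducci encoding mentioned above (hypothetical Scala, not the dissertation's code): a list is represented by its own fold, so a map-then-sum pipeline fuses into a single traversal and never allocates the intermediate list, which is the deforestation effect described here.

        // A Böhm-Berarducci (Church) encoded list: the list *is* its fold.
        object ChurchList {
          trait CList[A] { def fold[R](cons: (A, R) => R, nil: R): R }

          def fromList[A](xs: List[A]): CList[A] = new CList[A] {
            def fold[R](cons: (A, R) => R, nil: R): R = xs.foldRight(nil)(cons)
          }

          // map allocates no intermediate list: it only rewrites "cons".
          def map[A, B](xs: CList[A])(f: A => B): CList[B] = new CList[B] {
            def fold[R](cons: (B, R) => R, nil: R): R =
              xs.fold[R]((a, r) => cons(f(a), r), nil)
          }

          def sum(xs: CList[Int]): Int = xs.fold[Int](_ + _, 0)

          def main(args: Array[String]): Unit = {
            // map then sum runs as one traversal (deforestation).
            println(sum(map(fromList(List(1, 2, 3)))(_ * 2))) // 12
          }
        }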

    LMS-Verify: abstraction without regret for verified systems programming

    Performance-critical software is almost always developed in C, as programmers do not trust high-level languages to deliver the same reliable performance. This is bad because low-level code in unsafe languages attracts security vulnerabilities and because development is far less productive, with PL advances mostly lost on programmers operating under tight performance constraints. High-level languages provide memory safety out of the box, but they are deemed too slow and unpredictable for serious system software. Recent years have seen a surge in staging and generative programming: the key idea is to use high-level languages and their abstraction power as glorified macro systems to compose code fragments in first-order, potentially domain-specific, intermediate languages, from which fast C can be emitted. But what about security? Since the end result is still C code, the safety guarantees of the high-level host language are lost. In this paper, we extend this generative approach to emit ACSL specifications along with C code. We demonstrate that staging achieves "abstraction without regret" for verification: we show how high-level programming models, in particular higher-order composable contracts from dynamic languages, can be used at generation time to compose and generate first-order specifications that can be statically checked by existing tools. We also show how type classes can automatically attach invariants to data types, reducing the need for repetitive manual annotations. We evaluate our system on several case studies that varyingly exercise verification of memory safety, overflow safety, and functional correctness. We feature an HTTP parser that is (1) fast, (2) high-level: implemented using staged parser combinators, and (3) secure: with verified memory safety. This result is significant, as input parsing is a key attack vector, and vulnerabilities related to HTTP parsing have been documented in all widely-used web servers.
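
    A hedged illustration of the generative idea (invented names, not the LMS-Verify API): contracts are composed as ordinary host-language values at generation time and are flattened into first-order, ACSL-style annotations emitted alongside the C code they govern.

        // Hypothetical generator: high-level "contracts" become ACSL text.
        object SpecGen {
          // A staged C function: name, gathered preconditions, body text.
          final case class CFun(name: String, requires: List[String], body: String)

          // Contract fragments composed in the host language...
          def nonNull(p: String): String = s"\\valid($p)"
          def inBounds(i: String, n: String): String = s"0 <= $i < $n"

          // ...flattened into a first-order spec at generation time.
          def emit(f: CFun): String = {
            val spec = f.requires.map(r => s"  requires $r;").mkString("\n")
            s"/*@\n$spec\n*/\nint ${f.name}(char *buf, int i, int n) {\n${f.body}\n}"
          }

          def main(args: Array[String]): Unit = {
            val get = CFun("get", List(nonNull("buf"), inBounds("i", "n")),
                           "  return buf[i];")
            // The output is plain C plus an ACSL contract that a static
            // checker such as Frama-C can verify.
            println(emit(get))
          }
        }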

    A Functional Implementation of a Multiway Dataflow Constraint System Library

    Master's thesis in Software Development, in collaboration with HVL (PROG399, MAMN-PRO).

    Protecting Systems From Exploits Using Language-Theoretic Security

    Any computer program processing input from the user or network must validate that input. Input-handling vulnerabilities occur when the software component responsible for filtering malicious input (the parser) does not perform validation adequately. Consequently, parsers are among the most targeted components, since they defend the rest of the program from malicious input. This thesis adopts the Language-Theoretic Security (LangSec) principle to understand what tools and research are needed to prevent exploits that target parsers. LangSec proposes specifying the syntactic structure of the input format as a formal grammar; we then build a recognizer for this formal grammar to validate any input before the rest of the program acts on it. To ensure that these recognizers faithfully represent the data format, programmers often rely on parser generator or parser combinator tools to build them. This thesis advances several sub-fields of LangSec by proposing new techniques to find bugs in implementations, novel categorizations of vulnerabilities, and new parsing algorithms and tools to handle practical data formats. To this end, the thesis comprises five parts that tackle various tenets of LangSec. First, I categorize input-handling vulnerabilities and exploits using two frameworks: the mismorphisms framework, which helps us reason about the root causes leading to various vulnerabilities, and a categorization framework built from LangSec anti-patterns, such as parser differentials and insufficient input validation. We then built a catalog of more than 30 popular vulnerabilities to demonstrate the categorization frameworks. Second, I built parsers for various Internet of Things and power grid network protocols and for the iccMAX file format using parser combinator libraries. The parsers I built for power grid protocols were deployed and tested on power grid substation networks as an intrusion detection tool; the parser I built for the iccMAX file format led to several corrections and modifications to the iccMAX specifications and reference implementations. Third, I present SPARTA, a novel tool I built that generates Rust code to type-check Portable Document Format (PDF) files. The type checker I helped build strictly enforces the constraints in the PDF specification to find deviations; it has contributed to at least four significant clarifications and corrections to the PDF 2.0 specification and to various open-source PDF tools. In addition to the checker, we also built a practical tool, PDFFixer, to dynamically patch type errors in PDF files. Fourth, I present ParseSmith, a tool for building verified parsers for real-world data formats. Most parsing tools available for data formats are insufficient to handle practical formats or have not been verified for correctness. I built a verified parsing tool in Dafny that draws on ideas from attribute grammars, data-dependent grammars, and parsing expression grammars to tackle constructs commonly seen in network formats, and I prove that the resulting parsers run in linear time and always terminate for well-formed grammars. Finally, I provide the first systematic comparison of various data description languages (DDLs) and their parser generation tools. DDLs are used to describe and parse commonly used data formats, such as image formats. I conducted an expert elicitation qualitative study to derive metrics for comparing the DDLs, and I systematically compare them based on the sample data descriptions available with each DDL, checking for correctness and resilience.
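
    A minimal sketch of the LangSec discipline the thesis builds on (toy format, hypothetical names): a full recognizer for a length-prefixed record format runs at the input boundary, and the rest of the program only ever sees values that the recognizer accepted.

        // Recognize "tag (1 byte) | length (1 byte) | payload (length bytes)".
        object LangSecGate {
          final case class Record(tag: Byte, payload: Array[Byte])

          // The recognizer insists the entire input matches the grammar;
          // partial matches and trailing bytes are rejected outright.
          def recognize(in: Array[Byte]): Option[Record] =
            if (in.length < 2) None
            else {
              val len = in(1) & 0xff
              if (in.length != 2 + len) None
              else Some(Record(in(0), in.drop(2)))
            }

          def main(args: Array[String]): Unit =
            recognize(Array[Byte](1, 3, 10, 20, 30)) match {
              case Some(r) => println(s"ok: tag=${r.tag}, ${r.payload.length} bytes")
              case None    => println("rejected at the parser boundary")
            }
        }

    Keeping the recognizer total over its input, rather than scattering validation through the business logic, is the property that makes parser differentials and insufficient-validation bugs easier to rule out.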

    Compiling a domain specific language for dynamic programming

    Steffen P. Compiling a domain specific language for dynamic programming. Bielefeld (Germany): Bielefeld University; 2006.