375 research outputs found

Learning-Assisted Automated Reasoning with Flyspeck

Author: Kaliszyk Cezary
Urban Josef
Publication date: 01/01/2014

The considerable mathematical knowledge encoded by the Flyspeck project is combined with external automated theorem provers (ATPs) and machine-learning premise selection methods trained on the proofs, producing an AI system capable of answering a wide range of mathematical queries automatically. The performance of this architecture is evaluated in a bootstrapping scenario emulating the development of Flyspeck from axioms to the last theorem, each time using only the previous theorems and proofs. It is shown that 39% of the 14185 theorems could be proved in a push-button mode (without any high-level advice and user interaction) in 30 seconds of real time on a fourteen-CPU workstation. The necessary work involves: (i) an implementation of sound translations of the HOL Light logic to ATP formalisms: untyped first-order, polymorphic typed first-order, and typed higher-order, (ii) export of the dependency information from HOL Light and ATP proofs for the machine learners, and (iii) choice of suitable representations and methods for learning from previous proofs, and their integration as advisors with HOL Light. This work is described and discussed here, and an initial analysis of the body of proofs that were found fully automatically is provided.

arXiv.org e-Print Archive

Springer - Publisher Connector

Radboud Repository

HOL(y)Hammer: Online ATP Service for HOL Light

Author: Kaliszyk Cezary
Urban Josef
Publication date: 19/09/2013

HOL(y)Hammer is an online AI/ATP service for formal (computer-understandable) mathematics encoded in the HOL Light system. The service allows its users to upload and automatically process an arbitrary formal development (project) based on HOL Light, and to attack arbitrary conjectures that use the concepts defined in some of the uploaded projects. For that, the service uses several automated reasoning systems combined with several premise selection methods trained on all the project proofs. The projects that are readily available on the server for such query answering include the recent versions of the Flyspeck, Multivariate Analysis and Complex Analysis libraries. The service runs on a 48-CPU server, currently employing in parallel for each task 7 AI/ATP combinations and 4 decision procedures that contribute to its overall performance. The system is also available for local installation by interested users, who can customize it for their own proof development. An Emacs interface allowing parallel asynchronous queries to the service is also provided. The overall structure of the service is outlined, problems that arise and their solutions are discussed, and an initial account of using the system is given.

arXiv.org e-Print Archive

Springer - Publisher Connector

Premise Selection and External Provers for HOL4

Author: Farmer W. M.
Gauthier T.
Harrison J.
Kaliszyk C.
Paulson L. C.
Schulz S.
Sutcliffe G.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2015

Learning-assisted automated reasoning has recently gained popularity among the users of Isabelle/HOL, HOL Light, and Mizar. In this paper, we present an add-on to the HOL4 proof assistant and an adaptation of the HOLyHammer system that provides machine learning-based premise selection and automated reasoning also for HOL4. We efficiently record the HOL4 dependencies and extract features from the theorem statements, which form a basis for premise selection. HOLyHammer transforms the HOL4 statements into the various TPTP-ATP proof formats, which are then processed by the ATPs. We discuss the different evaluation settings: ATPs, accessible lemmas, and premise numbers. We measure the performance of HOLyHammer on the HOL4 standard library. The results are combined accordingly and compared with the HOL Light experiments, showing a comparably high quality of predictions. The system directly benefits HOL4 users by automatically finding proof dependencies that can be reconstructed by Metis.

arXiv.org e-Print Archive

CiteSeerX

Crossref

Initial Experiments with TPTP-style Automated Theorem Provers on ACL2 Problems

Author: Joosten Sebastiaan
Kaliszyk Cezary
Urban Josef
Publication venue: 'Open Publishing Association'
Publication date: 01/01/2014

This paper reports our initial experiments with using external ATP on some corpora built with the ACL2 system. This is intended to provide the first estimate about the usefulness of such external reasoning and AI systems for solving ACL2 problems. Comment: In Proceedings ACL2 2014, arXiv:1406.123

arXiv.org e-Print Archive

Repository TU/e

Pure OAI Repository

Directory of Open Access Journals

Radboud Repository

More SPASS with Isabelle: superposition with hard sorts and configurable simplification

Author: Blanchette J.
Popescu A.
Wand D.
Weidenbach C.
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/2012

Sledgehammer for Isabelle/HOL integrates automatic theorem provers to discharge interactive proof obligations. This paper considers a tighter integration of the superposition prover SPASS to increase Sledgehammer’s success rate. The main enhancements are native support for hard sorts (simple types) in SPASS, simplification that honors the orientation of Isabelle simp rules, and a pair of clause-selection strategies targeted at large lemma libraries. The usefulness of this integration is confirmed by an evaluation on a vast benchmark suite and by a case study featuring a formalization of language-based security.

Middlesex University Research Repository

GRUNGE: A Grand Unified ATP Challenge

Author: A Church
A Steen
C Barrett
C Benzmüller
C Kaliszyk
C Weidenbach
CE Brown
D King
David Delahaye
F Lindblad
G Burel
G Sutcliffe
J Alama
J Harrison
J Hurd
J Meng
JC Blanchette
John Harrison
K Korovin
K Slind
L Kovács
P Baumgartner
P Rümmer
S Böhme
Stephan Schulz
T Weber
Y Xu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 19/11/2019

This paper describes a large set of related theorem proving problems obtained by translating theorems from the HOL4 standard library into multiple logical formalisms. The formalisms are in higher-order logic (with and without type variables) and first-order logic (possibly with multiple types, and possibly with type variables). The resultant problem sets allow us to run automated theorem provers that support different logical formats on corresponding problems, and compare their performances. This also results in a new "grand unified" large theory benchmark that emulates the ITP/ATP hammer setting, where systems and metasystems can use multiple ATP formalisms in complementary ways, and jointly learn from the accumulated knowledge. Comment: CADE 27 -- 27th International Conference on Automated Deduction

arXiv.org e-Print Archive

Crossref

Hammering towards QED

Author: Blanchette JC
Kaliszyk C
Paulson LC
Urban J
Publication venue: Journal of Formalized Reasoning
Publication date: 01/01/2016

This paper surveys the emerging methods to automate reasoning over large libraries developed with formal proof assistants. We call these methods hammers. They give the authors of formal proofs a strong “one-stroke” tool for discharging difficult lemmas without the need for careful and detailed manual programming of proof search. The main ingredients underlying this approach are efficient automatic theorem provers that can cope with hundreds of axioms, suitable translations of the proof assistant’s logic to the logic of the automatic provers, heuristic and learning methods that select relevant facts from large libraries, and methods that reconstruct the automatically found proofs inside the proof assistants. We outline the history of these methods, explain the main issues and techniques, and show their strength on several large benchmarks. We also discuss the relation of this technology to the QED Manifesto and consider its implications for QED-like efforts. Blanchette’s Sledgehammer research was supported by the Deutsche Forschungsgemeinschaft projects Quis Custodiet (grants NI 491/11-1 and NI 491/11-2) and Hardening the Hammer (grant NI 491/14-1). Kaliszyk is supported by the Austrian Science Fund (FWF) grant P26201. Sledgehammer was originally supported by the UK’s Engineering and Physical Sciences Research Council (grant GR/S57198/01). Urban’s work was supported by the Marie-Curie Outgoing International Fellowship project AUTOKNOMATH (grant MOIF-CT-2005-21875) and by the Netherlands Organisation for Scientific Research (NWO) project Knowledge-based Automated Reasoning (grant 612.001.208). This is the final published version. It first appeared at http://jfr.unibo.it/article/view/4593/5730?acceptCookies=1

Radboud Repository

Apollo (Cambridge)

Automation for interactive proof: First prototype

Author: Meng Jia
Paulson Lawrence C.
Quigley Claire
Publication venue: Elsevier Inc.
Publication date: 31/10/2006

Interactive theorem provers require too much effort from their users. We have been developing a system in which Isabelle users obtain automatic support from automatic theorem provers (ATPs) such as Vampire and SPASS. An ATP is invoked at suitable points in the interactive session, and any proof found is given to the user in a window displaying an Isar proof script. There are numerous differences between Isabelle (polymorphic higher-order logic with type classes, natural deduction rule format) and classical ATPs (first-order, untyped, and clause form). Many of these differences have been bridged, and a working prototype that uses background processes already provides much of the desired functionality.

Elsevier - Publisher Connector

Supplemental Information 3: Raw data

Author: Aljanabi
Balloux
Bashalkhanov
Castoe
Cleary
Crawford
Dawson
Evelyn
Excoffier
Faircloth
Gannon
Jan
Kalinowski
Korstian
Lee
Lewis
Li
McCulloch
Molinari
Olea-Wagner
Ortega
Piaggio
Romero-Nava
Rossiter
Rousset
Schlötterer
Selkoe
Van Oosterhout
Velazco
Velazco
Villalobos
Zane
Publication venue: 'PeerJ'

Crossref