Search CORE

32 research outputs found

MizAR 60 for Mizar 50

Author: Goertzel Zarathustra
Jakub?v Jan
Kaliszyk Cezary
Piotrowski Bartosz
Schulz Stephan
Suda Martin
Urban Josef
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 14th International Conference on Interactive Theorem Proving (ITP 2023)
Publication date: 01/01/2023
Field of study

As a present to Mizar on its 50th anniversary, we develop an AI/TP system that automatically proves about 60% of the Mizar theorems in the hammer setting. We also automatically prove 75% of the Mizar theorems when the automated provers are helped by using only the premises used in the human-written Mizar proofs. We describe the methods and large-scale experiments leading to these results. This includes in particular the E and Vampire provers, their ENIGMA and Deepire learning modifications, a number of learning-based premise selection methods, and the incremental loop that interleaves growing a corpus of millions of ATP proofs with training increasingly strong AI/TP systems on them. We also present a selection of Mizar problems that were proved automatically

Dagstuhl Research Online Publication Server

ENIGMA: Efficient Learning-based Inference Guiding Machine

Author: C Kaliszyk
C Kaliszyk
C Kaliszyk
C Lin
D Kühlwein
G Sutcliffe
J Otten
J Urban
J Urban
JC Blanchette
JC Blanchette
L Kovács
M Kinyon
R Fan
S Schulz
T Gransden
Publication venue
Publication date: 23/01/2017
Field of study

ENIGMA is a learning-based method for guiding given clause selection in saturation-based theorem provers. Clauses from many proof searches are classified as positive and negative based on their participation in the proofs. An efficient classification model is trained on this data, using fast feature-based characterization of the clauses . The learned model is then tightly linked with the core prover and used as a basis of a new parameterized evaluation heuristic that provides fast ranking of all generated clauses. The approach is evaluated on the E prover and the CASC 2016 AIM benchmark, showing a large increase of E's performance.Comment: Submitted to LPAR 201

arXiv.org e-Print Archive

Crossref

Hammering towards QED

Author: Blanchette JC
Kaliszyk C
Paulson LC
Urban J
Publication venue: Journal of Formalized Reasoning
Publication date: 01/01/2016
Field of study

This paper surveys the emerging methods to automate reasoning over large libraries developed with formal proof assistants. We call these methods hammers. They give the authors of formal proofs a strong “one-stroke” tool for discharging difficult lemmas without the need for careful and detailed manual programming of proof search. The main ingredients underlying this approach are efficient automatic theorem provers that can cope with hundreds of axioms, suitable translations of the proof assistant’s logic to the logic of the automatic provers, heuristic and learning methods that select relevant facts from large libraries, and methods that reconstruct the automatically found proofs inside the proof assistants. We outline the history of these methods, explain the main issues and techniques, and show their strength on several large benchmarks. We also discuss the relation of this technology to the QED Manifesto and consider its implications for QED-like efforts.Blanchette’s Sledgehammer research was supported by the Deutsche Forschungs- gemeinschaft projects Quis Custodiet (grants NI 491/11-1 and NI 491/11-2) and Hardening the Hammer (grant NI 491/14-1). Kaliszyk is supported by the Austrian Science Fund (FWF) grant P26201. Sledgehammer was originally supported by the UK’s Engineering and Physical Sciences Research Council (grant GR/S57198/01). Urban’s work was supported by the Marie-Curie Outgoing International Fellowship project AUTOKNOMATH (grant MOIF-CT-2005-21875) and by the Netherlands Organisation for Scientific Research (NWO) project Knowledge-based Automated Reasoning (grant 612.001.208).This is the final published version. It first appeared at http://jfr.unibo.it/article/view/4593/5730?acceptCookies=1

Radboud Repository

Apollo (Cambridge)

ProofWatch: Watchlist Guidance for Large Theories in E

Author: A Grabowski
C Kaliszyk
C Kaliszyk
C Kaliszyk
C Kaliszyk
D Silver
J Alama
J Jakubův
J Jakubův
J Otten
J Urban
J Urban
JC Blanchette
JC Blanchette
K Slind
L Bachmair
L Kovács
M Färber
M Färber
M Kinyon
R Veroff
S Schulz
S Schulz
S Schulz
S Schulz
T Gauthier
T Gransden
W McCune
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 19/05/2018
Field of study

Watchlist (also hint list) is a mechanism that allows related proofs to guide a proof search for a new conjecture. This mechanism has been used with the Otter and Prover9 theorem provers, both for interactive formalizations and for human-assisted proving of open conjectures in small theories. In this work we explore the use of watchlists in large theories coming from first-order translations of large ITP libraries, aiming at improving hammer-style automation by smarter internal guidance of the ATP systems. In particular, we (i) design watchlist-based clause evaluation heuristics inside the E ATP system, and (ii) develop new proof guiding algorithms that load many previous proofs inside the ATP and focus the proof search using a dynamically updated notion of proof matching. The methods are evaluated on a large set of problems coming from the Mizar library, showing significant improvement of E's standard portfolio of strategies, and also of the previous best set of strategies invented for Mizar by evolutionary methods.Comment: 19 pages, 10 tables, submitted to ITP 2018 at FLO

arXiv.org e-Print Archive

Crossref