Search CORE

13 research outputs found

Dreaming to Prove

Author: Szabó Kristóf
Zombori Zsolt
Publication venue
Publication date: 01/01/2021
Field of study

MizAR 60 for Mizar 50

Author: Goertzel Zarathustra
Jakub?v Jan
Kaliszyk Cezary
Piotrowski Bartosz
Schulz Stephan
Suda Martin
Urban Josef
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 14th International Conference on Interactive Theorem Proving (ITP 2023)
Publication date: 01/01/2023
Field of study

As a present to Mizar on its 50th anniversary, we develop an AI/TP system that automatically proves about 60% of the Mizar theorems in the hammer setting. We also automatically prove 75% of the Mizar theorems when the automated provers are helped by using only the premises used in the human-written Mizar proofs. We describe the methods and large-scale experiments leading to these results. This includes in particular the E and Vampire provers, their ENIGMA and Deepire learning modifications, a number of learning-based premise selection methods, and the incremental loop that interleaves growing a corpus of millions of ATP proofs with training increasingly strong AI/TP systems on them. We also present a selection of Mizar problems that were proved automatically

Dagstuhl Research Online Publication Server

Proof Repair Infrastructure for Supervised Models: Building a Large Proof Repair Dataset

Author: Gardner Andrew
Henderson R. Wesley
Reichel Tom
Ringer Talia
Touchet Andrew
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 14th International Conference on Interactive Theorem Proving (ITP 2023)
Publication date: 01/01/2023
Field of study

We report on our efforts building a new, large proof-repair dataset and benchmark suite for the Coq proof assistant. The dataset is made up of Git commits from open-source projects with old and new versions of definitions and proofs aligned across commits. Building this dataset has been a significant undertaking, highlighting a number of challenges and gaps in existing infrastructure. We discuss these challenges and gaps, and we provide recommendations for how the proof assistant community can address them. Our hope is to make it easier to build datasets and benchmark suites so that machine-learning tools for proofs will move to target the tasks that matter most and do so equitably across proof assistants

Dagstuhl Research Online Publication Server