A Deep Reinforcement Learning Approach to First-Order Logic Theorem Proving

Author: Abdelaziz Ibrahim
Cornelio Cristina
Crouse Maxwell
Fokoue Achille
Kapanipathi Pavan
Makni Bassem
Srinivas Kavitha
Thost Veronika
Whitehead Spencer
Witbrock Michael
Publication date: 15/09/2020

Automated theorem provers have traditionally relied on manually tuned heuristics to guide proof search. Deep reinforcement learning has been proposed as a way to obviate the need for such heuristics; however, its deployment in automated theorem proving remains a challenge. In this paper we introduce TRAIL, a system that applies deep reinforcement learning to saturation-based theorem proving. TRAIL leverages (a) a novel neural representation of the state of a theorem prover and (b) a novel characterization of the inference selection process in terms of an attention-based action policy. We show through systematic analysis that these mechanisms allow TRAIL to significantly outperform previous reinforcement-learning-based theorem provers on two benchmark datasets for first-order logic automated theorem proving, proving around 15% more theorems.

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Prolog Technology Reinforcement Learning Prover: (System Description)

Author: A Biere
B Beckert
C Browne
C Kaliszyk
D Silver
G Lukácsy
J Alama
J Jakubův
J Otten
J Urban
J Wielemaker
K Chvalovský
L Kocsis
L Kovács
ME Stickel
PB Andrews
RS Sutton
S Muggleton
S Schulz
W Bibel
Z Goertzel
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2020

We present a reinforcement learning toolkit for experiments with guiding automated theorem proving in the connection calculus. The core of the toolkit is a compact and easy-to-extend Prolog-based automated theorem prover called plCoP. plCoP builds on the leanCoP Prolog implementation and adds learning-guided Monte-Carlo Tree Search as done in the rlCoP system. Other components include a Python interface to plCoP and machine learners, and an external proof checker that verifies the validity of plCoP proofs. The toolkit is evaluated on two benchmarks, and we demonstrate its extensibility with two additions: (1) guidance is extended to reduction steps and (2) the standard leanCoP calculus is extended with rewrite steps and their learned guidance. We argue that the Prolog setting is suitable for combining statistical and symbolic learning methods. The complete toolkit is publicly released. © 2020, Springer Nature Switzerland AG

Crossref

Repository of the Academy's Library