Explaining SAT Solving Using Causal Reasoning
The past three decades have witnessed notable success in designing efficient SAT solvers, with modern solvers capable of solving industrial benchmarks containing millions of variables in just a few seconds. This success owes much to the widely used conflict-driven clause learning (CDCL) algorithm, which nevertheless lacks a comprehensive theoretical understanding. Furthermore, it has been observed that CDCL solvers still struggle with certain classes of benchmarks comprising only hundreds of variables, in contrast to their widespread success on real-world applications. Consequently, there is an urgent need to uncover the inner workings of these seemingly weak yet powerful black boxes.
In this paper, we present a first step towards this goal by introducing an approach called CausalSAT, which employs causal reasoning to gain insights into the functioning of modern SAT solvers. CausalSAT first generates observational data from the execution of SAT solvers and learns a structured graph representing the causal relationships between the components of a SAT solver. Subsequently, given a query such as whether a clause with low literal block distance (LBD) has higher clause utility, CausalSAT calculates the causal effect of LBD on clause utility and provides an answer to the question. We use CausalSAT to quantitatively verify hypotheses previously regarded as "rules of thumb" or empirical findings, such as the query above or the notion that clauses with high LBD experience a rapid drop in utility over time. Moreover, CausalSAT can address previously unexplored questions, such as which branching heuristic leads to greater clause utility, in order to study the relationship between branching and clause management. Experimental evaluations using practical benchmarks demonstrate that CausalSAT effectively fits the data, verifies four "rules of thumb", and provides answers to three questions closely related to implementing modern solvers.
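To make the kind of query CausalSAT answers concrete, the following is a minimal sketch of a backdoor-adjustment estimate of the causal effect of LBD on clause utility. It is not the authors' implementation: the column names, the synthetic data, and the choice of clause size as the only confounder are illustrative assumptions standing in for the observational data CausalSAT logs from solver runs.

    # Minimal sketch (not the authors' code): estimating the causal effect of a
    # clause's literal block distance (LBD) on its utility via backdoor
    # adjustment, in the spirit of the pipeline described above.
    # Column names ('lbd', 'size', 'utility') are hypothetical.
    import numpy as np
    import pandas as pd
    from sklearn.linear_model import LinearRegression

    rng = np.random.default_rng(0)
    n = 5000

    # Synthetic stand-in for observational data logged from solver runs.
    size = rng.integers(2, 30, n)                         # clause length (assumed confounder)
    lbd = np.clip(size - rng.integers(0, 5, n), 2, None)  # LBD correlates with size
    utility = 10.0 - 0.8 * lbd - 0.1 * size + rng.normal(0, 1, n)
    df = pd.DataFrame({"lbd": lbd, "size": size, "utility": utility})

    # Backdoor adjustment with a linear outcome model: regress utility on the
    # treatment (lbd) plus the assumed confounders; the treatment coefficient
    # estimates the average causal effect under those assumptions.
    X = df[["lbd", "size"]]
    effect = LinearRegression().fit(X, df["utility"]).coef_[0]
    print(f"Estimated causal effect of LBD on utility: {effect:+.2f}")
    # A negative effect supports the "low LBD => higher utility" rule of thumb.

A real causal pipeline would learn the adjustment set from the causal graph rather than assuming it, but the estimation step reduces to a regression of this shape once the graph is fixed.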
A Study of the Learnability of Relational Properties: Model Counting Meets Machine Learning (MCML)
This paper introduces the MCML approach for empirically studying the learnability of relational properties that can be expressed in the well-known software design language Alloy. A key novelty of MCML is that it quantifies the performance of, and semantic differences among, trained machine learning (ML) models, specifically decision trees, with respect to entire (bounded) input spaces, not just given training and test datasets (as is common practice). MCML reduces these quantification problems to the classic complexity-theory problem of model counting and employs state-of-the-art model counters. The results show that relatively simple ML models can achieve surprisingly high performance (accuracy and F1-score) when evaluated in the common setting of training and test datasets, even when the training dataset is much smaller than the test dataset, suggesting that relational properties are seemingly simple to learn. However, MCML metrics based on model counting show that performance can degrade substantially when models are tested against the entire (bounded) input space, indicating the high complexity of precisely learning these properties and the usefulness of model counting in quantifying true performance.
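To illustrate the gap MCML measures, the sketch below trains a small decision tree on a toy boolean property and compares its training-set accuracy with its accuracy over the entire bounded input space. This is not the MCML tool: MCML encodes these counts symbolically and discharges them with model counters, whereas this toy simply enumerates all 2**n inputs, and the property, feature encoding, and hyperparameters are illustrative assumptions.

    # Minimal sketch (not MCML): measuring a trained classifier's accuracy over
    # an ENTIRE bounded input space rather than a held-out test set. MCML
    # performs this count via model counting; for a tiny boolean space we can
    # enumerate all 2**n inputs directly.
    from itertools import product
    import numpy as np
    from sklearn.tree import DecisionTreeClassifier

    n = 10  # bounded input space: all bit-vectors of length n

    def ground_truth(x):
        # Hypothetical stand-in for a relational property (e.g., an Alloy spec):
        # here, "at least half of the bits are set".
        return int(sum(x) * 2 >= n)

    space = np.array(list(product([0, 1], repeat=n)))
    labels = np.array([ground_truth(x) for x in space])

    # Small random training set, mirroring the "common practice" setting.
    rng = np.random.default_rng(0)
    train_idx = rng.choice(len(space), size=64, replace=False)
    tree = DecisionTreeClassifier(max_depth=4, random_state=0)
    tree.fit(space[train_idx], labels[train_idx])

    train_acc = tree.score(space[train_idx], labels[train_idx])
    full_acc = tree.score(space, labels)  # exhaustive count; MCML does this symbolically
    print(f"accuracy on training set:     {train_acc:.3f}")
    print(f"accuracy on full input space: {full_acc:.3f}")

The drop from training-set accuracy to whole-space accuracy in such a run is a small-scale analogue of the degradation the paper reports; model counting is what makes the whole-space figure tractable when the input space is far too large to enumerate.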