2 research outputs found
On When and How to use SAT to Mine Frequent Itemsets
A new stream of research was born in the last decade with the goal of mining
itemsets of interest using Constraint Programming (CP). This has promoted a
natural way to combine complex constraints in a highly flexible manner.
Although CP state-of-the-art solutions formulate the task using Boolean
variables, the few attempts to adopt propositional Satisfiability (SAT)
provided an unsatisfactory performance. This work deepens the study on when and
how to use SAT for the frequent itemset mining (FIM) problem by defining
different encodings with multiple task-driven enumeration options and search
strategies. Although for the majority of the scenarios SAT-based solutions
appear to be non-competitive with CP peers, results show a variety of
interesting cases where SAT encodings are the best option
On SAT Models Enumeration in Itemset Mining
Frequent itemset mining is an essential part of data analysis and data
mining. Recent works propose interesting SAT-based encodings for the problem of
discovering frequent itemsets. Our aim in this work is to define strategies for
adapting SAT solvers to such encodings in order to improve models enumeration.
In this context, we deeply study the effects of restart, branching heuristics
and clauses learning. We then conduct an experimental evaluation on SAT-Based
itemset mining instances to show how SAT solvers can be adapted to obtain an
efficient SAT model enumerator