Search CORE

2 research outputs found

On When and How to use SAT to Mine Frequent Itemsets

Author: Henriques Rui
Lynce Inês
Manquinho Vasco
Publication venue
Publication date: 26/07/2012
Field of study

A new stream of research was born in the last decade with the goal of mining itemsets of interest using Constraint Programming (CP). This has promoted a natural way to combine complex constraints in a highly flexible manner. Although CP state-of-the-art solutions formulate the task using Boolean variables, the few attempts to adopt propositional Satisfiability (SAT) provided an unsatisfactory performance. This work deepens the study on when and how to use SAT for the frequent itemset mining (FIM) problem by defining different encodings with multiple task-driven enumeration options and search strategies. Although for the majority of the scenarios SAT-based solutions appear to be non-competitive with CP peers, results show a variety of interesting cases where SAT encodings are the best option

arXiv.org e-Print Archive

On SAT Models Enumeration in Itemset Mining

Author: Jabbour Said
Sais Lakhdar
Salhi Yakoub
Publication venue
Publication date: 08/06/2015
Field of study

Frequent itemset mining is an essential part of data analysis and data mining. Recent works propose interesting SAT-based encodings for the problem of discovering frequent itemsets. Our aim in this work is to define strategies for adapting SAT solvers to such encodings in order to improve models enumeration. In this context, we deeply study the effects of restart, branching heuristics and clauses learning. We then conduct an experimental evaluation on SAT-Based itemset mining instances to show how SAT solvers can be adapted to obtain an efficient SAT model enumerator

arXiv.org e-Print Archive