A parallel SAT-based framework for closed frequent itemsets mining

Imen Ouled Dlala, Said Jabbour, Badran Raddaoui, Lakhdar Sais

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Constraint programming (CP) and propositional satisfiability (SAT) based framework for modeling and solving pattern mining tasks has gained a considerable audience in recent years. However, this nice declarative and generic framework encounters a scaling problem. The huge size of constraints networks/propositional formulas encoding large datasets is identified as the main bottleneck of most existing approaches. In this paper, we propose a parallel SAT based framework for itemset mining problem to push forward the solving efficiency. The proposed approach is based on a divide-and-conquer paradigm, where the transaction database is partitioned using item-based guiding paths. Such decomposition allows us to derive smaller and independent Boolean formulas that can be solved in parallel. The performance and scalability of the proposed algorithm are evaluated through extensive experiments on several datasets. We demonstrate that our partition-based parallel SAT approach outperforms other CP approaches even in the sequential case, while significantly reducing the performances gap with specialized approaches.

Original languageEnglish
Title of host publicationPrinciples and Practice of Constraint Programming - 24th International Conference, CP 2018, Proceedings
EditorsJohn Hooker
PublisherSpringer Verlag
Pages570-587
Number of pages18
ISBN (Print)9783319983332
DOIs
Publication statusPublished - 1 Jan 2018
Externally publishedYes
Event24th International Conference on the Principles and Practice of Constraint Programming, CP 2018 - Lille, France
Duration: 27 Aug 201831 Aug 2018

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11008 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference24th International Conference on the Principles and Practice of Constraint Programming, CP 2018
Country/TerritoryFrance
CityLille
Period27/08/1831/08/18

Fingerprint

Dive into the research topics of 'A parallel SAT-based framework for closed frequent itemsets mining'. Together they form a unique fingerprint.

Cite this