Skip to main navigation Skip to search Skip to main content

A sat-based approach for mining high utility itemsets from transaction databases

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Mining high utility itemsets is a keystone in several data analysis tasks. High Utility Itemset Mining generalizes the frequent itemset mining problem by considering item quantities and weights. A high utility itemset is a set of items that appears in the transadatabase and having a high importance to the user, measured by a utility function. The utility of a pattern can be quantified in terms of various objective criteria, e.g., profit, frequency, and weight. Constraint Programming (CP) and Propositional Satisfiability (SAT) based frameworks for modeling and solving pattern mining tasks have gained a considerable attention in recent few years. This paper introduces the first declarative framework for mining high utility itemsets from transaction databases. First, we model the problem of mining high utility itemsets from transaction databases as a propositional satifiability problem. Moreover, to facilitate the mining task, we add an additional constraint to the efficiency of our method by using weighted clique cover problem. Then, we exploit the efficient SAT solving techniques to output all the high utility itemsets in the data that satisfy a user-specified minimum support and minimum utility values. Experimental evaluations on real and synthetic datasets show that the performance of our proposed approach is close to that of the optimal case of state-of-the-art HUIM algorithms.

Original languageEnglish
Title of host publicationBig Data Analytics and Knowledge Discovery - 22nd International Conference, DaWaK 2020, Proceedings
EditorsMin Song, Il-Yeol Song, Gabriele Kotsis, Ismail Khalil, A Min Tjoa
PublisherSpringer Science and Business Media Deutschland GmbH
Pages91-106
Number of pages16
ISBN (Print)9783030590642
DOIs
Publication statusPublished - 1 Jan 2020
Event22nd International Conference on Big Data Analytics and Knowledge Discovery, DaWaK 2020 - Bratislava, Slovakia
Duration: 14 Sept 202017 Sept 2020

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12393 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference22nd International Conference on Big Data Analytics and Knowledge Discovery, DaWaK 2020
Country/TerritorySlovakia
CityBratislava
Period14/09/2017/09/20

Keywords

  • Constraint programming
  • Data mining
  • High utility
  • Maximal clique
  • Propositional satisfiabilty

Fingerprint

Dive into the research topics of 'A sat-based approach for mining high utility itemsets from transaction databases'. Together they form a unique fingerprint.

Cite this