
VEPRECO: Vertical databases with pre-pruning strategies and common candidate selection policies to fasten sequential pattern mining

  • University of Girona

Research output: Contribution to journal, Article, Peer-reviewed

Abstract

Sequential pattern mining (SPM) discovers, from event transactions recorded over time, patterns of events that follow a sequential order. In this work, we introduce a new, efficient sequential pattern mining algorithm called VEPRECO. VEPRECO makes three main contributions that speed up the mining process: a vertical representation of patterns, pre-pruning strategies that avoid checking infrequent patterns, and common candidate selection policies that reduce the number of iterations performed by the algorithm. An experimental evaluation was performed on synthetic and real-world datasets, and the results were compared with CM-SPAM, the most time- and memory-efficient sequential pattern mining algorithm in the literature, which we took as a baseline. We analysed separately how each of the proposed contributions affects time and memory usage and found that the proposed pattern representation yielded the largest reduction in both. Pre-pruning strategies and common candidate selection policies reduce runtime on datasets with many sequences and with similar transaction and sequence lengths.
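To make the idea of a vertical representation concrete, the following is a minimal sketch of a generic vertical (id-list) layout of the kind used by vertical SPM algorithms. It is not VEPRECO's actual data structure, which the abstract does not describe. The function names (build_vertical, s_extend, frequent_items), the toy database, and the support threshold are all assumptions made for illustration. Each item maps to the sequences and positions where it occurs, so the support of a pattern is read from its id-list without rescanning the database, and infrequent items can be discarded before any candidate is generated.

```python
from collections import defaultdict


def build_vertical(db):
    """Map each item to {sequence_id: [positions of the itemsets containing it]}."""
    vertical = defaultdict(lambda: defaultdict(list))
    for sid, sequence in enumerate(db):
        for pos, itemset in enumerate(sequence):
            for item in itemset:
                vertical[item][sid].append(pos)
    # Return plain dicts so that lookups never create entries by accident.
    return {item: dict(sids) for item, sids in vertical.items()}


def support(idlist):
    """Support = number of distinct sequences in which the pattern occurs."""
    return len(idlist)


def s_extend(idlist_a, idlist_b):
    """Id-list of the pattern 'a, later followed by b'.

    Keeps the positions of b that come strictly after the earliest
    end position of a in the same sequence.
    """
    result = {}
    for sid, positions_a in idlist_a.items():
        if sid in idlist_b:
            first_a = positions_a[0]  # positions are stored in increasing order
            later = [p for p in idlist_b[sid] if p > first_a]
            if later:
                result[sid] = later
    return result


def frequent_items(vertical, min_support):
    """Simple pruning step: drop items that cannot appear in any frequent pattern."""
    return {item for item, idlist in vertical.items() if support(idlist) >= min_support}


# Hypothetical toy database: each sequence is a list of itemsets (transactions).
db = [
    [{"a"}, {"b"}, {"c"}],
    [{"a", "b"}, {"c"}],
    [{"b"}, {"a"}],
]

vertical = build_vertical(db)
a_then_c = s_extend(vertical["a"], vertical["c"])
a_then_b = s_extend(vertical["a"], vertical["b"])
```

In this toy database, the pattern "a then c" occurs in two sequences and "a then b" in only one, so with a minimum support of 2 only the former would be kept for further extension. Item "c" appears in two of the three sequences, so frequent_items(vertical, 3) would return only "a" and "b".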

Original language: English
Article number: 117517
Journal: Expert Systems with Applications
Volume: 204
Publication status: Published - 15 Oct 2022
