Efficiently Mining Maximal Diverse Frequent Itemsets

Dingming Wu, Dexin Luo, Christian S. Jensen, Joshua Zhexu Huang

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

Abstrakt

Given a database of transactions, where each transaction is a set of items, maximal frequent itemset mining aims to find all itemsets that are frequent, meaning that they consist of items that co-occur in transactions more often than a given threshold, and that are maximal, meaning that they are not contained in other frequent itemsets. Such itemsets are the most interesting ones in a meaningful sense. We study the problem of efficiently finding such itemsets with the added constraint that only the top-k most diverse ones should be returned. An itemset is diverse if its items belong to many different categories according to a given hierarchy of item categories. We propose a solution that relies on a purposefully designed index structure called the FP*-tree and an accompanying bound-based algorithm. An extensive experimental study offers insight into the performance of the solution, indicating that it is capable of outperforming an existing method by orders of magnitude and of scaling to large databases of transactions
OriginalsprogEngelsk
TitelDatabase Systems for Advanced Applications - 24th International Conference, DASFAA 2019, Proceedings : DASFAA 2019: Database Systems for Advanced Applications
RedaktørerYongxin Tong, Juggapong Natwichai, Jun Yang, Guoliang Li, Joao Gama
Antal sider17
ForlagSpringer
Publikationsdato2019
Sider191-207
ISBN (Trykt)978-3-030-18578-7
ISBN (Elektronisk)978-3-030-18579-4
DOI
StatusUdgivet - 2019
BegivenhedInternational Conference on Database Systems for Advanced Applications - Chiang Mai, Thailand
Varighed: 22 apr. 201925 apr. 2019

Konference

KonferenceInternational Conference on Database Systems for Advanced Applications
LandThailand
ByChiang Mai
Periode22/04/201925/04/2019
NavnLecture Notes in Computer Science
Vol/bind11447
ISSN0302-9743

Citationsformater

Wu, D., Luo, D., Jensen, C. S., & Huang, J. Z. (2019). Efficiently Mining Maximal Diverse Frequent Itemsets. I Y. Tong, J. Natwichai, J. Yang, G. Li, & J. Gama (red.), Database Systems for Advanced Applications - 24th International Conference, DASFAA 2019, Proceedings: DASFAA 2019: Database Systems for Advanced Applications (s. 191-207). Springer. Lecture Notes in Computer Science, Bind. 11447 https://doi.org/10.1007/978-3-030-18579-4_12