Learned Bloom Filter for Multi-key Membership Testing

Yunchuan Li, Ziwei Wang, Ruixin Yang, Yan Zhao, Rui Zhou, Kai Zheng

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

1 Citationer (Scopus)

Abstract

Multi-key membership testing refers to checking whether a queried element exists in a given set of multi-key elements, which is a fundamental operation for computing systems and networking applications such as web search, mail systems, distributed databases, firewalls, and network routing. Most existing studies for membership testing are built on Bloom filter, a space-efficient and high-security probabilistic data structure. However, traditional Bloom filter always performs poorly in multi-key scenarios. Recently, a new variant of Bloom filter that has combined machine learning methods and Bloom filter, also known as Learned Bloom Filter (LBF), has drawn increasing attention for its significant improvements in reducing space occupation and False Positive Rate (FPR). More importantly, due to the introduction of the learned model, LBF can well address some problems of Bloom filter in multi-key scenarios. Because of this, we propose a Multi-key LBF (MLBF) data structure, which contains a value-interaction-based multi-key classifier and a multi-key Bloom filter. To reduce FPR, we further propose an Interval-based MLBF, which divides keys into specific intervals according to the data distribution. Extensive experiments based on two real datasets confirm the superiority of the proposed data structures in terms of FPR and query efficiency.

OriginalsprogEngelsk
TitelDatabase Systems for Advanced Applications - 28th International Conference, DASFAA 2023, Proceedings : 28th International Conference, DASFAA 2023
RedaktørerXin Wang, Maria Luisa Sapino, Wook-Shin Han, Amr El Abbadi, Gill Dobbie, Zhiyong Feng, Yingxiao Shao, Hongzhi Yin
Antal sider18
ForlagSpringer Nature Switzerland AG
Publikationsdato2023
Sider62-79
ISBN (Trykt)978-3-031-30636-5
ISBN (Elektronisk)978-3-031-30637-2
DOI
StatusUdgivet - 2023
Begivenhed28th International Conference, DASFAA 2023 - Tianjin, Kina
Varighed: 17 apr. 202320 apr. 2023

Konference

Konference28th International Conference, DASFAA 2023
Land/OmrådeKina
ByTianjin
Periode17/04/202320/04/2023
NavnLecture Notes in Computer Science
Nummer13943
ISSN0302-9743

Fingeraftryk

Dyk ned i forskningsemnerne om 'Learned Bloom Filter for Multi-key Membership Testing'. Sammen danner de et unikt fingeraftryk.

Citationsformater