Efficiently Learning Spatial Indices

Guanli Liu, Jianzhong Qi*, Christian S. Jensen, James Bailey, Lars Kulik

*Kontaktforfatter

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

2 Citationer (Scopus)

Abstract

Learned indices can leverage the high prediction accuracy and efficiency of modern deep learning techniques. They are capable of delivering better query performance than traditional indices over one-dimensional data. Recent studies demonstrate that we can also achieve query-efficient learned in-dices for spatial data by partitioning and subsequently transforming spatial data to one-dimensional values, after which existing techniques can be applied. While enabling efficient querying, building and rebuilding learned spatial indices efficiently remains largely unaddressed. As the model training needed to learn a spatial index is costly, efficient building and rebuilding of learned spatial indices on large data sets is challenging if performed by means of model training and retraining.To advance the practicality of learned spatial indices, we propose a system named ELSI that enables the efficient building and rebuilding of a class of learned spatial indices that follow two simple design principles. The core idea is to reduce the model (re-)building times by engineering reduced training sets that preserve key data distribution patterns. ELSI encompasses a suite of methods for constructing small and distribution-preserving training sets from input data sets. Further, given an input data set, ELSI can adaptively select a method that produces a learned index with high query efficiency. Experiments on real data sets of 100+ million points show that ELSI can reduce the build times of four different learned spatial indices consistently (by up to two orders of magnitude) without jeopardizing query efficiency.

OriginalsprogEngelsk
TitelProceedings - 2023 IEEE 39th International Conference on Data Engineering, ICDE 2023
Antal sider13
Vol/bind2023-april
ForlagIEEE Computer Society Press
Publikationsdato2023
Sider1572-1584
ISBN (Elektronisk)9798350322279
DOI
StatusUdgivet - 2023
Begivenhed39th IEEE International Conference on Data Engineering, ICDE 2023 - Anaheim, USA
Varighed: 3 apr. 20237 apr. 2023

Konference

Konference39th IEEE International Conference on Data Engineering, ICDE 2023
Land/OmrådeUSA
ByAnaheim
Periode03/04/202307/04/2023
NavnProceedings - International Conference on Data Engineering
Vol/bind2023-April
ISSN1084-4627

Bibliografisk note

Publisher Copyright:
© 2023 IEEE.

Fingeraftryk

Dyk ned i forskningsemnerne om 'Efficiently Learning Spatial Indices'. Sammen danner de et unikt fingeraftryk.

Citationsformater