Effectively Learning Spatial Indices

Jianzhong Qi, Guanli Liu, Christian S. Jensen, Lars Kulik

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

50 Citationer (Scopus)
415 Downloads (Pure)

Abstract

Machine learning, especially deep learning, is used increasingly to enable better solutions for data management tasks previously solved by other means, including database indexing. A recent study shows that a neural network can not only learn to predict the disk address of the data value associated with a one-dimensional search key but also outperform B-tree-based indexing, thus promises to speed up a broad range of database queries that rely on B-trees for efficient data access. We consider the problem of learning an index for two-dimensional spatial data. A direct application of a neural network is unattractive because there is no obvious ordering of spatial point data. Instead, we introduce a rank space based ordering technique to establish an ordering of point data and group the points into blocks for index learning. To enable scalability, we propose a recursive strategy that partitions a large point set and learns indices for each partition. Experiments on real and synthetic data sets with more than 100 million points show that our learned indices are highly effective and efficient. Query processing using our indices is more than an order of magnitude faster than the use of R-trees or a recently proposed learned index.

OriginalsprogEngelsk
TidsskriftProceedings of the VLDB Endowment
Vol/bind13
Udgave nummer12
Sider (fra-til)2341-2354
Antal sider14
ISSN2150-8097
DOI
StatusUdgivet - 2020

Fingeraftryk

Dyk ned i forskningsemnerne om 'Effectively Learning Spatial Indices'. Sammen danner de et unikt fingeraftryk.

Citationsformater