Efficiently answer top-k queries on typed intervals

Jianqiu Xu; Hua Lu

doi:10.1016/j.is.2017.08.005

Efficiently answer top-k queries on typed intervals

Jianqiu Xu^*, Hua Lu

^*Kontaktforfatter

Publikation: Bidrag til tidsskrift › Tidsskriftartikel › Forskning › peer review

2 Citationer (Scopus)

Abstract

Consider a database consisting of a set of tuples, each of which contains an interval, a type and a weight. These tuples are called typed intervals and used to support applications involving diverse intervals. In this paper, we study top-k queries on typed intervals. The query reports k intervals intersecting the query time, containing a particular type and having the largest weight. The query time can be a point or an interval. Further, we define top-k continuous queries that return qualified intervals at each time point during the query interval. To efficiently answer such queries, a key challenge is to build an index structure to manage typed intervals. Employing the standard interval tree, we build the structure in a compact way to reduce the I/O cost, and provide analytically derived partitioning methods to manage the data. Query algorithms are proposed to support point, interval and continuous queries. An auxiliary main-memory structure is developed to report continuous results. Using large real and synthetic datasets, extensive experiments are performed in a prototype database system to demonstrate the effectiveness, efficiency and scalability. The results show that our method significantly outperforms alternative methods in most settings.

Originalsprog	Engelsk
Tidsskrift	Information Systems
Vol/bind	71
Sider (fra-til)	164-181
Antal sider	18
ISSN	0306-4379
DOI	https://doi.org/10.1016/j.is.2017.08.005
Status	Udgivet - 1 nov. 2017

Adgang til dokumentet

10.1016/j.is.2017.08.005

AUB Link

Søg efter materialet i Aalborg Universitetsbiblioteks søgemaskine

Andre filer og links

http://www.scopus.com/inward/record.url?scp=85028084645&partnerID=8YFLogxK

Citationsformater

@article{a913485617b24d94a771ccfc5744d561,

title = "Efficiently answer top-k queries on typed intervals",

abstract = "Consider a database consisting of a set of tuples, each of which contains an interval, a type and a weight. These tuples are called typed intervals and used to support applications involving diverse intervals. In this paper, we study top-k queries on typed intervals. The query reports k intervals intersecting the query time, containing a particular type and having the largest weight. The query time can be a point or an interval. Further, we define top-k continuous queries that return qualified intervals at each time point during the query interval. To efficiently answer such queries, a key challenge is to build an index structure to manage typed intervals. Employing the standard interval tree, we build the structure in a compact way to reduce the I/O cost, and provide analytically derived partitioning methods to manage the data. Query algorithms are proposed to support point, interval and continuous queries. An auxiliary main-memory structure is developed to report continuous results. Using large real and synthetic datasets, extensive experiments are performed in a prototype database system to demonstrate the effectiveness, efficiency and scalability. The results show that our method significantly outperforms alternative methods in most settings.",

author = "Jianqiu Xu and Hua Lu",

year = "2017",

month = nov,

day = "1",

doi = "10.1016/j.is.2017.08.005",

language = "English",

volume = "71",

pages = "164--181",

journal = "Information Systems",

issn = "0306-4379",

publisher = "Pergamon Press",

}

TY - JOUR

T1 - Efficiently answer top-k queries on typed intervals

AU - Xu, Jianqiu

AU - Lu, Hua

PY - 2017/11/1

Y1 - 2017/11/1

N2 - Consider a database consisting of a set of tuples, each of which contains an interval, a type and a weight. These tuples are called typed intervals and used to support applications involving diverse intervals. In this paper, we study top-k queries on typed intervals. The query reports k intervals intersecting the query time, containing a particular type and having the largest weight. The query time can be a point or an interval. Further, we define top-k continuous queries that return qualified intervals at each time point during the query interval. To efficiently answer such queries, a key challenge is to build an index structure to manage typed intervals. Employing the standard interval tree, we build the structure in a compact way to reduce the I/O cost, and provide analytically derived partitioning methods to manage the data. Query algorithms are proposed to support point, interval and continuous queries. An auxiliary main-memory structure is developed to report continuous results. Using large real and synthetic datasets, extensive experiments are performed in a prototype database system to demonstrate the effectiveness, efficiency and scalability. The results show that our method significantly outperforms alternative methods in most settings.

AB - Consider a database consisting of a set of tuples, each of which contains an interval, a type and a weight. These tuples are called typed intervals and used to support applications involving diverse intervals. In this paper, we study top-k queries on typed intervals. The query reports k intervals intersecting the query time, containing a particular type and having the largest weight. The query time can be a point or an interval. Further, we define top-k continuous queries that return qualified intervals at each time point during the query interval. To efficiently answer such queries, a key challenge is to build an index structure to manage typed intervals. Employing the standard interval tree, we build the structure in a compact way to reduce the I/O cost, and provide analytically derived partitioning methods to manage the data. Query algorithms are proposed to support point, interval and continuous queries. An auxiliary main-memory structure is developed to report continuous results. Using large real and synthetic datasets, extensive experiments are performed in a prototype database system to demonstrate the effectiveness, efficiency and scalability. The results show that our method significantly outperforms alternative methods in most settings.

UR - http://www.scopus.com/inward/record.url?scp=85028084645&partnerID=8YFLogxK

U2 - 10.1016/j.is.2017.08.005

DO - 10.1016/j.is.2017.08.005

M3 - Journal article

AN - SCOPUS:85028084645

SN - 0306-4379

VL - 71

SP - 164

EP - 181

JO - Information Systems

JF - Information Systems

ER -

Efficiently answer top-k queries on typed intervals

Abstract

Adgang til dokumentet

AUB Link

Andre filer og links

Fingeraftryk

Citationsformater