Answering skyline queries over incomplete data with crowdsourcing (Extended Abstract)

Xiaoye Miao, Yunjun Gao*, Su Guo, Lu Chen, Jianwei Yin, Qing Li

*Corresponding author for this work

Research output: Contribution to book/anthology/report/conference proceedingConference abstract in proceedingResearchpeer-review

1 Citation (Scopus)
236 Downloads (Pure)

Abstract

Due to the pervasiveness of incomplete data, incomplete data queries are vital in a large number of real-life scenarios. Current models and approaches for incomplete data queries mainly rely on the machine power. In this paper, we study the problem of skyline queries over incomplete data with crowdsourcing. We propose a novel query framework, termed as BayesCrowd, on top of Bayesian network and the typical c-table model on incomplete data. Considering budget and latency constraints, we present a suite of effective task selection strategies. In particular, since the probability computation of each object being an answer object is at least as hard as #SAT problem, we propose an adaptive DPLL (i.e., Davis-Putnam-Logemann-Loveland) algorithm to speed up the computation. Extensive experiments using both real and synthetic data sets confirm the superiority of BayesCrowd to the state-of-the-art method.

Original languageEnglish
Title of host publicationProceedings - 2020 IEEE 36th International Conference on Data Engineering, ICDE 2020
Number of pages2
PublisherIEEE (Institute of Electrical and Electronics Engineers)
Publication dateApr 2020
Pages2032-2033
Article number9101783
ISBN (Electronic)9781728129037
DOIs
Publication statusPublished - Apr 2020
Event36th IEEE International Conference on Data Engineering, ICDE 2020 - Dallas, United States
Duration: 20 Apr 202024 Apr 2020

Conference

Conference36th IEEE International Conference on Data Engineering, ICDE 2020
Country/TerritoryUnited States
CityDallas
Period20/04/202024/04/2020
SeriesProceedings - International Conference on Data Engineering
Volume2020-April
ISSN1084-4627

Fingerprint

Dive into the research topics of 'Answering skyline queries over incomplete data with crowdsourcing (Extended Abstract)'. Together they form a unique fingerprint.

Cite this