Towards Efficient Query Processing over Heterogeneous RDF Interfaces

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

Abstract

Since the proposal of RDF as a standard for representing statements about entities, diverse interfaces to publish and strategies to query RDF data have been proposed. Although some recent proposals are aware of the advantages and disadvantages of state-of-the-art approaches, no work has yet tried to integrate them into a hybrid system that exploits their, in many cases, complementary strengths to process queries more efficiently than each of these approaches could do individually. In this paper, we present hybridSE, an approach that exploits the diverse characteristics of queryable RDF interfaces to efficiently process SPARQL queries. We present a brief study of the characteristics of some of the most popular RDF interfaces (brTPF and SPARQL endpoints), a method to estimate the impact of using a particular interface on query evaluation, and a method to use multiple interfaces to efficiently process a query. Our experiments, using a well-known benchmark dataset and a large number of queries, with result sizes varying from 1 up to 1 million, show that hybridSE processes queries up to three orders of magnitude faster and transfers up to four orders of magnitude less data.
Original languageEnglish
Title of host publicationEmerging Topics in Semantic Technologies : ISWC 2018 Satellite Events
EditorsElena Demidova, Amrapali J. Zaveri, Elena Simperl
PublisherIOS Press
Publication date2018
Pages39-53
ISBN (Print)978-3-89838-736-1
ISBN (Electronic)978-1-61499-894-5
DOIs
Publication statusPublished - 2018
Event2nd Workshop on Decentralizing the Semantic Web, DeSemWeb 2018 - Monterey, United States
Duration: 8 Oct 2018 → …

Conference

Conference2nd Workshop on Decentralizing the Semantic Web, DeSemWeb 2018
CountryUnited States
CityMonterey
Period08/10/2018 → …
SeriesStudies on the Semantic Web
Volume36
ISSN1868-1158

Fingerprint

Query processing
Hybrid systems
Experiments

Keywords

  • SPARQL query processing
  • Heterogeneous RDF interfaces
  • brTPF
  • SPARQL endpoints

Cite this

Montoya, G., Aebeloe, C., & Hose, K. (2018). Towards Efficient Query Processing over Heterogeneous RDF Interfaces. In E. Demidova, A. J. Zaveri, & E. Simperl (Eds.), Emerging Topics in Semantic Technologies: ISWC 2018 Satellite Events (pp. 39-53). IOS Press. Studies on the Semantic Web, Vol.. 36 https://doi.org/10.3233/978-1-61499-894-5-39
Montoya, Gabriela ; Aebeloe, Christian ; Hose, Katja. / Towards Efficient Query Processing over Heterogeneous RDF Interfaces. Emerging Topics in Semantic Technologies: ISWC 2018 Satellite Events. editor / Elena Demidova ; Amrapali J. Zaveri ; Elena Simperl. IOS Press, 2018. pp. 39-53 (Studies on the Semantic Web, Vol. 36).
@inproceedings{ef690d0aaf9d41f8801a6490c6014f5e,
title = "Towards Efficient Query Processing over Heterogeneous RDF Interfaces",
abstract = "Since the proposal of RDF as a standard for representing statements about entities, diverse interfaces to publish and strategies to query RDF data have been proposed. Although some recent proposals are aware of the advantages and disadvantages of state-of-the-art approaches, no work has yet tried to integrate them into a hybrid system that exploits their, in many cases, complementary strengths to process queries more efficiently than each of these approaches could do individually. In this paper, we present hybridSE, an approach that exploits the diverse characteristics of queryable RDF interfaces to efficiently process SPARQL queries. We present a brief study of the characteristics of some of the most popular RDF interfaces (brTPF and SPARQL endpoints), a method to estimate the impact of using a particular interface on query evaluation, and a method to use multiple interfaces to efficiently process a query. Our experiments, using a well-known benchmark dataset and a large number of queries, with result sizes varying from 1 up to 1 million, show that hybridSE processes queries up to three orders of magnitude faster and transfers up to four orders of magnitude less data.",
keywords = "SPARQL query processing, Heterogeneous RDF interfaces, brTPF, SPARQL endpoints",
author = "Gabriela Montoya and Christian Aebeloe and Katja Hose",
year = "2018",
doi = "10.3233/978-1-61499-894-5-39",
language = "English",
isbn = "978-3-89838-736-1",
series = "Studies on the Semantic Web",
pages = "39--53",
editor = "Elena Demidova and Zaveri, {Amrapali J.} and Elena Simperl",
booktitle = "Emerging Topics in Semantic Technologies",
publisher = "IOS Press",

}

Montoya, G, Aebeloe, C & Hose, K 2018, Towards Efficient Query Processing over Heterogeneous RDF Interfaces. in E Demidova, AJ Zaveri & E Simperl (eds), Emerging Topics in Semantic Technologies: ISWC 2018 Satellite Events. IOS Press, Studies on the Semantic Web, vol. 36, pp. 39-53, 2nd Workshop on Decentralizing the Semantic Web, DeSemWeb 2018, Monterey, United States, 08/10/2018. https://doi.org/10.3233/978-1-61499-894-5-39

Towards Efficient Query Processing over Heterogeneous RDF Interfaces. / Montoya, Gabriela; Aebeloe, Christian; Hose, Katja.

Emerging Topics in Semantic Technologies: ISWC 2018 Satellite Events. ed. / Elena Demidova; Amrapali J. Zaveri; Elena Simperl. IOS Press, 2018. p. 39-53 (Studies on the Semantic Web, Vol. 36).

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

TY - GEN

T1 - Towards Efficient Query Processing over Heterogeneous RDF Interfaces

AU - Montoya, Gabriela

AU - Aebeloe, Christian

AU - Hose, Katja

PY - 2018

Y1 - 2018

N2 - Since the proposal of RDF as a standard for representing statements about entities, diverse interfaces to publish and strategies to query RDF data have been proposed. Although some recent proposals are aware of the advantages and disadvantages of state-of-the-art approaches, no work has yet tried to integrate them into a hybrid system that exploits their, in many cases, complementary strengths to process queries more efficiently than each of these approaches could do individually. In this paper, we present hybridSE, an approach that exploits the diverse characteristics of queryable RDF interfaces to efficiently process SPARQL queries. We present a brief study of the characteristics of some of the most popular RDF interfaces (brTPF and SPARQL endpoints), a method to estimate the impact of using a particular interface on query evaluation, and a method to use multiple interfaces to efficiently process a query. Our experiments, using a well-known benchmark dataset and a large number of queries, with result sizes varying from 1 up to 1 million, show that hybridSE processes queries up to three orders of magnitude faster and transfers up to four orders of magnitude less data.

AB - Since the proposal of RDF as a standard for representing statements about entities, diverse interfaces to publish and strategies to query RDF data have been proposed. Although some recent proposals are aware of the advantages and disadvantages of state-of-the-art approaches, no work has yet tried to integrate them into a hybrid system that exploits their, in many cases, complementary strengths to process queries more efficiently than each of these approaches could do individually. In this paper, we present hybridSE, an approach that exploits the diverse characteristics of queryable RDF interfaces to efficiently process SPARQL queries. We present a brief study of the characteristics of some of the most popular RDF interfaces (brTPF and SPARQL endpoints), a method to estimate the impact of using a particular interface on query evaluation, and a method to use multiple interfaces to efficiently process a query. Our experiments, using a well-known benchmark dataset and a large number of queries, with result sizes varying from 1 up to 1 million, show that hybridSE processes queries up to three orders of magnitude faster and transfers up to four orders of magnitude less data.

KW - SPARQL query processing

KW - Heterogeneous RDF interfaces

KW - brTPF

KW - SPARQL endpoints

U2 - 10.3233/978-1-61499-894-5-39

DO - 10.3233/978-1-61499-894-5-39

M3 - Article in proceeding

SN - 978-3-89838-736-1

T3 - Studies on the Semantic Web

SP - 39

EP - 53

BT - Emerging Topics in Semantic Technologies

A2 - Demidova, Elena

A2 - Zaveri, Amrapali J.

A2 - Simperl, Elena

PB - IOS Press

ER -

Montoya G, Aebeloe C, Hose K. Towards Efficient Query Processing over Heterogeneous RDF Interfaces. In Demidova E, Zaveri AJ, Simperl E, editors, Emerging Topics in Semantic Technologies: ISWC 2018 Satellite Events. IOS Press. 2018. p. 39-53. (Studies on the Semantic Web, Vol. 36). https://doi.org/10.3233/978-1-61499-894-5-39