Parallelizing Federated SPARQL Queries in Presence of Replicated Data

Thomas Minier, Gabriela Montoya, Hala Skaf-Molli, Pascal Molli

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

1 Citation (Scopus)

Abstract

Federated query engines have been enhanced to exploit new data localities created by replicated data, e.g., Fedra. However, existing replication aware federated query engines mainly focus on pruning sources during the source selection and query decomposition in order to reduce intermediate results thanks to data locality. In this paper, we implement a replication-aware parallel join operator: Pen. This operator can be used to exploit replicated data during query execution. For existing replication-aware federated query engines, this operator exploits replicated data to parallelize the execution of joins and reduce execution time. For Triple Pattern Fragment (TPF) clients, this operator exploits the availability of several TPF servers exposing the same dataset to share the load among the servers. We implemented Pen in the federated query engine FedX with the replicated-aware source selection Fedra and in the reference TPF client. We empirically evaluated the performance of engines extended with the Pen operator and the experimental results suggest that our extensions outperform the existing approaches in terms of execution time and balance of load among the servers, respectively.
Original languageEnglish
Title of host publicationThe Semantic Web: ESWC 2017 Satellite Events : ESWC 2017 Satellite Events, Portorož, Slovenia, May 28 – June 1, 2017, Revised Selected Papers
PublisherSpringer
Publication date2017
Pages181-196
ISBN (Print)978-3-319-70406-7
ISBN (Electronic)978-3-319-70407-4
DOIs
Publication statusPublished - 2017
Event14th Extended Semantic Web Conference, ESWC 2017 - Portoroz, Slovenia
Duration: 28 May 20171 Jun 2017

Conference

Conference14th Extended Semantic Web Conference, ESWC 2017
CountrySlovenia
CityPortoroz
Period28/05/201701/06/2017
SponsorElsevier, IOS Press
SeriesLecture Notes in Computer Science
Volume10577
ISSN0302-9743

Keywords

  • Linked Data
  • Parallel query processing
  • Fragment replication
  • federated
  • Triple Pattern Fragment
  • Load balancing

Fingerprint

Dive into the research topics of 'Parallelizing Federated SPARQL Queries in Presence of Replicated Data'. Together they form a unique fingerprint.

Cite this