Resource Planning for SPARQL Query Execution on Data Sharing Platforms

Stefan Hagedorn, Katja Hose, Kai-Uwe Sattler, Jürgen Umbrich

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

1 Citation (Scopus)

Abstract

To increase performance, data sharing platforms often make use of clusters of nodes where certain tasks can be executed in parallel. Resource planning and especially deciding how many processors should be chosen to exploit parallel processing is complex in such a setup as increasing the number of processors does not always improve runtime due to communication overhead. Instead, there is usually an optimum number of processors for which using more or fewer processors leads to less efficient runtimes. In this paper, we present a cost model based on widely used statistics (VoiD) and show how to compute the optimum number of processors that should be used to evaluate a particular SPARQL query over a particular configuration and RDF dataset. Our first experiments show the general applicability of our approach but also how shortcomings in the used statistics limit the potential of optimization.
Original languageEnglish
Title of host publicationConsuming Linked Data (COLD 2014) : 5th International Workshop on Consuming Linked Data (COLD 2014) co-located with the 13th International Semantic Web Conference (ISWC 2014), Riva del Garda, Italy, October 20, 2014
EditorsOlaf Hartig, Aidan Hogan, Juan Sequeda
Number of pages12
Volume1264
PublisherCEUR Workshop Proceedings
Publication date2014
Publication statusPublished - 2014
Event5th International Workshop on Consuming Linked Data (COLD 2014) - Riva del Garda, Italy
Duration: 20 Oct 2014 → …

Conference

Conference5th International Workshop on Consuming Linked Data (COLD 2014)
Country/TerritoryItaly
CityRiva del Garda
Period20/10/2014 → …
SeriesCEUR Workshop Proceedings
ISSN1613-0073

Keywords

  • resource planning
  • SPARQL
  • data sharing

Fingerprint

Dive into the research topics of 'Resource Planning for SPARQL Query Execution on Data Sharing Platforms'. Together they form a unique fingerprint.

Cite this