Resource Planning for SPARQL Query Execution on Data Sharing Platforms

Stefan Hagedorn; Katja Hose; Kai-Uwe Sattler; Jürgen Umbrich

Resource Planning for SPARQL Query Execution on Data Sharing Platforms

Stefan Hagedorn, Katja Hose, Kai-Uwe Sattler, Jürgen Umbrich

Department of Computer Science

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

1 Citation (Scopus)

Abstract

To increase performance, data sharing platforms often make use of clusters of nodes where certain tasks can be executed in parallel. Resource planning and especially deciding how many processors should be chosen to exploit parallel processing is complex in such a setup as increasing the number of processors does not always improve runtime due to communication overhead. Instead, there is usually an optimum number of processors for which using more or fewer processors leads to less efficient runtimes. In this paper, we present a cost model based on widely used statistics (VoiD) and show how to compute the optimum number of processors that should be used to evaluate a particular SPARQL query over a particular configuration and RDF dataset. Our first experiments show the general applicability of our approach but also how shortcomings in the used statistics limit the potential of optimization.

Original language	English
Title of host publication	Consuming Linked Data (COLD 2014) : 5th International Workshop on Consuming Linked Data (COLD 2014) co-located with the 13th International Semantic Web Conference (ISWC 2014), Riva del Garda, Italy, October 20, 2014
Editors	Olaf Hartig, Aidan Hogan, Juan Sequeda
Number of pages	12
Volume	1264
Publisher	CEUR Workshop Proceedings
Publication date	2014
Publication status	Published - 2014
Event	5th International Workshop on Consuming Linked Data (COLD 2014) - Riva del Garda, Italy Duration: 20 Oct 2014 → …

Conference

Conference	5th International Workshop on Consuming Linked Data (COLD 2014)
Country/Territory	Italy
City	Riva del Garda
Period	20/10/2014 → …

Series	CEUR Workshop Proceedings
ISSN	1613-0073

Keywords

resource planning
SPARQL
data sharing

Access to Document

http://ceur-ws.org/Vol-1264/cold2014_HagedornHSU.pdf

AUB Link

Search for the material in Aalborg University Library's search engine

Cite this

Hagedorn, S., Hose, K., Sattler, K-U., & Umbrich, J. (2014). Resource Planning for SPARQL Query Execution on Data Sharing Platforms. In O. Hartig, A. Hogan, & J. Sequeda (Eds.), Consuming Linked Data (COLD 2014): 5th International Workshop on Consuming Linked Data (COLD 2014) co-located with the 13th International Semantic Web Conference (ISWC 2014), Riva del Garda, Italy, October 20, 2014 (Vol. 1264). CEUR Workshop Proceedings. http://ceur-ws.org/Vol-1264/cold2014_HagedornHSU.pdf

Hagedorn, Stefan ; Hose, Katja ; Sattler, Kai-Uwe et al. / Resource Planning for SPARQL Query Execution on Data Sharing Platforms. Consuming Linked Data (COLD 2014): 5th International Workshop on Consuming Linked Data (COLD 2014) co-located with the 13th International Semantic Web Conference (ISWC 2014), Riva del Garda, Italy, October 20, 2014. editor / Olaf Hartig ; Aidan Hogan ; Juan Sequeda. Vol. 1264 CEUR Workshop Proceedings, 2014. (CEUR Workshop Proceedings).

@inproceedings{313c366aa3a440c3a106176b16eb02ec,

title = "Resource Planning for SPARQL Query Execution on Data Sharing Platforms",

abstract = "To increase performance, data sharing platforms often make use of clusters of nodes where certain tasks can be executed in parallel. Resource planning and especially deciding how many processors should be chosen to exploit parallel processing is complex in such a setup as increasing the number of processors does not always improve runtime due to communication overhead. Instead, there is usually an optimum number of processors for which using more or fewer processors leads to less efficient runtimes. In this paper, we present a cost model based on widely used statistics (VoiD) and show how to compute the optimum number of processors that should be used to evaluate a particular SPARQL query over a particular configuration and RDF dataset. Our first experiments show the general applicability of our approach but also how shortcomings in the used statistics limit the potential of optimization.",

keywords = "resource planning, SPARQL, data sharing",

author = "Stefan Hagedorn and Katja Hose and Kai-Uwe Sattler and J{\"u}rgen Umbrich",

year = "2014",

language = "English",

volume = "1264",

series = "CEUR Workshop Proceedings",

publisher = "CEUR Workshop Proceedings",

editor = "Olaf Hartig and Aidan Hogan and Juan Sequeda",

booktitle = "Consuming Linked Data (COLD 2014)",

note = "5th International Workshop on Consuming Linked Data (COLD 2014) ; Conference date: 20-10-2014",

}

Hagedorn, S, Hose, K, Sattler, K-U & Umbrich, J 2014, Resource Planning for SPARQL Query Execution on Data Sharing Platforms. in O Hartig, A Hogan & J Sequeda (eds), Consuming Linked Data (COLD 2014): 5th International Workshop on Consuming Linked Data (COLD 2014) co-located with the 13th International Semantic Web Conference (ISWC 2014), Riva del Garda, Italy, October 20, 2014. vol. 1264, CEUR Workshop Proceedings, CEUR Workshop Proceedings, 5th International Workshop on Consuming Linked Data (COLD 2014), Riva del Garda, Italy, 20/10/2014. <http://ceur-ws.org/Vol-1264/cold2014_HagedornHSU.pdf>

Resource Planning for SPARQL Query Execution on Data Sharing Platforms. / Hagedorn, Stefan; Hose, Katja; Sattler, Kai-Uwe et al.
Consuming Linked Data (COLD 2014): 5th International Workshop on Consuming Linked Data (COLD 2014) co-located with the 13th International Semantic Web Conference (ISWC 2014), Riva del Garda, Italy, October 20, 2014. ed. / Olaf Hartig; Aidan Hogan; Juan Sequeda. Vol. 1264 CEUR Workshop Proceedings, 2014. (CEUR Workshop Proceedings).

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

TY - GEN

T1 - Resource Planning for SPARQL Query Execution on Data Sharing Platforms

AU - Hagedorn, Stefan

AU - Hose, Katja

AU - Sattler, Kai-Uwe

AU - Umbrich, Jürgen

PY - 2014

Y1 - 2014

N2 - To increase performance, data sharing platforms often make use of clusters of nodes where certain tasks can be executed in parallel. Resource planning and especially deciding how many processors should be chosen to exploit parallel processing is complex in such a setup as increasing the number of processors does not always improve runtime due to communication overhead. Instead, there is usually an optimum number of processors for which using more or fewer processors leads to less efficient runtimes. In this paper, we present a cost model based on widely used statistics (VoiD) and show how to compute the optimum number of processors that should be used to evaluate a particular SPARQL query over a particular configuration and RDF dataset. Our first experiments show the general applicability of our approach but also how shortcomings in the used statistics limit the potential of optimization.

AB - To increase performance, data sharing platforms often make use of clusters of nodes where certain tasks can be executed in parallel. Resource planning and especially deciding how many processors should be chosen to exploit parallel processing is complex in such a setup as increasing the number of processors does not always improve runtime due to communication overhead. Instead, there is usually an optimum number of processors for which using more or fewer processors leads to less efficient runtimes. In this paper, we present a cost model based on widely used statistics (VoiD) and show how to compute the optimum number of processors that should be used to evaluate a particular SPARQL query over a particular configuration and RDF dataset. Our first experiments show the general applicability of our approach but also how shortcomings in the used statistics limit the potential of optimization.

KW - resource planning

KW - SPARQL

KW - data sharing

M3 - Article in proceeding

VL - 1264

T3 - CEUR Workshop Proceedings

BT - Consuming Linked Data (COLD 2014)

A2 - Hartig, Olaf

A2 - Hogan, Aidan

A2 - Sequeda, Juan

PB - CEUR Workshop Proceedings

T2 - 5th International Workshop on Consuming Linked Data (COLD 2014)

Y2 - 20 October 2014

ER -

Hagedorn S, Hose K, Sattler K-U, Umbrich J. Resource Planning for SPARQL Query Execution on Data Sharing Platforms. In Hartig O, Hogan A, Sequeda J, editors, Consuming Linked Data (COLD 2014): 5th International Workshop on Consuming Linked Data (COLD 2014) co-located with the 13th International Semantic Web Conference (ISWC 2014), Riva del Garda, Italy, October 20, 2014. Vol. 1264. CEUR Workshop Proceedings. 2014. (CEUR Workshop Proceedings).

Resource Planning for SPARQL Query Execution on Data Sharing Platforms

Abstract

Conference

Keywords

Access to Document

AUB Link

Fingerprint

Cite this