Federated SPARQL queries processing with replicated fragments

Gabriela Montoya; Hala Skaf-Molli; Pascal Molli; Maria Esther Vidal

doi:10.1007/978-3-319-25007-6_3

Gabriela Montoya*, Hala Skaf-Molli, Pascal Molli, Maria Esther Vidal

*Corresponding author for this work

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

16 Citations (Scopus)

Abstract

Federated query engines provide a unified query interface to federations of SPARQL endpoints. Replicating data fragments from different Linked Data sources facilitates data re-organization to better fit federated query processing needs of data consumers. However, existing federated query engines are not designed to support replication and replicated data can negatively impact their performance. In this paper, we formulate the source selection problem with fragment replication (SSP-FR). For a given set of endpoints with replicated fragments and a SPARQL query, the problem is to select the endpoints that minimize the number of tuples to be transferred. We devise the Fedra source selection algorithm that approximates SSP-FR. We implement Fedra in the state-of-the-art federated query engines FedX and ANAPSID, and empirically evaluate their performance. Experimental results suggest that Fedra efficiently solves SSP-FR, reducing the number of selected SPARQL endpoints as well as the size of query intermediate results.
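
To make the source selection problem concrete, the following is a minimal sketch on a toy model. It is not the Fedra algorithm described in the paper: it is a plain greedy set-cover heuristic that picks a small set of endpoints jointly holding every fragment a query needs, on the simplifying assumption that contacting fewer endpoints stands in for transferring fewer tuples. The function name `select_endpoints`, the endpoint names, and the fragment identifiers are all hypothetical.

```python
from typing import Dict, List, Set


def select_endpoints(replicas: Dict[str, Set[str]], needed: Set[str]) -> List[str]:
    """Greedy set cover: pick endpoints until every needed fragment is covered.

    Illustrative heuristic only; it does not reproduce Fedra.
    """
    uncovered = set(needed)
    chosen: List[str] = []
    while uncovered:
        # Pick the endpoint holding the most still-uncovered fragments.
        # Iterating in sorted order makes tie-breaking deterministic.
        best = max(
            sorted(replicas),
            key=lambda e: len(replicas[e] & uncovered),
            default=None,
        )
        gain = replicas[best] & uncovered if best is not None else set()
        if not gain:
            raise ValueError(f"no endpoint holds fragments: {sorted(uncovered)}")
        chosen.append(best)
        uncovered -= gain
    return chosen


# Hypothetical federation: endpoint name -> fragments it replicates.
replicas = {
    "ep1": {"f1", "f2"},
    "ep2": {"f2", "f3"},
    "ep3": {"f1", "f2", "f3"},
}
selected = select_endpoints(replicas, needed={"f1", "f2", "f3"})
print(selected)
```

In this toy federation a single endpoint replicates all three fragments, so the heuristic selects only that one instead of querying all three endpoints and receiving overlapping results.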

Original language	English
Title of host publication	The Semantic Web – ISWC 2015 - 14th International Semantic Web Conference, Proceedings
Editors	Mathieu d’Aquin, Krishnaprasad Thirunarayan, Kavitha Srinivas, Paul Groth, Marcelo Arenas, Oscar Corcho, Markus Strohmaier, Jeff Heflin, Elena Simperl, Steffen Staab, Michel Dumontier
Number of pages	16
Publisher	Springer
Publication date	1 Jan 2015
Pages	36-51
ISBN (Print)	9783319250069
DOIs	https://doi.org/10.1007/978-3-319-25007-6_3
Publication status	Published - 1 Jan 2015
Event	14th International Semantic Web Conference, ISWC 2015 - Bethlehem, United States; Duration: 11 Oct 2015 → 15 Oct 2015

Conference

Conference	14th International Semantic Web Conference, ISWC 2015
Country/Territory	United States
City	Bethlehem
Period	11/10/2015 → 15/10/2015
Sponsor	Carlos Pedrinaci

Series	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	9366
ISSN	0302-9743

Keywords

Federated query processing
Fragment replication
Linked data
Source selection

Cite this

Montoya, G., Skaf-Molli, H., Molli, P., & Vidal, M. E. (2015). Federated SPARQL queries processing with replicated fragments. In M. d’Aquin, K. Thirunarayan, K. Srinivas, P. Groth, M. Arenas, O. Corcho, M. Strohmaier, J. Heflin, E. Simperl, S. Staab, & M. Dumontier (Eds.), The Semantic Web – ISWC 2015 - 14th International Semantic Web Conference, Proceedings (pp. 36-51). Springer. https://doi.org/10.1007/978-3-319-25007-6_3

Montoya, Gabriela ; Skaf-Molli, Hala ; Molli, Pascal et al. / Federated SPARQL queries processing with replicated fragments. The Semantic Web – ISWC 2015 - 14th International Semantic Web Conference, Proceedings. editor / Mathieu d’Aquin ; Krishnaprasad Thirunarayan ; Kavitha Srinivas ; Paul Groth ; Marcelo Arenas ; Oscar Corcho ; Markus Strohmaier ; Jeff Heflin ; Elena Simperl ; Steffen Staab ; Michel Dumontier. Springer, 2015. pp. 36-51 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 9366).

@inproceedings{2185b711c4c6463788694bf023f0f5cf,
  title = "Federated SPARQL queries processing with replicated fragments",
  abstract = "Federated query engines provide a unified query interface to federations of SPARQL endpoints. Replicating data fragments from different Linked Data sources facilitates data re-organization to better fit federated query processing needs of data consumers. However, existing federated query engines are not designed to support replication and replicated data can negatively impact their performance. In this paper, we formulate the source selection problem with fragment replication (SSP-FR). For a given set of endpoints with replicated fragments and a SPARQL query, the problem is to select the endpoints that minimize the number of tuples to be transferred. We devise the Fedra source selection algorithm that approximates SSP-FR. We implement Fedra in the state-of-the-art federated query engines FedX and ANAPSID, and empirically evaluate their performance. Experimental results suggest that Fedra efficiently solves SSP-FR, reducing the number of selected SPARQL endpoints as well as the size of query intermediate results.",
  keywords = "Federated query processing, Fragment replication, Linked data, Source selection",
  author = "Gabriela Montoya and Hala Skaf-Molli and Pascal Molli and Vidal, {Maria Esther}",
  year = "2015",
  month = jan,
  day = "1",
  doi = "10.1007/978-3-319-25007-6_3",
  language = "English",
  isbn = "9783319250069",
  series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
  publisher = "Springer",
  pages = "36--51",
  editor = "Mathieu d{\textquoteright}Aquin and Krishnaprasad Thirunarayan and Kavitha Srinivas and Paul Groth and Marcelo Arenas and Oscar Corcho and Markus Strohmaier and Jeff Heflin and Elena Simperl and Steffen Staab and Michel Dumontier",
  booktitle = "The Semantic Web – ISWC 2015 - 14th International Semantic Web Conference, Proceedings",
  address = "Germany",
  note = "14th International Semantic Web Conference, ISWC 2015 ; Conference date: 11-10-2015 Through 15-10-2015",
}

Montoya, G, Skaf-Molli, H, Molli, P & Vidal, ME 2015, Federated SPARQL queries processing with replicated fragments. in M d’Aquin, K Thirunarayan, K Srinivas, P Groth, M Arenas, O Corcho, M Strohmaier, J Heflin, E Simperl, S Staab & M Dumontier (eds), The Semantic Web – ISWC 2015 - 14th International Semantic Web Conference, Proceedings. Springer, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 9366, pp. 36-51, 14th International Semantic Web Conference, ISWC 2015, Bethlehem, United States, 11/10/2015. https://doi.org/10.1007/978-3-319-25007-6_3

Federated SPARQL queries processing with replicated fragments. / Montoya, Gabriela; Skaf-Molli, Hala; Molli, Pascal et al.
The Semantic Web – ISWC 2015 - 14th International Semantic Web Conference, Proceedings. ed. / Mathieu d’Aquin; Krishnaprasad Thirunarayan; Kavitha Srinivas; Paul Groth; Marcelo Arenas; Oscar Corcho; Markus Strohmaier; Jeff Heflin; Elena Simperl; Steffen Staab; Michel Dumontier. Springer, 2015. p. 36-51 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 9366).

TY  - GEN
T1  - Federated SPARQL queries processing with replicated fragments
AU  - Montoya, Gabriela
AU  - Skaf-Molli, Hala
AU  - Molli, Pascal
AU  - Vidal, Maria Esther
PY  - 2015/1/1
Y1  - 2015/1/1
N2  - Federated query engines provide a unified query interface to federations of SPARQL endpoints. Replicating data fragments from different Linked Data sources facilitates data re-organization to better fit federated query processing needs of data consumers. However, existing federated query engines are not designed to support replication and replicated data can negatively impact their performance. In this paper, we formulate the source selection problem with fragment replication (SSP-FR). For a given set of endpoints with replicated fragments and a SPARQL query, the problem is to select the endpoints that minimize the number of tuples to be transferred. We devise the Fedra source selection algorithm that approximates SSP-FR. We implement Fedra in the state-of-the-art federated query engines FedX and ANAPSID, and empirically evaluate their performance. Experimental results suggest that Fedra efficiently solves SSP-FR, reducing the number of selected SPARQL endpoints as well as the size of query intermediate results.
AB  - Federated query engines provide a unified query interface to federations of SPARQL endpoints. Replicating data fragments from different Linked Data sources facilitates data re-organization to better fit federated query processing needs of data consumers. However, existing federated query engines are not designed to support replication and replicated data can negatively impact their performance. In this paper, we formulate the source selection problem with fragment replication (SSP-FR). For a given set of endpoints with replicated fragments and a SPARQL query, the problem is to select the endpoints that minimize the number of tuples to be transferred. We devise the Fedra source selection algorithm that approximates SSP-FR. We implement Fedra in the state-of-the-art federated query engines FedX and ANAPSID, and empirically evaluate their performance. Experimental results suggest that Fedra efficiently solves SSP-FR, reducing the number of selected SPARQL endpoints as well as the size of query intermediate results.
KW  - Federated query processing
KW  - Fragment replication
KW  - Linked data
KW  - Source selection
UR  - http://www.scopus.com/inward/record.url?scp=84952315835&partnerID=8YFLogxK
U2  - 10.1007/978-3-319-25007-6_3
DO  - 10.1007/978-3-319-25007-6_3
M3  - Article in proceeding
AN  - SCOPUS:84952315835
SN  - 9783319250069
T3  - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP  - 36
EP  - 51
BT  - The Semantic Web – ISWC 2015 - 14th International Semantic Web Conference, Proceedings
A2  - d’Aquin, Mathieu
A2  - Thirunarayan, Krishnaprasad
A2  - Srinivas, Kavitha
A2  - Groth, Paul
A2  - Arenas, Marcelo
A2  - Corcho, Oscar
A2  - Strohmaier, Markus
A2  - Heflin, Jeff
A2  - Simperl, Elena
A2  - Staab, Steffen
A2  - Dumontier, Michel
PB  - Springer
T2  - 14th International Semantic Web Conference, ISWC 2015
Y2  - 11 October 2015 through 15 October 2015
ER  -

Montoya G, Skaf-Molli H, Molli P, Vidal ME. Federated SPARQL queries processing with replicated fragments. In d’Aquin M, Thirunarayan K, Srinivas K, Groth P, Arenas M, Corcho O, Strohmaier M, Heflin J, Simperl E, Staab S, Dumontier M, editors, The Semantic Web – ISWC 2015 - 14th International Semantic Web Conference, Proceedings. Springer. 2015. p. 36-51. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 9366). doi: 10.1007/978-3-319-25007-6_3
