Projekter pr. år
Abstract
Because of the flexibility and expressiveness of their model, Knowledge Graphs (KGs) have received
increasing interest. These resources are usually represented in RDF and stored in specialized data
management systems called triplestores. Yet, while there exists a multitude of such systems, exploiting
varying data representation and indexing schemes, it is unclear which of the many design choices are
the most effective for a given database and query workload. Thus, first, we introduce a set of 20 access
patterns, which we identify within 6 categories, adopted to analyze the needs of a given query workload.
Then, we identify a novel three-dimensional design space for RDF data representations built on the
dimensions of subdivision, redundancy, and compression of data. This design space maps the trade-offs
between different RDF data representations employed to store RDF data within a triplestore. Thus, each
of the required access patterns is compared against its compatibility with a given data representation. As
we show, this approach allows identifying both the most effective RDF data representation for a given
query workload as well as unexplored design solutions.
increasing interest. These resources are usually represented in RDF and stored in specialized data
management systems called triplestores. Yet, while there exists a multitude of such systems, exploiting
varying data representation and indexing schemes, it is unclear which of the many design choices are
the most effective for a given database and query workload. Thus, first, we introduce a set of 20 access
patterns, which we identify within 6 categories, adopted to analyze the needs of a given query workload.
Then, we identify a novel three-dimensional design space for RDF data representations built on the
dimensions of subdivision, redundancy, and compression of data. This design space maps the trade-offs
between different RDF data representations employed to store RDF data within a triplestore. Thus, each
of the required access patterns is compared against its compatibility with a given data representation. As
we show, this approach allows identifying both the most effective RDF data representation for a given
query workload as well as unexplored design solutions.
Originalsprog | Engelsk |
---|---|
Tidsskrift | CEUR Workshop Proceedings |
Vol/bind | 3194 |
ISSN | 1613-0073 |
Status | Udgivet - 2022 |
Begivenhed | 30th Italian Symposium on Advanced Database Systems, SEBD 2022 - Tirrenia, Italien Varighed: 19 jun. 2022 → 20 jun. 2022 |
Konference
Konference | 30th Italian Symposium on Advanced Database Systems, SEBD 2022 |
---|---|
Land/Område | Italien |
By | Tirrenia |
Periode | 19/06/2022 → 20/06/2022 |
Fingeraftryk
Dyk ned i forskningsemnerne om 'Understanding RDF Data Representations in Triplestores'. Sammen danner de et unikt fingeraftryk.-
Poul Due Jensen Professorate in Big Data and Artificial Intelligence
Hose, K. (PI (principal investigator)), Jendal, T. E. (Projektdeltager) & Hansen, E. R. (Projektdeltager)
01/11/2019 → 31/12/2025
Projekter: Projekt › Forskning
-
EDAO: EDAO: Example Driven Analytics for Open Knowledge Graphs
Lissandrini, M. (PI (principal investigator)), Pedersen, T. B. (Supervisor) & Hose, K. (Andet)
15/09/2019 → 14/09/2021
Projekter: Projekt › Forskning
-
RelWeb: A Reliable Web of Data
Hose, K. (PI (principal investigator))
01/09/2019 → 31/08/2024
Projekter: Projekt › Forskning
Publikation
- 1 Tidsskriftartikel
-
A design space for RDF data representations
Sagi, T., Lissandrini, M., Pedersen, T. B. & Hose, K., 21 jan. 2022, I: The VLDB Journal. 31, 2, s. 347-373 27 s.Publikation: Bidrag til tidsskrift › Tidsskriftartikel › Forskning › peer review
Åben adgangFil16 Citationer (Scopus)128 Downloads (Pure)