Scaling Large RDF Archives to Very Long Histories

Olivier Pelgrin*, Ruben Taelman, Luis Galarraga, Katja Hose

*Corresponding author for this work

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

4 Citations (Scopus)
55 Downloads (Pure)

Abstract

In recent years, research in RDF archiving has gained traction due to the ever-growing nature of semantic data and the emergence of community-maintained knowledge bases. Several solutions have been proposed to manage the history of large RDF graphs, including approaches based on independent copies, time-based indexes, and change-based schemes. In particular, aggregated changesets have been shown to be relatively efficient at handling very large datasets. However, ingestion time can still become prohibitive as the revision history increases. To tackle this challenge, we propose a hybrid storage approach based on aggregated changesets, snapshots, and multiple delta chains. We evaluate different snapshot creation strategies on the BEAR benchmark for RDF archives, and show that our techniques can speed up ingestion time up to two orders of magnitude while keeping competitive performance for version materialization and delta queries. This allows us to support revision histories of lengths that are beyond reach with existing approaches.

Original languageEnglish
Title of host publicationProceedings - 17th IEEE International Conference on Semantic Computing, ICSC 2023
Number of pages8
PublisherIEEE (Institute of Electrical and Electronics Engineers)
Publication date2023
Pages41-48
ISBN (Print)978-1-6654-8264-6
ISBN (Electronic)978-1-6654-8263-9
DOIs
Publication statusPublished - 2023
Event17th IEEE International Conference on Semantic Computing, ICSC 2023 - Virtual, Online, United States
Duration: 1 Feb 20233 Feb 2023

Conference

Conference17th IEEE International Conference on Semantic Computing, ICSC 2023
Country/TerritoryUnited States
CityVirtual, Online
Period01/02/202303/02/2023
SeriesIEEE International Conference on Semantic Computing (ICSC)
ISSN2325-6516

Bibliographical note

Publisher Copyright:
© 2023 IEEE.

Keywords

  • Archiving
  • Indexing
  • RDF
  • Semantic Web
  • Storage
  • Versioning

Fingerprint

Dive into the research topics of 'Scaling Large RDF Archives to Very Long Histories'. Together they form a unique fingerprint.

Cite this