Projects per year
Abstract
In recent years, research in RDF archiving has gained traction due to the ever-growing nature of semantic data and the emergence of community-maintained knowledge bases. Several solutions have been proposed to manage the history of large RDF graphs, including approaches based on independent copies, time-based indexes, and change-based schemes. In particular, aggregated changesets have been shown to be relatively efficient at handling very large datasets. However, ingestion time can still become prohibitive as the revision history increases. To tackle this challenge, we propose a hybrid storage approach based on aggregated changesets, snapshots, and multiple delta chains. We evaluate different snapshot creation strategies on the BEAR benchmark for RDF archives, and show that our techniques can speed up ingestion time up to two orders of magnitude while keeping competitive performance for version materialization and delta queries. This allows us to support revision histories of lengths that are beyond reach with existing approaches.
Original language | English |
---|---|
Title of host publication | Proceedings - 17th IEEE International Conference on Semantic Computing, ICSC 2023 |
Number of pages | 8 |
Publisher | IEEE (Institute of Electrical and Electronics Engineers) |
Publication date | 2023 |
Pages | 41-48 |
ISBN (Print) | 978-1-6654-8264-6 |
ISBN (Electronic) | 978-1-6654-8263-9 |
DOIs | |
Publication status | Published - 2023 |
Event | 17th IEEE International Conference on Semantic Computing, ICSC 2023 - Virtual, Online, United States Duration: 1 Feb 2023 → 3 Feb 2023 |
Conference
Conference | 17th IEEE International Conference on Semantic Computing, ICSC 2023 |
---|---|
Country/Territory | United States |
City | Virtual, Online |
Period | 01/02/2023 → 03/02/2023 |
Series | IEEE International Conference on Semantic Computing (ICSC) |
---|---|
ISSN | 2325-6516 |
Bibliographical note
Publisher Copyright:© 2023 IEEE.
Keywords
- Archiving
- Indexing
- RDF
- Semantic Web
- Storage
- Versioning
Fingerprint
Dive into the research topics of 'Scaling Large RDF Archives to Very Long Histories'. Together they form a unique fingerprint.-
Poul Due Jensen Professorate in Big Data and Artificial Intelligence
Hose, K. (PI), Jendal, T. E. (Project Participant) & Hansen, E. R. (Project Participant)
01/11/2019 → 31/12/2025
Project: Research
-
Research output
- 4 Citations
- 1 PhD thesis
-
Efficient Management of Large RDF Archives
Pelgrin, O. P., 2024, Aalborg University Open Publishing.Research output: PhD thesis
Open AccessFile176 Downloads (Pure)