ARDI: Automatic Generation of RDFS Models from Heterogeneous Data Sources

Shumet Tadesse Nigatu, Cristina Gomez, Oscar Romero, Katja Hose, Kashif Rabbani

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

2 Citations (Scopus)

Abstract

The current wealth of information, typically known as Big Data, generates a large amount of available data for organisations. Data Integration provides foundations to query disparate data sources as if they were integrated into a single source. However, current data integration tools are far from being useful for most organisations due to the heterogeneous nature of data sources, which represents a challenge for current frameworks. To enable data integration of highly heterogeneous and disparate data sources, this paper proposes a method to extract the schema from semi-structured (such as JSON and XML) and structured (such as relational) data sources, and generate an equivalent RDFS representation. The output of our method complements current frameworks and reduces the manual workload required to represent the input data sources in terms of the integration canonical data model. Our approach consists of production rules at the meta-model level that guarantee the correctness of the model translations. Finally, a tool for implementing our approach has been developed.
Original languageEnglish
Title of host publication2019 IEEE 23rd International Enterprise Distributed Object Computing Conference (EDOC)
Number of pages7
PublisherIEEE Press
Publication date2019
Pages190-196
Article number8945018
ISBN (Electronic)978-1-7281-2702-6
DOIs
Publication statusPublished - 2019
Event23rd IEEE International EDOC Conference - The Enterprise Computing Conference - Paris, France
Duration: 28 Oct 201931 Oct 2019
https://edoc2019.sciencesconf.org/

Conference

Conference23rd IEEE International EDOC Conference - The Enterprise Computing Conference
Country/TerritoryFrance
CityParis
Period28/10/201931/10/2019
Internet address
SeriesIEEE International Enterprise Distributed Object Computing Conference (EDOC)
ISSN2325-6362

Keywords

  • Data Integration
  • Data Model Translation
  • Meta-modeling
  • RDF Schema

Fingerprint

Dive into the research topics of 'ARDI: Automatic Generation of RDFS Models from Heterogeneous Data Sources'. Together they form a unique fingerprint.

Cite this