Answering GPSJ Queries in a Polystore: A Dataspace-Based Approach

Hamdi Ben Hamadou, Enrico Gallinucci, Matteo Golfarelli

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

12 Citations (Scopus)

Abstract

The discipline of data science is steering analysts away from traditional data warehousing and towards a more flexible and lightweight approach to data analysis. The idea is to perform OLAP analyses in a pay-as-you-go manner across heterogeneous schemas and data models, where the integration is progressively carried out by the user as the available data is explored. In this paper, we propose an approach to support data analysis within a polystore supporting relational, document and column data models by automatically handling both data model and schema heterogeneity through a dataspace layer on top of the underlying databases. The expressiveness we enable corresponds to GPSJ queries, which are the most common class of queries in OLAP applications. We rely on Nested Relational Algebra to define a cross-database execution plan. The plan is composed of several local plans, to be executed on the distinct databases, and a global plan, which combines and possibly aggregates inter-database data. The system has been prototyped on Apache Spark.
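
The split between local plans and a global plan can be illustrated with a small sketch. This is not the authors' implementation and uses neither Nested Relational Algebra nor Apache Spark. It is a plain-Python approximation in which every name (`MAPPINGS`, `local_plan`, `global_plan`, `run_query`) and all sample data are hypothetical. It covers only the selection, projection, and aggregation parts of a GPSJ query and omits joins.

```python
from collections import defaultdict

# Hypothetical sample data: the same "sales" concept stored under two
# different schemas and data models (flat rows vs. nested documents).
relational_rows = [
    {"product": "pen", "qty": 3, "year": 2019},
    {"product": "ink", "qty": 1, "year": 2018},
]
document_rows = [
    {"item": {"name": "pen"}, "quantity": 2, "yr": 2019},
    {"item": {"name": "cap"}, "quantity": 5, "yr": 2019},
]

# Hypothetical dataspace mappings: for each source, how to read each
# unified attribute from a source-specific record.
MAPPINGS = {
    "relational": {
        "product": lambda r: r["product"],
        "quantity": lambda r: r["qty"],
        "year": lambda r: r["year"],
    },
    "document": {
        "product": lambda r: r["item"]["name"],
        "quantity": lambda r: r["quantity"],
        "year": lambda r: r["yr"],
    },
}


def local_plan(rows, mapping, predicate):
    """Project one source onto the unified attributes, then filter."""
    unified = ({attr: get(row) for attr, get in mapping.items()} for row in rows)
    return [record for record in unified if predicate(record)]


def global_plan(local_results, group_key, measure):
    """Combine the local results and aggregate the measure per group."""
    totals = defaultdict(int)
    for records in local_results:
        for record in records:
            totals[record[group_key]] += record[measure]
    return dict(totals)


def run_query():
    """Total quantity per product for 2019, across both sources."""
    is_2019 = lambda record: record["year"] == 2019
    local_results = [
        local_plan(relational_rows, MAPPINGS["relational"], is_2019),
        local_plan(document_rows, MAPPINGS["document"], is_2019),
    ]
    return global_plan(local_results, "product", "quantity")
```

In this sketch the mappings stand in for the dataspace layer: each local plan resolves schema and data-model differences before the global plan sees the data, so the aggregation step works on a single unified record shape.
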
Original language: English
Title of host publication: International Conference on Conceptual Modeling
Volume: 11788
Publisher: Springer
Publication date: 2019
Pages: 189-203
ISBN (Print): 978-3-030-33222-8
ISBN (Electronic): 978-3-030-33223-5
Publication status: Published - 2019
Externally published: Yes
Event: International Conference on Conceptual Modeling - Salvador, Brazil
Duration: 4 Nov 2019 to 7 Nov 2019

Conference

Conference: International Conference on Conceptual Modeling
Country/Territory: Brazil
City: Salvador
Period: 04/11/2019 to 07/11/2019
Series: Lecture Notes in Computer Science
Volume: 11788
ISSN: 0302-9743
