Scalable Model-Based Management of Correlated Dimensional Time Series in ModelarDB+

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

8 Citations (Scopus)

Abstract

To monitor critical infrastructure, high quality sensors sampled at a high frequency are increasingly used. However, as they produce huge amounts of data, only simple aggregates are stored. This removes outliers and fluctuations that could indicate problems. As a remedy, we present a model-based approach for managing time series with dimensions that exploits correlation in and among time series. Specifically, we propose compressing groups of correlated time series using an extensible set of model types within a user-defined error bound (possibly zero). We name this new category of model-based compression methods for time series Multi-Model Group Compression (MMGC). We present the first MMGC method GOLEMM and extend model types to compress time series groups. We propose primitives for users to effectively define groups for differently sized data sets, and based on these, an automated grouping method using only the time series dimensions. We propose algorithms for executing simple and multi-dimensional aggregate queries on models. Last, we implement our methods in the Time Series Management System (TSMS) ModelarDB (ModelarDB+). Our evaluation shows that compared to widely used formats, ModelarDB+ provides up to 13.7x faster ingestion due to high compression, 113x better compression due to the adaptivity of GOLEMM, 573x faster aggregates by using models, and close to linear scalability. It is also extensible and supports online query processing.
Original languageEnglish
Title of host publicationProceedings of the 37th IEEE International Conference on Data Engineering
Number of pages12
PublisherIEEE
Publication date22 Apr 2021
Pages1380-1391
Article number9458830
ISBN (Print)978-1-7281-9185-0
ISBN (Electronic)978-1-7281-9184-3
DOIs
Publication statusPublished - 22 Apr 2021
Event37th International Conference on Data Engineering -
Duration: 19 Apr 202122 Apr 2021
Conference number: 2021
https://icde2021.gr/

Conference

Conference37th International Conference on Data Engineering
Number2021
Period19/04/202122/04/2021
Internet address
SeriesProceedings of the International Conference on Data Engineering
ISSN1063-6382

Fingerprint

Dive into the research topics of 'Scalable Model-Based Management of Correlated Dimensional Time Series in ModelarDB+'. Together they form a unique fingerprint.
  • ModelarDB: Integrated Model-Based Management of Time Series from Edge to Cloud

    Jensen, S. K., Thomsen, C. & Pedersen, T. B., 9 Feb 2023, Transactions on Large-Scale Data- and Knowledge-Centered Systems LIII. Hameurlain, A. & Tjoa, A. M. (eds.). Springer, p. 1-33 33 p. (Transactions on Large-Scale Data- and Knowledge-Centered Systems). (Lecture Notes in Computer Science, Vol. 13840).

    Research output: Contribution to book/anthology/report/conference proceedingBook chapterResearchpeer-review

    Open Access
    File
    24 Downloads (Pure)
  • Machine Learning Platform for Extreme Scale Computing on Compressed IoT Data

    Tirupathi, S., Salwala, D., Zizzo, G., Rawat, A., Purcell, M., Jensen, S. K., Thomsen, C., Ho, N., Cuza, C. E. M., Brusokas, J., Pedersen, T. B., Alexiou, G., Giannopoulos, G., Gidarakos, P., Kalimeris, A., Maroulis, S., Papastefanatos, G., Psarros, I., Stamatopoulos, V. & Terrovitis, M., 20 Dec 2022, 2022 IEEE International Conference on Big Data (Big Data). Tsumoto, S., Ohsawa, Y., Chen, L., Van den Poel, D., Hu, X., Motomura, Y., Takagi, T., Wu, L., Xie, Y., Abe, A. & Raghavan, V. (eds.). IEEE Communications Society, p. 3179-3185 7 p. 10020540

    Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

    2 Citations (Scopus)
  • Time Series Management Systems: A 2022 Survey

    Jensen, S. K., Pedersen, T. B. & Thomsen, C., 4 Dec 2022, (Accepted/In press) Data Series Management and Analytics. Palpanas, T. & Zoumpatianos, K. (eds.). Association for Computing Machinery, 81 p.

    Research output: Contribution to book/anthology/report/conference proceedingBook chapterResearchpeer-review

    Open Access
    File

Cite this