Abstract
Very large time series are increasingly available from an ever wider range of IoT-enabled sensors deployed in different environments. Significant insights can be gained by mining temporal patterns from these time series. Unlike traditional pattern mining, temporal pattern mining (TPM) adds event time intervals into extracted patterns, making them more expressive at the expense of increased time and space complexities. Existing TPM methods either cannot scale to large datasets, or work only on pre-processed temporal events rather than on time series. This paper presents our Frequent Temporal Pattern Mining from Time Series (FTPMfTS) approach providing: (1) The end-to-end FTPMfTS process taking time series as input and producing frequent temporal patterns as output. (2) The efficient Hierarchical Temporal Pattern Graph Mining (HTPGM) algorithm that uses efficient data structures for fast support and confidence computation, and employs effective pruning techniques for significantly faster mining. (3) An approximate version of HTPGM that uses mutual information, a measure of data correlation, to prune unpromising time series from the search space. (4) An extensive experimental evaluation showing that HTPGM outperforms the baselines in runtime and memory consumption, and can scale to big datasets. The approximate HTPGM is up to two orders of magnitude faster and less memory consuming than the baselines, while retaining high accuracy.
Original language | English |
---|---|
Journal | Proceedings of the VLDB Endowment |
Volume | 15 |
Issue number | 3 |
Pages (from-to) | 673-685 |
Number of pages | 12 |
ISSN | 2150-8097 |
DOIs | |
Publication status | Published - 2022 |
Event | 48th International Conference on Very Large Data Bases - Sydney, Australia Duration: 5 Sept 2022 → 9 Sept 2022 https://vldb.org/2022/ |
Conference
Conference | 48th International Conference on Very Large Data Bases |
---|---|
Country/Territory | Australia |
City | Sydney |
Period | 05/09/2022 → 09/09/2022 |
Internet address |