Missing Value Imputation for Multi-attribute Sensor Data Streams via Message Propagation

Xiao Li, Huan Li, Hua Lu, Christian S. Jensen, Varun Pandey, Volker Markl

Research output: Contribution to journal › Conference article in Journal › Research › peer-review

Abstract

Sensor data streams occur widely in various real-time applications in the context of the Internet of Things (IoT). However, sensor data streams feature missing values due to factors such as sensor failures, communication errors, or depleted batteries. Missing values can compromise the quality of real-time analytics tasks and downstream applications. Existing imputation methods either make strong assumptions about streams or have low efficiency. In this study, we aim to accurately and efficiently impute missing values in data streams that satisfy only general characteristics in order to benefit real-time applications more widely. First, we propose a message propagation imputation network (MPIN) that is able to recover the missing values of data instances in a time window. We give a theoretical analysis of why MPIN is effective. Second, we present a continuous imputation framework that consists of data update and model update mechanisms to enable MPIN to perform continuous imputation both effectively and efficiently. Extensive experiments on multiple real datasets show that MPIN can outperform the existing data imputers by wide margins and that the continuous imputation framework is efficient and accurate.
Original language: English
Journal: Proceedings of the VLDB Endowment
Volume: 17
Issue number: 3
Pages (from-to): 345-358
Number of pages: 14
ISSN: 2150-8097
Publication status: Published - 2023
Event: 50th International Conference on Very Large Data Bases - Guangzhou, China
Duration: 25 Aug 2024 → 29 Aug 2024
https://vldb.org/2024/

Conference

Conference: 50th International Conference on Very Large Data Bases
Country/Territory: China
City: Guangzhou
Period: 25/08/2024 → 29/08/2024
