Missing Value Imputation for Multi-attribute Sensor Data Streams via Message Propagation

Xiao Li, Huan Li, Hua Lu, Christian S. Jensen, Varun Pandey, Volker Markl

Publikation: Bidrag til tidsskriftKonferenceartikel i tidsskriftForskningpeer review


Sensor data streams occur widely in various real-time applications in the context of the Internet of Things (IoT). However, sensor data streams feature missing values due to factors such as sensor failures, communication errors, or depleted batteries. Missing values can compromise the quality of real-time analytics tasks and downstream applications. Existing imputation methods either make strong assumptions about streams or have low efficiency. In this study, we aim to accurately and efficiently impute missing values in data streams that satisfy only general characteristics in order to benefit real-time applications more widely. First, we propose a message propagation imputation network (MPIN) that is able to recover the missing values of data instances in a time window. We give a theoretical analysis of why MPIN is effective. Second, we present a continuous imputation framework that consists of data update and model update mechanisms to enable MPIN to perform continuous imputation both effectively and efficiently. Extensive experiments on multiple real datasets show that MPIN can outperform the existing data imputers by wide margins and that the continuous imputation framework is efficient and accurate.
TidsskriftProceedings of the VLDB Endowment
Udgave nummer3
Sider (fra-til)345-358
Antal sider14
StatusUdgivet - 2023
Begivenhed50th International Conference on Very Large Data Bases - Gungzhou, Kina
Varighed: 25 aug. 202429 aug. 2024


Konference50th International Conference on Very Large Data Bases


Dyk ned i forskningsemnerne om 'Missing Value Imputation for Multi-attribute Sensor Data Streams via Message Propagation'. Sammen danner de et unikt fingeraftryk.