TY - JOUR
T1 - Value-Based Reinforcement Learning for Digital Twins in Cloud Computing
AU - Bui, Van Phuc
AU - Pandey, Shashi Raj
AU - De Sant Ana, Pedro M.
AU - Popovski, Petar
PY - 2024/1/1
Y1 - 2024/1/1
N2 - The setup considered in the paper consists of sensors in a Networked Control System that are used to build a digital twin (DT) model of the system dynamics. The focus is on control, scheduling, and resource allocation for sensory observation to ensure timely delivery to the DT model deployed in the cloud. Low latency and communication timeliness are instrumental in ensuring that the DT model can accurately estimate and predict system states. However, acquiring data for efficient state estimation and control computing poses a non-trivial problem given the limited network resources, partial state vector information, and measurement errors encountered at distributed sensors. We propose the REinforcement learning and Variational Extended Kalman filter with Robust Belief (REVERB), which leverages a reinforcement learning solution combined with a Value of Information-based algorithm for performing optimal control and selecting the most informative sensors to satisfy the prediction accuracy of DT. Numerical results demonstrate that the DT platform can offer satisfactory performance while reducing the communication overhead up to five times.
AB - The setup considered in the paper consists of sensors in a Networked Control System that are used to build a digital twin (DT) model of the system dynamics. The focus is on control, scheduling, and resource allocation for sensory observation to ensure timely delivery to the DT model deployed in the cloud. Low latency and communication timeliness are instrumental in ensuring that the DT model can accurately estimate and predict system states. However, acquiring data for efficient state estimation and control computing poses a non-trivial problem given the limited network resources, partial state vector information, and measurement errors encountered at distributed sensors. We propose the REinforcement learning and Variational Extended Kalman filter with Robust Belief (REVERB), which leverages a reinforcement learning solution combined with a Value of Information-based algorithm for performing optimal control and selecting the most informative sensors to satisfy the prediction accuracy of DT. Numerical results demonstrate that the DT platform can offer satisfactory performance while reducing the communication overhead up to five times.
UR - http://www.scopus.com/inward/record.url?scp=85202825906&partnerID=8YFLogxK
U2 - 10.1109/ICC51166.2024.10622403
DO - 10.1109/ICC51166.2024.10622403
M3 - Journal article
AN - SCOPUS:85202825906
SN - 1550-3607
SP - 1413
EP - 1418
JO - IEEE International Conference on Communications
JF - IEEE International Conference on Communications
ER -