TY - JOUR
T1 - Deep learning based simulators for the phosphorus removal process control in wastewater treatment via deep reinforcement learning algorithms
AU - Mohammadi, Esmaeel
AU - Stokholm-Bjerregaard, Mikkel
AU - Hansen, Aviaja Anna
AU - Nielsen, Per Halkjær
AU - Ortiz Arroyo, Daniel
AU - Durdevic, Petar
PY - 2024/2
Y1 - 2024/2
N2 - Phosphorus removal is vital in wastewater treatment to reduce reliance on limited resources. Deep reinforcement learning (DRL) can be used to optimize the processes in wastewater treatment plants by learning control policies through trial and error. However, applying DRL to chemical and biological processes is challenging due to the need for accurate simulators. This study trained six models to identify the phosphorus removal process and used them to create a simulator for the DRL environment. While achieving high accuracy (>97%) in one-step ahead prediction of the test dataset, these models struggled as simulators over longer horizons, showing uncertainty and incorrect predictions when using their own outputs for multi-step simulations. Compounding errors in the models’ predictions were identified as one of the causes of this problem. This approach for improving process control involves creating simulation environments for DRL algorithms, using data from supervisory control and data acquisition (SCADA) systems with a sufficient historical horizon without complex system modeling or parameter estimation.
AB - Phosphorus removal is vital in wastewater treatment to reduce reliance on limited resources. Deep reinforcement learning (DRL) can be used to optimize the processes in wastewater treatment plants by learning control policies through trial and error. However, applying DRL to chemical and biological processes is challenging due to the need for accurate simulators. This study trained six models to identify the phosphorus removal process and used them to create a simulator for the DRL environment. While achieving high accuracy (>97%) in one-step ahead prediction of the test dataset, these models struggled as simulators over longer horizons, showing uncertainty and incorrect predictions when using their own outputs for multi-step simulations. Compounding errors in the models’ predictions were identified as one of the causes of this problem. This approach for improving process control involves creating simulation environments for DRL algorithms, using data from supervisory control and data acquisition (SCADA) systems with a sufficient historical horizon without complex system modeling or parameter estimation.
KW - Attention
KW - Deep reinforcement learning
KW - Dynamic model
KW - Phosphorus
KW - Sequence to sequence
KW - Simulator
UR - http://www.scopus.com/inward/record.url?scp=85183962940&partnerID=8YFLogxK
U2 - 10.1016/j.engappai.2024.107992
DO - 10.1016/j.engappai.2024.107992
M3 - Journal article
SN - 0952-1976
VL - 133
JO - Engineering Applications of Artificial Intelligence
JF - Engineering Applications of Artificial Intelligence
IS - Part A
M1 - 107992
ER -