TY - JOUR
T1 - Minimizing Combined Sewer Overflows with Online Model-Predictive Reinforcement Learning
AU - Kim, Esther Hahyeon
AU - Nielsen, Thomas Dyhre
AU - Larsen, Kim Guldstrand
AU - Goorden, Martijn
AU - Ghaffari, Mohsen
AU - Wasowski, Andrzej
AU - Høeg-Petersen, Andreas Holck
PY - 2024/7/5
Y1 - 2024/7/5
N2 - This paper addresses the challenges posed by urban stormwater runoff in light of increased urbanization and climate change, which strain traditional stormwater infrastructure. It focuses on mitigating Combined Sewer Overflows (CSOs) by maximizing urban runoff storage in stormwater tunnels during Wastewater Treatment Plant (WWTP) capacity overloads. Unlike passive rule-based control, this research explores adaptive control systems that utilize weather forecasts and dynamic strategies. It introduces a novel control synthesis approach, combining Model Predictive Control (MPC) and Q-learning, to optimize CSO management based on real-time weather predictions. Simulated evaluations (using EPA-SWMM, focused on the Hvidovre stormwater tunnel in Copenhagen, Denmark) show significant improvements in CSO management: 17% over classic Q-learning and 34% over Q-learning with the \uppaal model, and 72% over Rule-Based Control (RBC). Our method, MPC Q-learning, dynamically incorporates weather conditions, outperforming other Q-learning approaches.
AB - This paper addresses the challenges posed by urban stormwater runoff in light of increased urbanization and climate change, which strain traditional stormwater infrastructure. It focuses on mitigating Combined Sewer Overflows (CSOs) by maximizing urban runoff storage in stormwater tunnels during Wastewater Treatment Plant (WWTP) capacity overloads. Unlike passive rule-based control, this research explores adaptive control systems that utilize weather forecasts and dynamic strategies. It introduces a novel control synthesis approach, combining Model Predictive Control (MPC) and Q-learning, to optimize CSO management based on real-time weather predictions. Simulated evaluations (using EPA-SWMM, focused on the Hvidovre stormwater tunnel in Copenhagen, Denmark) show significant improvements in CSO management: 17% over classic Q-learning and 34% over Q-learning with the \uppaal model, and 72% over Rule-Based Control (RBC). Our method, MPC Q-learning, dynamically incorporates weather conditions, outperforming other Q-learning approaches.
KW - Stormwater management,
KW - Combined sewer overflows
KW - Model predictive control
KW - Control synthesis
KW - Reinforcement learning
M3 - Journal article
SP - 1
EP - 18
JO - Journal - in the process
JF - Journal - in the process
ER -