Abstract
Reinforcement Learning (RL) is a widely used method for solving optimal problems without system knowledge. However, the use of RL for control of industrial applications is still reduced. One of the reasons for limited applicability of RL in this field is the difficulty of learning the system
behaviour under poor experimental conditions. This paper proposes two methods to cope with scenarios where the data collected is not contributing to the learning in linear systems. The first method identifies the periods where the learning is not efficient and pauses the policy update, the second method
applies a reduction of the approximation space to continue with the learning. The proposed methods are validated in a simulation environment of a water distribution network. Both methods show similar performance and provide a reliable operation during steady state or poor experimental conditions.
behaviour under poor experimental conditions. This paper proposes two methods to cope with scenarios where the data collected is not contributing to the learning in linear systems. The first method identifies the periods where the learning is not efficient and pauses the policy update, the second method
applies a reduction of the approximation space to continue with the learning. The proposed methods are validated in a simulation environment of a water distribution network. Both methods show similar performance and provide a reliable operation during steady state or poor experimental conditions.
Original language | English |
---|---|
Title of host publication | 2021 European Control Conference (ECC) |
Number of pages | 6 |
Publisher | IEEE |
Publication date | 2021 |
Pages | 126-131 |
Article number | 9654896 |
ISBN (Print) | 978-1-6654-7945-5 |
ISBN (Electronic) | 978-9-4638-4236-5 |
DOIs | |
Publication status | Published - 2021 |
Event | 2021 European Control Conference (ECC) - Delft, Netherlands Duration: 29 Jun 2021 → 2 Jul 2021 |
Conference
Conference | 2021 European Control Conference (ECC) |
---|---|
Country/Territory | Netherlands |
City | Delft |
Period | 29/06/2021 → 02/07/2021 |
Fingerprint
Dive into the research topics of 'Real-Time Reinforcement Learning Control in Poor Experimental Conditions'. Together they form a unique fingerprint.Equipment
-
Smart Water Infrastructures Laboratory (SWIL)
Jorge Val Ledesma (Operator), Rafal Wisniewski (Manager), Carsten Kallesøe (Operator), Saruch Satishkumar Rathore (Manager), Rahul Misra (Manager), Vishal Sopan Sawant (Manager) & Abhijit Mazumdar (Manager)
Department of Electronic SystemsFacility: Laboratory