Abstract
Reinforcement Learning (RL) is a widely used method for solving optimal problems without system knowledge. However, the use of RL for control of industrial applications is still reduced. One of the reasons for limited applicability of RL in this field is the difficulty of learning the system
behaviour under poor experimental conditions. This paper proposes two methods to cope with scenarios where the data collected is not contributing to the learning in linear systems. The first method identifies the periods where the learning is not efficient and pauses the policy update, the second method
applies a reduction of the approximation space to continue with the learning. The proposed methods are validated in a simulation environment of a water distribution network. Both methods show similar performance and provide a reliable operation during steady state or poor experimental conditions.
behaviour under poor experimental conditions. This paper proposes two methods to cope with scenarios where the data collected is not contributing to the learning in linear systems. The first method identifies the periods where the learning is not efficient and pauses the policy update, the second method
applies a reduction of the approximation space to continue with the learning. The proposed methods are validated in a simulation environment of a water distribution network. Both methods show similar performance and provide a reliable operation during steady state or poor experimental conditions.
Originalsprog | Engelsk |
---|---|
Titel | 2021 European Control Conference (ECC) |
Antal sider | 6 |
Forlag | IEEE |
Publikationsdato | 2021 |
Sider | 126-131 |
Artikelnummer | 9654896 |
ISBN (Trykt) | 978-1-6654-7945-5 |
ISBN (Elektronisk) | 978-9-4638-4236-5 |
DOI | |
Status | Udgivet - 2021 |
Begivenhed | 2021 European Control Conference (ECC) - Delft, Holland Varighed: 29 jun. 2021 → 2 jul. 2021 |
Konference
Konference | 2021 European Control Conference (ECC) |
---|---|
Land/Område | Holland |
By | Delft |
Periode | 29/06/2021 → 02/07/2021 |
Fingeraftryk
Dyk ned i forskningsemnerne om 'Real-Time Reinforcement Learning Control in Poor Experimental Conditions'. Sammen danner de et unikt fingeraftryk.Udstyr
-
Smart Water Infrastructures Laboratory (SWIL)
Jorge Val Ledesma (Operatør), Rafal Wisniewski (Leder), Carsten Kallesøe (Operatør), Saruch Satishkumar Rathore (Leder), Rahul Misra (Leder), Vishal Sopan Sawant (Leder) & Abhijit Mazumdar (Leder)
Institut for Elektroniske SystemerFacilitet: Laboratorie