Real-Time Reinforcement Learning Control in Poor Experimental Conditions

Jorge Val Ledesma; Rafal Wisniewski; Carsten Kallesøe

doi:10.23919/ECC54610.2021.9654896

Real-Time Reinforcement Learning Control in Poor Experimental Conditions

Jorge Val Ledesma, Rafal Wisniewski, Carsten Kallesøe

Publikation: Bidrag til bog/antologi/rapport/konference proceeding › Konferenceartikel i proceeding › Forskning › peer review

1 Citationer (Scopus)

Abstract

Reinforcement Learning (RL) is a widely used method for solving optimal problems without system knowledge. However, the use of RL for control of industrial applications is still reduced. One of the reasons for limited applicability of RL in this field is the difficulty of learning the system
behaviour under poor experimental conditions. This paper proposes two methods to cope with scenarios where the data collected is not contributing to the learning in linear systems. The first method identifies the periods where the learning is not efficient and pauses the policy update, the second method
applies a reduction of the approximation space to continue with the learning. The proposed methods are validated in a simulation environment of a water distribution network. Both methods show similar performance and provide a reliable operation during steady state or poor experimental conditions.

Originalsprog	Engelsk
Titel	2021 European Control Conference (ECC)
Antal sider	6
Forlag	IEEE
Publikationsdato	2021
Sider	126-131
Artikelnummer	9654896
ISBN (Trykt)	978-1-6654-7945-5
ISBN (Elektronisk)	978-9-4638-4236-5
DOI	https://doi.org/10.23919/ECC54610.2021.9654896
Status	Udgivet - 2021
Begivenhed	2021 European Control Conference (ECC) - Delft, Holland Varighed: 29 jun. 2021 → 2 jul. 2021

Konference

Konference	2021 European Control Conference (ECC)
Land/Område	Holland
By	Delft
Periode	29/06/2021 → 02/07/2021

Adgang til dokumentet

10.23919/ECC54610.2021.9654896

AUB Link

Søg efter materialet i Aalborg Universitetsbiblioteks søgemaskine

Smart Water Infrastructures Laboratory (SWIL)
Jorge Val Ledesma (Operatør), Rafal Wisniewski (Leder), Carsten Kallesøe (Operatør), Saruch Satishkumar Rathore (Leder), Rahul Misra (Leder), Vishal Sopan Sawant (Leder) & Abhijit Mazumdar (Leder)
Institut for Elektroniske Systemer
Facilitet: Laboratorie

Citationsformater

@inproceedings{ef4c3578260b400ca48e06cd5634953e,

title = "Real-Time Reinforcement Learning Control in Poor Experimental Conditions",

abstract = "Reinforcement Learning (RL) is a widely used method for solving optimal problems without system knowledge. However, the use of RL for control of industrial applications is still reduced. One of the reasons for limited applicability of RL in this field is the difficulty of learning the systembehaviour under poor experimental conditions. This paper proposes two methods to cope with scenarios where the data collected is not contributing to the learning in linear systems. The first method identifies the periods where the learning is not efficient and pauses the policy update, the second methodapplies a reduction of the approximation space to continue with the learning. The proposed methods are validated in a simulation environment of a water distribution network. Both methods show similar performance and provide a reliable operation during steady state or poor experimental conditions.",

author = "Ledesma, {Jorge Val} and Rafal Wisniewski and Carsten Kalles{\o}e",

year = "2021",

doi = "10.23919/ECC54610.2021.9654896",

language = "English",

isbn = "978-1-6654-7945-5",

pages = "126--131",

booktitle = "2021 European Control Conference (ECC)",

publisher = "IEEE",

address = "United States",

note = "2021 European Control Conference (ECC) ; Conference date: 29-06-2021 Through 02-07-2021",

}

TY - GEN

T1 - Real-Time Reinforcement Learning Control in Poor Experimental Conditions

AU - Ledesma, Jorge Val

AU - Wisniewski, Rafal

AU - Kallesøe, Carsten

PY - 2021

Y1 - 2021

N2 - Reinforcement Learning (RL) is a widely used method for solving optimal problems without system knowledge. However, the use of RL for control of industrial applications is still reduced. One of the reasons for limited applicability of RL in this field is the difficulty of learning the systembehaviour under poor experimental conditions. This paper proposes two methods to cope with scenarios where the data collected is not contributing to the learning in linear systems. The first method identifies the periods where the learning is not efficient and pauses the policy update, the second methodapplies a reduction of the approximation space to continue with the learning. The proposed methods are validated in a simulation environment of a water distribution network. Both methods show similar performance and provide a reliable operation during steady state or poor experimental conditions.

AB - Reinforcement Learning (RL) is a widely used method for solving optimal problems without system knowledge. However, the use of RL for control of industrial applications is still reduced. One of the reasons for limited applicability of RL in this field is the difficulty of learning the systembehaviour under poor experimental conditions. This paper proposes two methods to cope with scenarios where the data collected is not contributing to the learning in linear systems. The first method identifies the periods where the learning is not efficient and pauses the policy update, the second methodapplies a reduction of the approximation space to continue with the learning. The proposed methods are validated in a simulation environment of a water distribution network. Both methods show similar performance and provide a reliable operation during steady state or poor experimental conditions.

U2 - 10.23919/ECC54610.2021.9654896

DO - 10.23919/ECC54610.2021.9654896

M3 - Article in proceeding

SN - 978-1-6654-7945-5

SP - 126

EP - 131

BT - 2021 European Control Conference (ECC)

PB - IEEE

T2 - 2021 European Control Conference (ECC)

Y2 - 29 June 2021 through 2 July 2021

ER -

Real-Time Reinforcement Learning Control in Poor Experimental Conditions

Abstract

Konference

Adgang til dokumentet

AUB Link

Fingeraftryk

Udstyr

Smart Water Infrastructures Laboratory (SWIL)

Citationsformater