Abstract
In this paper, we aim to study safety specifications for a Markov decision process with stochastic stopping time in an almost model-free setting. Our approach involves characterizing a proxy set of the states that are near in a probabilistic sense to the set of unsafe states - forbidden set. We also provide results that relate safety function with reinforcement learning. Consequently, we develop an online algorithm based on the temporal difference method to compute the safety function. Finally, we provide simulation results that demonstrate our work in a simple example.
Originalsprog | Engelsk |
---|---|
Titel | 2023 European Control Conference (ECC) |
Antal sider | 6 |
Forlag | IEEE (Institute of Electrical and Electronics Engineers) |
Publikationsdato | 13 jun. 2023 |
Sider | 1-6 |
ISBN (Trykt) | 978-1-6654-6531-1, 978-3-907144-09-1 |
ISBN (Elektronisk) | 978-3-907144-08-4 |
DOI | |
Status | Udgivet - 13 jun. 2023 |
Begivenhed | 2023 European Control Conference, ECC 2023 - Bucharest, Rumænien Varighed: 13 jun. 2023 → 16 jun. 2023 |
Konference
Konference | 2023 European Control Conference, ECC 2023 |
---|---|
Land/Område | Rumænien |
By | Bucharest |
Periode | 13/06/2023 → 16/06/2023 |