Abstract
In this paper, we aim to study safety specifications for a Markov decision process with stochastic stopping time in an almost model-free setting. Our approach involves characterizing a proxy set of the states that are near in a probabilistic sense to the set of unsafe states - forbidden set. We also provide results that relate safety function with reinforcement learning. Consequently, we develop an online algorithm based on the temporal difference method to compute the safety function. Finally, we provide simulation results that demonstrate our work in a simple example.
Original language | English |
---|---|
Title of host publication | 2023 European Control Conference (ECC) |
Number of pages | 6 |
Publisher | IEEE |
Publication date | 13 Jun 2023 |
Pages | 1-6 |
ISBN (Print) | 978-1-6654-6531-1, 978-3-907144-09-1 |
ISBN (Electronic) | 978-3-907144-08-4 |
DOIs | |
Publication status | Published - 13 Jun 2023 |
Event | 2023 European Control Conference, ECC 2023 - Bucharest, Romania Duration: 13 Jun 2023 → 16 Jun 2023 |
Conference
Conference | 2023 European Control Conference, ECC 2023 |
---|---|
Country/Territory | Romania |
City | Bucharest |
Period | 13/06/2023 → 16/06/2023 |