SOS: Safe, Optimal and Small Strategies for Hybrid Markov Decision Processes

Pranav Ashok, Jan Kretínsky, Kim Guldstrand Larsen, Adrien Camille le Coent, Jakob Haahr Taankvist, Maxmillian Weininger

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

16 Citations (Scopus)

Abstract

For hybrid Markov decision processes, Open image in new window Stratego can compute strategies that are safe for a given safety property and (in the limit) optimal for a given cost function. Unfortunately, these strategies cannot be exported easily since they are computed as a very long list. In this paper, we demonstrate methods to learn compact representations of the strategies in the form of decision trees. These decision trees are much smaller, more understandable, and can easily be exported as code that can be loaded into embedded systems. Despite the size compression and actual differences to the original strategy, we provide guarantees on both safety and optimality of the decision-tree strategy. On the top, we show how to obtain yet smaller representations, which are still guaranteed safe, but achieve a desired trade-off between size and optimality.
Original languageEnglish
Title of host publicationQuantitative Evaluation of Systems - 16th International Conference, QEST 2019, Proceedings
EditorsDavid Parker, Verena Wolf
Number of pages18
Place of PublicationHeidelberg
PublisherSpringer
Publication date2019
Pages147-164
ISBN (Print)978-3-030-30280-1
ISBN (Electronic)978-3-030-30281-8
DOIs
Publication statusPublished - 2019
EventQEST 2019: International Conference on Quantitative Evaluation of Systems -
Duration: 10 Sept 201912 Sept 2019

Conference

ConferenceQEST 2019: International Conference on Quantitative Evaluation of Systems
Period10/09/201912/09/2019
SeriesLecture Notes in Computer Science
Volume11785
ISSN0302-9743

Fingerprint

Dive into the research topics of 'SOS: Safe, Optimal and Small Strategies for Hybrid Markov Decision Processes'. Together they form a unique fingerprint.

Cite this