Synthesis of Safe, Optimal and Compact Strategies for Stochastic Hybrid Games

Research output: Contribution to book/anthology/report/conference proceeding; Article in proceeding; Research; peer-review

Abstract

UPPAAL-Stratego is a recent branch of the verification tool UPPAAL that allows for the synthesis of safe and optimal strategies for stochastic timed (hybrid) games. We describe newly developed learning methods that allow for the synthesis of significantly better strategies with much improved convergence behaviour. We also describe a novel use of decision trees for learning strategy representations that are orders of magnitude more compact. In both cases, the search for optimality does not compromise safety.
Original language: English
Title of host publication: Proceedings of 30th International Conference on Concurrency Theory, CONCUR 2019
Editors: Wan Fokkink, Rob van Glabbeek
Number of pages: 5
Place of Publication: Schloss Dagstuhl
Publisher: Schloss Dagstuhl - Leibniz-Zentrum für Informatik GmbH, Dagstuhl Publishing
Publication date: 2019
Article number: 2
ISBN (Print): 978-3-95977-121-4
ISBN (Electronic): 9783959771214
DOI: https://doi.org/10.4230/LIPIcs.CONCUR.2019.2
Publication status: Published - 2019
Event: 30th International Conference on Concurrency Theory, CONCUR 2019 - Amsterdam, Netherlands
Duration: 27 Aug 2019 to 30 Aug 2019

Conference

Conference: 30th International Conference on Concurrency Theory, CONCUR 2019
Country: Netherlands
City: Amsterdam
Period: 27/08/2019 to 30/08/2019
Series: Leibniz International Proceedings in Informatics
Volume: 140
ISSN: 1868-8969

Fingerprint

Decision trees

Keywords

  • M-learning
  • Q-learning
  • Reinforcement learning
  • Stochastic hybrid game
  • Symbolic synthesis
  • Timed automata

Cite this

Larsen, K. G. (2019). Synthesis of Safe, Optimal and Compact Strategies for Stochastic Hybrid Games. In W. Fokkink, & R. van Glabbeek (Eds.), Proceedings of 30th International Conference on Concurrency Theory, CONCUR 2019 [2]. Schloss Dagstuhl: Schloss Dagstuhl - Leibniz-Zentrum für Informatik GmbH, Dagstuhl Publishing. Leibniz International Proceedings in Informatics, Vol. 140. https://doi.org/10.4230/LIPIcs.CONCUR.2019.2
@inproceedings{b3a6c399b0ee4e2e97b0b28f466e413d,
title = "Synthesis of Safe, Optimal and Compact Strategies for Stochastic Hybrid Games",
abstract = "UPPAAL-Stratego is a recent branch of the verification tool UPPAAL that allows for the synthesis of safe and optimal strategies for stochastic timed (hybrid) games. We describe newly developed learning methods that allow for the synthesis of significantly better strategies with much improved convergence behaviour. We also describe a novel use of decision trees for learning strategy representations that are orders of magnitude more compact. In both cases, the search for optimality does not compromise safety.",
keywords = "M-learning, Q-learning, Reinforcement learning, Stochastic hybrid game, Symbolic synthesis, Timed automata",
author = "Larsen, {Kim Guldstrand}",
year = "2019",
doi = "10.4230/LIPIcs.CONCUR.2019.2",
language = "English",
isbn = "978-3-95977-121-4",
series = "Leibniz International Proceedings in Informatics",
publisher = "Schloss Dagstuhl - Leibniz-Zentrum f{\"u}r Informatik GmbH, Dagstuhl Publishing",
editor = "Wan Fokkink and {van Glabbeek}, Rob",
booktitle = "Proceedings of 30th International Conference on Concurrency Theory, CONCUR 2019",

}

