On Time with Minimal Expected Cost!

Alexandre David; Peter Gjøl Jensen; Kim Guldstrand Larsen; Axel Legay; Didier Lime; Mathias Grund Sørensen; Jakob Haahr Taankvist

doi:10.1007/978-3-319-11936-6_10

Department of Computer Science

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

45 Citations (Scopus)

Abstract

(Priced) timed games are two-player quantitative games involving an environment assumed to be completely antagonistic. Classical analysis consists in the synthesis of strategies ensuring safety, time-bounded or cost-bounded reachability objectives. Assuming a randomized environment, the (priced) timed game essentially defines an infinite-state Markov (reward) decision process. In this setting the objective is classically to find a strategy that will minimize the expected reachability cost, but with no guarantees on worst-case behaviour. In this paper, we provide efficient methods for computing reachability strategies that will both ensure worst-case time bounds and provide (near-) minimal expected cost. Our method extends the synthesis algorithms of the synthesis tool Uppaal-Tiga with suitably adapted reinforcement learning techniques that exhibit several orders of magnitude improvements w.r.t. previously known automated methods.

Original language	English
Title of host publication	Automated Technology for Verification and Analysis
Editors	Franck Cassez, Jean-François Raskin
Number of pages	16
Volume	8837
Publisher	Springer Publishing Company
Publication date	2014
Pages	129-145
ISBN (Print)	978-3-319-11935-9
ISBN (Electronic)	978-3-319-11936-6
DOIs	https://doi.org/10.1007/978-3-319-11936-6_10
Publication status	Published - 2014
Event	Automated Technology for Verification and Analysis - Sydney, NSW, Australia Duration: 3 Nov 2014 → 7 Nov 2014

Conference

Conference	Automated Technology for Verification and Analysis
Country/Territory	Australia
City	Sydney, NSW
Period	03/11/2014 → 07/11/2014

Series	Lecture Notes in Computer Science
ISSN	0302-9743

Cite this

@inproceedings{14fdafd5794443969e0cdb12d6690a63,

title = "On Time with Minimal Expected Cost!",

abstract = "(Priced) timed games are two-player quantitative games involving an environment assumed to be completely antagonistic. Classical analysis consists in the synthesis of strategies ensuring safety, time-bounded or cost-bounded reachability objectives. Assuming a randomized environment, the (priced) timed game essentially defines an infinite-state Markov (reward) decision process. In this setting the objective is classically to find a strategy that will minimize the expected reachability cost, but with no guarantees on worst-case behaviour. In this paper, we provide efficient methods for computing reachability strategies that will both ensure worst-case time bounds and provide (near-) minimal expected cost. Our method extends the synthesis algorithms of the synthesis tool Uppaal-Tiga with suitably adapted reinforcement learning techniques that exhibit several orders of magnitude improvements w.r.t. previously known automated methods.",

author = "Alexandre David and Jensen, {Peter Gj{\o}l} and Larsen, {Kim Guldstrand} and Axel Legay and Didier Lime and S{\o}rensen, {Mathias Grund} and Taankvist, {Jakob Haahr}",

year = "2014",

doi = "10.1007/978-3-319-11936-6_10",

language = "English",

isbn = "978-3-319-11935-9",

volume = "8837",

series = "Lecture Notes in Computer Science",

publisher = "Springer Publishing Company",

pages = "129--145",

editor = "Franck Cassez and Jean-Fran{\c c}ois Raskin",

booktitle = "Automated Technology for Verification and Analysis",

address = "United States",

note = "Automated Technology for Verification and Analysis ; Conference date: 03-11-2014 Through 07-11-2014",

}

David, A, Jensen, PG, Larsen, KG, Legay, A, Lime, D, Sørensen, MG & Taankvist, JH 2014, On Time with Minimal Expected Cost! in F Cassez & J-F Raskin (eds), Automated Technology for Verification and Analysis. vol. 8837, Springer Publishing Company, Lecture Notes in Computer Science, pp. 129-145, Automated Technology for Verification and Analysis, Sydney, NSW, Australia, 03/11/2014. https://doi.org/10.1007/978-3-319-11936-6_10

On Time with Minimal Expected Cost! / David, Alexandre; Jensen, Peter Gjøl; Larsen, Kim Guldstrand et al.
Automated Technology for Verification and Analysis. ed. / Franck Cassez; Jean-François Raskin. Vol. 8837 Springer Publishing Company, 2014. p. 129-145 (Lecture Notes in Computer Science).

TY - GEN

T1 - On Time with Minimal Expected Cost!

AU - David, Alexandre

AU - Jensen, Peter Gjøl

AU - Larsen, Kim Guldstrand

AU - Legay, Axel

AU - Lime, Didier

AU - Sørensen, Mathias Grund

AU - Taankvist, Jakob Haahr

PY - 2014

Y1 - 2014

N2 - (Priced) timed games are two-player quantitative games involving an environment assumed to be completely antagonistic. Classical analysis consists in the synthesis of strategies ensuring safety, time-bounded or cost-bounded reachability objectives. Assuming a randomized environment, the (priced) timed game essentially defines an infinite-state Markov (reward) decision process. In this setting the objective is classically to find a strategy that will minimize the expected reachability cost, but with no guarantees on worst-case behaviour. In this paper, we provide efficient methods for computing reachability strategies that will both ensure worst-case time bounds and provide (near-) minimal expected cost. Our method extends the synthesis algorithms of the synthesis tool Uppaal-Tiga with suitably adapted reinforcement learning techniques that exhibit several orders of magnitude improvements w.r.t. previously known automated methods.

AB - (Priced) timed games are two-player quantitative games involving an environment assumed to be completely antagonistic. Classical analysis consists in the synthesis of strategies ensuring safety, time-bounded or cost-bounded reachability objectives. Assuming a randomized environment, the (priced) timed game essentially defines an infinite-state Markov (reward) decision process. In this setting the objective is classically to find a strategy that will minimize the expected reachability cost, but with no guarantees on worst-case behaviour. In this paper, we provide efficient methods for computing reachability strategies that will both ensure worst-case time bounds and provide (near-) minimal expected cost. Our method extends the synthesis algorithms of the synthesis tool Uppaal-Tiga with suitably adapted reinforcement learning techniques that exhibit several orders of magnitude improvements w.r.t. previously known automated methods.

U2 - 10.1007/978-3-319-11936-6_10

DO - 10.1007/978-3-319-11936-6_10

M3 - Article in proceeding

SN - 978-3-319-11935-9

VL - 8837

T3 - Lecture Notes in Computer Science

SP - 129

EP - 145

BT - Automated Technology for Verification and Analysis

A2 - Cassez, Franck

A2 - Raskin, Jean-François

PB - Springer Publishing Company

T2 - Automated Technology for Verification and Analysis

Y2 - 3 November 2014 through 7 November 2014

ER -