On Time with Minimal Expected Cost!

Alexandre David; Peter Gjøl Jensen; Kim Guldstrand Larsen; Axel Legay; Didier Lime; Mathias Grund Sørensen; Jakob Haahr Taankvist

doi:10.1007/978-3-319-11936-6_10

On Time with Minimal Expected Cost!

Alexandre David, Peter Gjøl Jensen, Kim Guldstrand Larsen, Axel Legay, Didier Lime, Mathias Grund Sørensen, Jakob Haahr Taankvist

Institut for Datalogi

Publikation: Bidrag til bog/antologi/rapport/konference proceeding › Konferenceartikel i proceeding › Forskning › peer review

45 Citationer (Scopus)

Abstract

(Priced) timed games are two-player quantitative games involving an environment assumed to be completely antogonistic. Classical analysis consists in the synthesis of strategies ensuring safety, time-bounded or cost-bounded reachability objectives. Assuming a randomized environment, the (priced) timed game essentially defines an infinite-state Markov (reward) decision proces. In this setting the objective is classically to find a strategy that will minimize the expected reachability cost, but with no guarantees on worst-case behaviour. In this paper, we provide efficient methods for computing reachability strategies that will both ensure worst case time-bounds as well as provide (near-) minimal expected cost. Our method extends the synthesis algorithms of the synthesis tool Uppaal-Tiga with suitable adapted reinforcement learning techniques, that exhibits several orders of magnitude improvements w.r.t. previously known automated methods.

Originalsprog	Engelsk
Titel	Automated Technology for Verification and Analysis
Redaktører	Franck Cassez, Jean-François Raskin
Antal sider	16
Vol/bind	8837
Forlag	Springer Publishing Company
Publikationsdato	2014
Sider	129-145
ISBN (Trykt)	978-3-319-11935-9
ISBN (Elektronisk)	978-3-319-11936-6
DOI	https://doi.org/10.1007/978-3-319-11936-6_10
Status	Udgivet - 2014
Begivenhed	Automated Technology for Verification and Analysis - Sydney, NSW, Australien Varighed: 3 nov. 2012 → 7 nov. 2014

Konference

Konference	Automated Technology for Verification and Analysis
Land/Område	Australien
By	Sydney, NSW
Periode	03/11/2012 → 07/11/2014

Navn	Lecture Notes in Computer Science
ISSN	0302-9743

Adgang til dokumentet

10.1007/978-3-319-11936-6_10

AUB Link

Søg efter materialet i Aalborg Universitetsbiblioteks søgemaskine

Citationsformater

@inproceedings{14fdafd5794443969e0cdb12d6690a63,

title = "On Time with Minimal Expected Cost!",

abstract = "(Priced) timed games are two-player quantitative games involving an environment assumed to be completely antogonistic. Classical analysis consists in the synthesis of strategies ensuring safety, time-bounded or cost-bounded reachability objectives. Assuming a randomized environment, the (priced) timed game essentially defines an infinite-state Markov (reward) decision proces. In this setting the objective is classically to find a strategy that will minimize the expected reachability cost, but with no guarantees on worst-case behaviour. In this paper, we provide efficient methods for computing reachability strategies that will both ensure worst case time-bounds as well as provide (near-) minimal expected cost. Our method extends the synthesis algorithms of the synthesis tool Uppaal-Tiga with suitable adapted reinforcement learning techniques, that exhibits several orders of magnitude improvements w.r.t. previously known automated methods.",

author = "Alexandre David and Jensen, {Peter Gj{\o}l} and Larsen, {Kim Guldstrand} and Axel Legay and Didier Lime and S{\o}rensen, {Mathias Grund} and Taankvist, {Jakob Haahr}",

year = "2014",

doi = "10.1007/978-3-319-11936-6_10",

language = "English",

isbn = "978-3-319-11935-9",

volume = "8837",

series = "Lecture Notes in Computer Science",

publisher = "Springer Publishing Company",

pages = "129--145",

editor = "Franck Cassez and Jean-Fran{\c c}ois Raskin",

booktitle = "Automated Technology for Verification and Analysis",

address = "United States",

note = "Automated Technology for Verification and Analysis ; Conference date: 03-11-2012 Through 07-11-2014",

}

David, A, Jensen, PG , Larsen, KG, Legay, A, Lime, D, Sørensen, MG & Taankvist, JH 2014, On Time with Minimal Expected Cost! i F Cassez & J-F Raskin (red), Automated Technology for Verification and Analysis. bind 8837, Springer Publishing Company, Lecture Notes in Computer Science, s. 129-145, Automated Technology for Verification and Analysis, Sydney, NSW, Australien, 03/11/2012. https://doi.org/10.1007/978-3-319-11936-6_10

On Time with Minimal Expected Cost! / David, Alexandre; Jensen, Peter Gjøl ; Larsen, Kim Guldstrand et al.
Automated Technology for Verification and Analysis. red. / Franck Cassez; Jean-François Raskin. Bind 8837 Springer Publishing Company, 2014. s. 129-145 (Lecture Notes in Computer Science).

Publikation: Bidrag til bog/antologi/rapport/konference proceeding › Konferenceartikel i proceeding › Forskning › peer review

TY - GEN

T1 - On Time with Minimal Expected Cost!

AU - David, Alexandre

AU - Jensen, Peter Gjøl

AU - Larsen, Kim Guldstrand

AU - Legay, Axel

AU - Lime, Didier

AU - Sørensen, Mathias Grund

AU - Taankvist, Jakob Haahr

PY - 2014

Y1 - 2014

N2 - (Priced) timed games are two-player quantitative games involving an environment assumed to be completely antogonistic. Classical analysis consists in the synthesis of strategies ensuring safety, time-bounded or cost-bounded reachability objectives. Assuming a randomized environment, the (priced) timed game essentially defines an infinite-state Markov (reward) decision proces. In this setting the objective is classically to find a strategy that will minimize the expected reachability cost, but with no guarantees on worst-case behaviour. In this paper, we provide efficient methods for computing reachability strategies that will both ensure worst case time-bounds as well as provide (near-) minimal expected cost. Our method extends the synthesis algorithms of the synthesis tool Uppaal-Tiga with suitable adapted reinforcement learning techniques, that exhibits several orders of magnitude improvements w.r.t. previously known automated methods.

AB - (Priced) timed games are two-player quantitative games involving an environment assumed to be completely antogonistic. Classical analysis consists in the synthesis of strategies ensuring safety, time-bounded or cost-bounded reachability objectives. Assuming a randomized environment, the (priced) timed game essentially defines an infinite-state Markov (reward) decision proces. In this setting the objective is classically to find a strategy that will minimize the expected reachability cost, but with no guarantees on worst-case behaviour. In this paper, we provide efficient methods for computing reachability strategies that will both ensure worst case time-bounds as well as provide (near-) minimal expected cost. Our method extends the synthesis algorithms of the synthesis tool Uppaal-Tiga with suitable adapted reinforcement learning techniques, that exhibits several orders of magnitude improvements w.r.t. previously known automated methods.

U2 - 10.1007/978-3-319-11936-6_10

DO - 10.1007/978-3-319-11936-6_10

M3 - Article in proceeding

SN - 978-3-319-11935-9

VL - 8837

T3 - Lecture Notes in Computer Science

SP - 129

EP - 145

BT - Automated Technology for Verification and Analysis

A2 - Cassez, Franck

A2 - Raskin, Jean-François

PB - Springer Publishing Company

T2 - Automated Technology for Verification and Analysis

Y2 - 3 November 2012 through 7 November 2014

ER -