On Time with Minimal Expected Cost!

Alexandre David, Peter Gjøl Jensen, Kim Guldstrand Larsen, Axel Legay, Didier Lime, Mathias Grund Sørensen, Jakob Haahr Taankvist

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

45 Citations (Scopus)

Abstract

(Priced) timed games are two-player quantitative games involving an environment assumed to be completely antogonistic. Classical analysis consists in the synthesis of strategies ensuring safety, time-bounded or cost-bounded reachability objectives. Assuming a randomized environment, the (priced) timed game essentially defines an infinite-state Markov (reward) decision proces. In this setting the objective is classically to find a strategy that will minimize the expected reachability cost, but with no guarantees on worst-case behaviour. In this paper, we provide efficient methods for computing reachability strategies that will both ensure worst case time-bounds as well as provide (near-) minimal expected cost. Our method extends the synthesis algorithms of the synthesis tool Uppaal-Tiga with suitable adapted reinforcement learning techniques, that exhibits several orders of magnitude improvements w.r.t. previously known automated methods.
Original languageEnglish
Title of host publicationAutomated Technology for Verification and Analysis
EditorsFranck Cassez, Jean-François Raskin
Number of pages16
Volume8837
PublisherSpringer Publishing Company
Publication date2014
Pages129-145
ISBN (Print)978-3-319-11935-9
ISBN (Electronic)978-3-319-11936-6
DOIs
Publication statusPublished - 2014
EventAutomated Technology for Verification and Analysis - Sydney, NSW, Australia
Duration: 3 Nov 20127 Nov 2014

Conference

ConferenceAutomated Technology for Verification and Analysis
Country/TerritoryAustralia
CitySydney, NSW
Period03/11/201207/11/2014
SeriesLecture Notes in Computer Science
ISSN0302-9743

Cite this