Abstract
Interactive dynamic influence diagrams (I-DIDs) are graphical models for sequential decision making in partially observable settings shared by other agents. Algorithms for solving I-DIDs face the challenge of an exponentially growing space of candidate models ascribed to other agents, over time. Previous approach for exactly solving I-DIDs groups together models having similar solutions into behaviorally equivalent classes and updates these classes. We present a new method that, in addition to aggregating behaviorally equivalent models, further groups models that prescribe identical actions at a single time step. We show how to update these augmented classes and prove that our method is exact. The new approach enables us to bound the aggregated model space by the cardinality of other agents' actions. We evaluate its performance and provide empirical results in support.
Originalsprog | Engelsk |
---|---|
Tidsskrift | IJCAI Proceedings - International Joint Conference on Artificial Intelligence |
Udgave nummer | 21 |
Sider (fra-til) | 1996-2001 |
ISSN | 1045-0823 |
Status | Udgivet - 2009 |
Begivenhed | Proceedings of the 21st international jont conference on Artifical intelligence - Pasadena, USA Varighed: 11 jul. 2009 → 17 jul. 2009 Konferencens nummer: 21 |
Konference
Konference | Proceedings of the 21st international jont conference on Artifical intelligence |
---|---|
Nummer | 21 |
Land/Område | USA |
By | Pasadena |
Periode | 11/07/2009 → 17/07/2009 |