Abstract
We consider the situation where two agents each try to solve their own task in a common
environment. We present a general framework for representing this kind of scenario
based on Influence Diagrams (IDs). The framework is used to model the analysis depth
and time horizon of the opponent agent and to determine an optimal policy under various
assumptions about the opponent's analysis depth. Not surprisingly, the framework turns
out to have severe complexity problems even in simple scenarios due to the size of the
relevant past. We propose an algorithm based on Limited Memory Influence Diagrams
(LIMIDs) in which we convert the ID into a Bayesian network and perform single policy
updating. Empirical results are presented using a simple board game.
Original language | English |
---|---|
Title | Proceedings of the 4th European Workshop on Probabilistic Graphical Models |
Editors | Manfred Jaeger, Thomas D. Nielsen |
Publication date | 2008 |
Status | Published - 2008 |
Event | European Workshop on Probabilistic Graphical Models (PGM) - Hirtshals, Denmark. Duration: 17 Sep 2008 → 19 Sep 2008. Conference number: 4 |
Conference
Conference | European Workshop on Probabilistic Graphical Models (PGM) |
---|---|
Number | 4 |
Country/Territory | Denmark |
City | Hirtshals |
Period | 17/09/2008 → 19/09/2008 |