Improved use of partial policies for identifying behavioral equivalence

Yifeng Zeng, Hua Mao, Yinghui Pan, Jian Luo

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

14 Citationer (Scopus)

Abstract

Interactive multiagent decision making often requires to predict actions of other agents by solving their behavioral models from the perspective of the modeling agent. Unfortunately, the general space of models in the absence of constraining assumptions tends to be very large thereby making multiagent decision making intractable. One approach that can reduce the model space is to cluster behaviorally equivalent models that exhibit identical policies over the whole planning horizon. Currently, the state of the art on identifying equivalence of behavioral models compares partial policy trees instead of entire trees. In this paper, we further improve the use of partial trees for the identification purpose and develop an incremental comparison strategy in order to efficiently ascertain the model equivalence. We investigate the improved approach in a well-defined probabilistic graphical model for sequential multiagent decision making - interactive dynamic influence diagrams, and evaluate its performance over multiple problem domains.

OriginalsprogEngelsk
Titel11th International Conference on Autonomous Agents and Multiagent Systems 2012, AAMAS 2012: Innovative Applications Track
Antal sider8
Vol/bind1
ForlagInternational Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS)
Publikationsdato2012
Sider1015-1022
ISBN (Trykt)0-9817381-2-5
ISBN (Elektronisk)978-0-9817381-2-3
StatusUdgivet - 2012
Begivenhed11th International Conference on Autonomous Agents and Multiagent Systems 2012: Innovative Applications Track, AAMAS 2012 - Valencia, Spanien
Varighed: 4 jun. 20128 jun. 2012

Konference

Konference11th International Conference on Autonomous Agents and Multiagent Systems 2012: Innovative Applications Track, AAMAS 2012
Land/OmrådeSpanien
ByValencia
Periode04/06/201208/06/2012

Fingeraftryk

Dyk ned i forskningsemnerne om 'Improved use of partial policies for identifying behavioral equivalence'. Sammen danner de et unikt fingeraftryk.

Citationsformater