Compositional Shielding and Reinforcement Learning for Multi-Agent Systems

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

Abstract

Deep reinforcement learning has emerged as a powerful tool for obtaining high-performance policies. However, the safety of these policies has been a long-standing issue. One promising paradigm to guarantee safety is a shield, which ''shields'' a policy from making unsafe actions. However, computing a shield scales exponentially in the number of state variables. This is a particular concern in multi-agent systems with many agents. In this work, we propose a novel approach for multi-agent shielding. We address scalability by computing individual shields for each agent. The challenge is that typical safety specifications are global properties, but the shields of individual agents only ensure local properties. Our key to overcome this challenge is to apply assume-guarantee reasoning. Specifically, we present a sound proof rule that decomposes a (global, complex) safety specification into (local, simple) obligations for the shields of the individual agents. Moreover, we show that applying the shields during reinforcement learning significantly improves the quality of the policies obtained for a given training budget. We demonstrate the effectiveness and scalability of our multi-agent shielding framework in two case studies, reducing the computation time from hours to seconds and achieving fast learning convergence.
OriginalsprogEngelsk
TitelProceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems
RedaktørerYevgeniy Vorobeychik
Antal sider9
UdgivelsesstedRichland, SC, USA
ForlagAssociation for Computing Machinery (ACM)
Publikationsdato5 jun. 2025
Udgave24
Sider399-407
KapitelResearch Paper Track
ISBN (Elektronisk)979-8-4007-1426-9
StatusUdgivet - 5 jun. 2025
Begivenhed24th International Conference on Autonomous Agents and Multiagent Systems - Renaissance Center, Detroit, USA
Varighed: 19 maj 202523 maj 2025
Konferencens nummer: 24
https://aamas2025.org/

Konference

Konference24th International Conference on Autonomous Agents and Multiagent Systems
Nummer24
LokationRenaissance Center
Land/OmrådeUSA
ByDetroit
Periode19/05/202523/05/2025
Internetadresse

Fingeraftryk

Dyk ned i forskningsemnerne om 'Compositional Shielding and Reinforcement Learning for Multi-Agent Systems'. Sammen danner de et unikt fingeraftryk.

Citationsformater