Distributed reinforcement learning for flexible UAV swarm control with transfer learning capabilities

Federico Venturini*, Federico Mason, Francesco Pase, Federico Chiariotti, Andrea Zanella, Michele Zorzi, Alberto Testolin

*Corresponding author

Publication: Contribution to book/anthology/report/conference proceeding; Conference article in proceedings; Research; Peer-reviewed

10 Citations (Scopus)

Abstract

Over the past few years, the use of swarms of Unmanned Aerial Vehicles (UAVs) in monitoring and remote area surveillance applications has become economically efficient thanks to the price reduction and the increased capabilities of drones. The drones in the swarm need to cooperatively explore an unknown area, in order to identify and monitor interesting targets, while minimizing their movements. In this work, we propose a distributed Reinforcement Learning (RL) approach that scales to larger swarms without modifications. The proposed framework can easily deal with non-uniform distributions of targets, drawing from past experience to improve its performance. In particular, our experiments show that when agents are trained for a specific scenario, they can adapt to a new one with a minimal amount of additional training. We show that our RL approach achieves favorable performance compared to a computationally intensive look-ahead heuristic.
Original language: English
Title: Proceedings of the 6th ACM Workshop on Micro Aerial Vehicle Networks, Systems, and Applications
Number of pages: 6
Publisher: Association for Computing Machinery
Publication date: Jul. 2020
Article number: 10
ISBN (Electronic): 9781450380102
DOI
Status: Published - Jul. 2020
Published externally: Yes
Event: 6th ACM Workshop on Micro Aerial Vehicle Networks, Systems, and Applications, co-located with MobiSys 2020
Duration: 15 Jun. 2020 → …

Conference

Conference: 6th ACM Workshop on Micro Aerial Vehicle Networks, Systems, and Applications, co-located with MobiSys 2020
Period: 15/06/2020 → …
