A customizable environment based on Deep Reinforcement Learning (DRL) is here provided. The environment, called DAMIAN (Delay-Aware MultI Aerieal Navigation), allows to use different strategies for handling cooperative multi-UAV (Unmanned Aerial Vehicles) scenarios when delays are involved in the decision-making process for tasks such as spotting, tracking, coverage and many others. DAMIAN implements a realistic benchmark for delay-aware DRL solutions of cooperative multi-UAV tasks and is useful to develop and compare new solutions for this problem. Users can choose among various combinations of tasks and parameters and customize the scenarios by im- plementing new desired functionalities. Different approaches can be compared, taking into account either implicitly or explicitly the delays applied to actions and observations for both single-agent and multi-agent scenarios. The awareness of the delay, along with the possible usage of real-world-based external files, definitely increases the reality level of the environment by possibly easing the knowl- edge transferability process of the learned policy from the simulated environment to the real one. The extensible and modifiable environment developed along with the baselines use cases provided can generate new benchmarking tools for use cases where delays need to be considered.

DAMIAN: a delay-aware multI-aerial navigation environment for cooperative DRL-based UAV systems / Brunori, Damiano. - (2024 May 06).

DAMIAN: a delay-aware multI-aerial navigation environment for cooperative DRL-based UAV systems

BRUNORI, DAMIANO
06/05/2024

Abstract

A customizable environment based on Deep Reinforcement Learning (DRL) is here provided. The environment, called DAMIAN (Delay-Aware MultI Aerieal Navigation), allows to use different strategies for handling cooperative multi-UAV (Unmanned Aerial Vehicles) scenarios when delays are involved in the decision-making process for tasks such as spotting, tracking, coverage and many others. DAMIAN implements a realistic benchmark for delay-aware DRL solutions of cooperative multi-UAV tasks and is useful to develop and compare new solutions for this problem. Users can choose among various combinations of tasks and parameters and customize the scenarios by im- plementing new desired functionalities. Different approaches can be compared, taking into account either implicitly or explicitly the delays applied to actions and observations for both single-agent and multi-agent scenarios. The awareness of the delay, along with the possible usage of real-world-based external files, definitely increases the reality level of the environment by possibly easing the knowl- edge transferability process of the learned policy from the simulated environment to the real one. The extensible and modifiable environment developed along with the baselines use cases provided can generate new benchmarking tools for use cases where delays need to be considered.
6-mag-2024
File allegati a questo prodotto
File Dimensione Formato  
Tesi_dottorato_Brunori.pdf

accesso aperto

Note: A delay-aware DRL-based framework for cooperative multi-UAV systems is here proposed.
Tipologia: Tesi di dottorato
Licenza: Creative commons
Dimensione 8.78 MB
Formato Adobe PDF
8.78 MB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1709432
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact