A Decentralized Multi-Agent Reinforcement Learning Framework for Cooperative UAV Search: Navigating Non-Stationarity via Reward Shaping
DOI :
https://doi.org/10.63944/e1p7kr68Mots-clés :
Multi Agent Reinforcement Learning, Cooperative Search, Decentralized Partially Observable Markov Decision Process, Reward Shaping, Non-stationarityRésumé
Due to the complex coupling between high-dimensional state spaces and the stringent constraints of local perception, collaborative search by multi UAV swarms in unknown environments poses a formidable challenge within the field of autonomous systems. Although early algorithmic attempts often assumed stationary targets or perfect communication networks, the inherent non-stationarity of real-world dynamic environments renders traditional independent learning paradigms highly inefficient. Sometimes even leading to complete divergence during training. In an effort to overcome these analytical bottlenecks, this paper explores a Decentralized Partially Observable Markov Decision Process framework and introduces specific Multi Agent Reinforcement Learning methods under a Centralized Training with Decentralized Execution architecture. To fully validate and elucidate these emergent collaborative behaviors, extensive verification in real-world physical environments, as well as further research specifically addressing communication-constrained settings.
Téléchargements
Publiée
Numéro
Rubrique
Catégories
Licence

Ce travail est disponible sous la licence Creative Commons Attribution 4.0 International .
Authors retain copyright and grant the journal the right of first publication. This work is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). Provided that appropriate credit is given and the original publication in this journal is properly cited, others may copy, distribute, transmit, and adapt the work, including for commercial purposes. Authors may also deposit the published version in institutional repositories or on personal websites with a full citation to the final published article and a link to the journal page.