Academic Journal

UAV Swarm Cooperative Dynamic Target Search: A MAPPO-Based Discrete Optimal Control Method

التفاصيل البيبلوغرافية
العنوان: UAV Swarm Cooperative Dynamic Target Search: A MAPPO-Based Discrete Optimal Control Method
المؤلفون: Dexing Wei, Lun Zhang, Quan Liu, Hao Chen, Jian Huang
المصدر: Drones, Vol 8, Iss 6, p 214 (2024)
بيانات النشر: MDPI AG
سنة النشر: 2024
المجموعة: Directory of Open Access Journals: DOAJ Articles
مصطلحات موضوعية: UAVs, optimal control, dynamic target search, multi-agents, MAPPO, Motor vehicles. Aeronautics. Astronautics, TL1-4050
الوصف: Unmanned aerial vehicles (UAVs) are commonly employed in pursuit and rescue missions, where the target’s trajectory is unknown. Traditional methods, such as evolutionary algorithms and ant colony optimization, can generate a search route in a given scenario. However, when the scene changes, the solution needs to be recalculated. In contrast, more advanced deep reinforcement learning methods can train an agent that can be directly applied to a similar task without recalculation. Nevertheless, there are several challenges when the agent learns how to search for unknown dynamic targets. In this search task, the rewards are random and sparse, which makes learning difficult. In addition, because of the need for the agent to adapt to various scenario settings, interactions required between the agent and the environment are more comparable to typical reinforcement learning tasks. These challenges increase the difficulty of training agents. To address these issues, we propose the OC-MAPPO method, which combines optimal control (OC) and Multi-Agent Proximal Policy Optimization (MAPPO) with GPU parallelization. The optimal control model provides the agent with continuous and stable rewards. Through parallelized models, the agent can interact with the environment and collect data more rapidly. Experimental results demonstrate that the proposed method can help the agent learn faster, and the algorithm demonstrated a 26.97% increase in the success rate compared to genetic algorithms.
نوع الوثيقة: article in journal/newspaper
اللغة: English
تدمد: 2504-446X
Relation: https://www.mdpi.com/2504-446X/8/6/214; https://doaj.org/toc/2504-446X; https://doaj.org/article/1b8e3c7ba55f47e8a527b67e413ffeb2
DOI: 10.3390/drones8060214
الاتاحة: https://doi.org/10.3390/drones8060214
https://doaj.org/article/1b8e3c7ba55f47e8a527b67e413ffeb2
رقم الانضمام: edsbas.3812F165
قاعدة البيانات: BASE
الوصف
تدمد:2504446X
DOI:10.3390/drones8060214