Optimization of cooperative environmental data acquisition by UAV swarm based on reinforcement learning algorithm and biomechanical inspiration
Abstract
Inspired by biomechanics, to enhance the collaborative efficiency of UAV swarms in complex—environment data collection, an innovative optimization scheme is proposed. This scheme draws parallels from the principles of biomechanics, such as the coordinated movement of biological organisms and their ability to adapt to various environmental stresses. Just as living organisms adjust their postures and movements in response to external forces to maintain balance and perform tasks efficiently, the proposed UAV swarm system aims to achieve better adaptability and efficiency through the key techniques of path planning, task allocation, and load balancing, which are inspired by the biomechanical mechanisms of coordination and adaptation. Load balancing in the UAV swarm is inspired by the way biological systems distribute mechanical stress. In the human body, different muscles and bones work together to evenly distribute the load during movement. Similarly, UAVs in the swarm need to balance the data—collection load to prevent over - stressing any single UAV. The results show that in the experiment where the number of nodes is increased from 50 to 200, the data acquisition coverage is improved from 93.4% to 98.1%, the task completion time is shortened from 112 to 73 s, and the energy consumption is controlled within the range of 180 to 430 Joules. The reinforcement learning algorithm demonstrated advantages over traditional methods in several performance metrics, including reducing the average transmission delay to 18.6 ms and efficiently distributing the task load, reducing the percentage of highly loaded nodes to 5.6%. These results validate the important role of the reinforcement—learning algorithm, which is inspired by biomechanics, in UAV—swarm cooperative environmental data collection. By mimicking the efficient and adaptable mechanisms in biological systems, the proposed optimization scheme for UAV swarms can better meet the challenges of complex - environment data collection.
References
1. Iqbal U, Lee Y, Cho S, et al. Cooperative transmission of UAV swarm using orthogonal time–frequency space modulation. ICT Express. 2024; 10(6): 1240–1246. doi: 10.1016/j.icte.2024.07.002
2. Panowicz R, Stecz W. Robust Optimization Models for Planning Drone Swarm Missions. Drones. 2024; 8(10): 572. doi: 10.3390/drones8100572
3. Wang X, Zhang X, Lu Y, et al. Target Trajectory Prediction-Based UAV Swarm Cooperative for Bird-Driving Strategy at Airport. Electronics. 2024; 13(19): 3868. doi: 10.3390/electronics13193868
4. Zhao L, Chen B, Hu F. Research on Cooperative Obstacle Avoidance Decision Making of Unmanned Aerial Vehicle Swarms in Complex Environments under End-Edge-Cloud Collaboration Model. Drones. 2024; 8(9): 461. doi: 10.3390/drones8090461
5. Chen J, Chen Y, Nie R, et al. Application of improved grey wolf model in collaborative trajectory optimization of unmanned aerial vehicle swarm. Scientific Reports. 2024; 14(1). doi: 10.1038/s41598-024-65383-9
6. Yu H, Yang X, Zhang Y, et al. Cooperative Target Fencing Control for Unmanned Aerial Vehicle Swarm with Collision, Obstacle Avoidance, and Connectivity Maintenance. Drones. 2024; 8(7): 317. doi: 10.3390/drones8070317
7. Li J, Cheng H, Wang C, et al. Reinforced robotic bean optimization algorithm for cooperative target search of unmanned aerial vehicle swarm. Complex & Intelligent Systems. 2024; 10(5): 7109–7126. doi: 10.1007/s40747-024-01536-7
8. Wei D, Zhang L, Liu Q, et al. UAV Swarm Cooperative Dynamic Target Search: A MAPPO-Based Discrete Optimal Control Method. Drones. 2024; 8(6): 214. doi: 10.3390/drones8060214
9. Yin S, Wang X, Luo L, et al. Collaborative strategy research of target tracking based on natural intelligence by UAV swarm. Proceedings of the Institution of Mechanical Engineers, Part G: Journal of Aerospace Engineering. 2024; 238(6): 549–564. doi: 10.1177/09544100241233313
10. Jing X. Research on key technologies of UAV cluster cooperative system for Internet of Things applications. Journal of Control and Decision. 2022; 11(1): 26–35. doi: 10.1080/23307706.2022.2089749
11. Wei Z, Wei R. UAV Swarm Rounding Strategy Based on Deep Reinforcement Learning Goal Consistency with Multi-Head Soft Attention Algorithm. Drones. 2024; 8(12): 731. doi: 10.3390/drones8120731
12. Shu X, Lin A, Wen X. Energy-Saving Multi-Agent Deep Reinforcement Learning Algorithm for Drone Routing Problem. Sensors. 2024; 24(20): 6698. doi: 10.3390/s24206698
13. Demir K, Tumen V, Kosunalp S, et al. A Deep Reinforcement Learning Algorithm for Trajectory Planning of Swarm UAV Fulfilling Wildfire Reconnaissance. Electronics. 2024; 13(13): 2568. doi: 10.3390/electronics13132568
14. Jiang Z, Song T, Yang B, et al. Fault-Tolerant Control for Multi-UAV Exploration System via Reinforcement Learning Algorithm. Aerospace. 2024; 11(5): 372. doi: 10.3390/aerospace11050372
15. Wang F, Zhu X, Zhou Z, et al. Deep-reinforcement-learning-based UAV autonomous navigation and collision avoidance in unknown environments. Chinese Journal of Aeronautics. 2024; 37(3): 237–257. doi: 10.1016/j.cja.2023.09.033
16. Ji H, Jin Y. Knowledge Acquisition of Self-Organizing Systems With Deep Multiagent Reinforcement Learning. Journal of Computing and Information Science in Engineering. 2021; 22(2). doi: 10.1115/1.4052800
17. Ghassemi P, Chowdhury S. An Extended Bayesian Optimization Approach to Decentralized Swarm Robotic Search. Journal of Computing and Information Science in Engineering. 2020; 20(5). doi: 10.1115/1.4046587
18. Thakur A, Sahoo S, Mukherjee A, et al. Making Robotic Swarms Trustful: A Blockchain-Based Perspective. Journal of Computing and Information Science in Engineering. 2023; 23(6). doi: 10.1115/1.4062326
19. Fang X, Wang C, Nguyen TM, et al. Graph Optimization Approach to Range-Based Localization. IEEE Transactions on Systems, Man, and Cybernetics: Systems. 2021; 51(11): 6830–6841. doi: 10.1109/tsmc.2020.2964713
20. Ghobadi N, Sepehri N, Kinsner W, et al. Beyond Human Touch: Integrating Soft Robotics with Environmental Interaction for Advanced Applications. Actuators. 2024; 13(12): 507. doi: 10.3390/act13120507
Copyright (c) 2025 Author(s)

This work is licensed under a Creative Commons Attribution 4.0 International License.
Copyright on all articles published in this journal is retained by the author(s), while the author(s) grant the publisher as the original publisher to publish the article.
Articles published in this journal are licensed under a Creative Commons Attribution 4.0 International, which means they can be shared, adapted and distributed provided that the original published version is cited.