Optimization of cooperative environmental data acquisition by UAV swarm based on reinforcement learning algorithm and biomechanical inspiration

Shouting  Xin; Jun  Yang

doi:10.62617/mcb1356

Shouting Xin Hebi Institute of Engineering and Technology, Henan Polytechnic University, Hebi 458030, China
Jun Yang Anyang Advanced Technical School, Anyang Vocational and Technical College, Anyang 455000, China

DOI: https://doi.org/10.62617/mcb1356

Keywords: reinforcement learning algorithm; biomechanical inspiration; UAV swarm; cooperative optimization

Article ID: 1356

Abstract

Inspired by biomechanics, to enhance the collaborative efficiency of UAV swarms in complex—environment data collection, an innovative optimization scheme is proposed. This scheme draws parallels from the principles of biomechanics, such as the coordinated movement of biological organisms and their ability to adapt to various environmental stresses. Just as living organisms adjust their postures and movements in response to external forces to maintain balance and perform tasks efficiently, the proposed UAV swarm system aims to achieve better adaptability and efficiency through the key techniques of path planning, task allocation, and load balancing, which are inspired by the biomechanical mechanisms of coordination and adaptation. Load balancing in the UAV swarm is inspired by the way biological systems distribute mechanical stress. In the human body, different muscles and bones work together to evenly distribute the load during movement. Similarly, UAVs in the swarm need to balance the data—collection load to prevent over - stressing any single UAV. The results show that in the experiment where the number of nodes is increased from 50 to 200, the data acquisition coverage is improved from 93.4% to 98.1%, the task completion time is shortened from 112 to 73 s, and the energy consumption is controlled within the range of 180 to 430 Joules. The reinforcement learning algorithm demonstrated advantages over traditional methods in several performance metrics, including reducing the average transmission delay to 18.6 ms and efficiently distributing the task load, reducing the percentage of highly loaded nodes to 5.6%. These results validate the important role of the reinforcement—learning algorithm, which is inspired by biomechanics, in UAV—swarm cooperative environmental data collection. By mimicking the efficient and adaptable mechanisms in biological systems, the proposed optimization scheme for UAV swarms can better meet the challenges of complex - environment data collection.

References

1. Iqbal U, Lee Y, Cho S, et al. Cooperative transmission of UAV swarm using orthogonal time–frequency space modulation. ICT Express. 2024; 10(6): 1240–1246. doi: 10.1016/j.icte.2024.07.002

2. Panowicz R, Stecz W. Robust Optimization Models for Planning Drone Swarm Missions. Drones. 2024; 8(10): 572. doi: 10.3390/drones8100572

3. Wang X, Zhang X, Lu Y, et al. Target Trajectory Prediction-Based UAV Swarm Cooperative for Bird-Driving Strategy at Airport. Electronics. 2024; 13(19): 3868. doi: 10.3390/electronics13193868

4. Zhao L, Chen B, Hu F. Research on Cooperative Obstacle Avoidance Decision Making of Unmanned Aerial Vehicle Swarms in Complex Environments under End-Edge-Cloud Collaboration Model. Drones. 2024; 8(9): 461. doi: 10.3390/drones8090461

5. Chen J, Chen Y, Nie R, et al. Application of improved grey wolf model in collaborative trajectory optimization of unmanned aerial vehicle swarm. Scientific Reports. 2024; 14(1). doi: 10.1038/s41598-024-65383-9

6. Yu H, Yang X, Zhang Y, et al. Cooperative Target Fencing Control for Unmanned Aerial Vehicle Swarm with Collision, Obstacle Avoidance, and Connectivity Maintenance. Drones. 2024; 8(7): 317. doi: 10.3390/drones8070317

7. Li J, Cheng H, Wang C, et al. Reinforced robotic bean optimization algorithm for cooperative target search of unmanned aerial vehicle swarm. Complex & Intelligent Systems. 2024; 10(5): 7109–7126. doi: 10.1007/s40747-024-01536-7

8. Wei D, Zhang L, Liu Q, et al. UAV Swarm Cooperative Dynamic Target Search: A MAPPO-Based Discrete Optimal Control Method. Drones. 2024; 8(6): 214. doi: 10.3390/drones8060214

9. Yin S, Wang X, Luo L, et al. Collaborative strategy research of target tracking based on natural intelligence by UAV swarm. Proceedings of the Institution of Mechanical Engineers, Part G: Journal of Aerospace Engineering. 2024; 238(6): 549–564. doi: 10.1177/09544100241233313

10. Jing X. Research on key technologies of UAV cluster cooperative system for Internet of Things applications. Journal of Control and Decision. 2022; 11(1): 26–35. doi: 10.1080/23307706.2022.2089749

11. Wei Z, Wei R. UAV Swarm Rounding Strategy Based on Deep Reinforcement Learning Goal Consistency with Multi-Head Soft Attention Algorithm. Drones. 2024; 8(12): 731. doi: 10.3390/drones8120731

12. Shu X, Lin A, Wen X. Energy-Saving Multi-Agent Deep Reinforcement Learning Algorithm for Drone Routing Problem. Sensors. 2024; 24(20): 6698. doi: 10.3390/s24206698

13. Demir K, Tumen V, Kosunalp S, et al. A Deep Reinforcement Learning Algorithm for Trajectory Planning of Swarm UAV Fulfilling Wildfire Reconnaissance. Electronics. 2024; 13(13): 2568. doi: 10.3390/electronics13132568

14. Jiang Z, Song T, Yang B, et al. Fault-Tolerant Control for Multi-UAV Exploration System via Reinforcement Learning Algorithm. Aerospace. 2024; 11(5): 372. doi: 10.3390/aerospace11050372

15. Wang F, Zhu X, Zhou Z, et al. Deep-reinforcement-learning-based UAV autonomous navigation and collision avoidance in unknown environments. Chinese Journal of Aeronautics. 2024; 37(3): 237–257. doi: 10.1016/j.cja.2023.09.033

16. Ji H, Jin Y. Knowledge Acquisition of Self-Organizing Systems With Deep Multiagent Reinforcement Learning. Journal of Computing and Information Science in Engineering. 2021; 22(2). doi: 10.1115/1.4052800

17. Ghassemi P, Chowdhury S. An Extended Bayesian Optimization Approach to Decentralized Swarm Robotic Search. Journal of Computing and Information Science in Engineering. 2020; 20(5). doi: 10.1115/1.4046587

18. Thakur A, Sahoo S, Mukherjee A, et al. Making Robotic Swarms Trustful: A Blockchain-Based Perspective. Journal of Computing and Information Science in Engineering. 2023; 23(6). doi: 10.1115/1.4062326

19. Fang X, Wang C, Nguyen TM, et al. Graph Optimization Approach to Range-Based Localization. IEEE Transactions on Systems, Man, and Cybernetics: Systems. 2021; 51(11): 6830–6841. doi: 10.1109/tsmc.2020.2964713

20. Ghobadi N, Sepehri N, Kinsner W, et al. Beyond Human Touch: Integrating Soft Robotics with Environmental Interaction for Advanced Applications. Actuators. 2024; 13(12): 507. doi: 10.3390/act13120507

Optimization of cooperative environmental data acquisition by UAV swarm based on reinforcement learning algorithm and biomechanical inspiration

Abstract

References

Further Information

Guidelines

Contact

WhatsApp: