Deep Reinforcement Learning for Autonomous Drone Navigation
Keywords:
Deep Reinforcement Learning; Autonomous Drone Navigation; UAV Path Planning; Simulation-to-Real Transfer; Sensor Fusion; Proximal Policy Optimization (PPO).Abstract
The rapid advancement of artificial intelligence, sensor technology, and robotics has significantly enhanced the autonomous capabilities of unmanned aerial vehicles (UAVs), yet achieving fully autonomous navigation in complex, dynamic, and unstructured environments remains a persistent challenge. Traditional rule-based and classical control algorithms often fail to adapt to unpredictable conditions, motivating the integration of Deep Reinforcement Learning (DRL) for intelligent, experience-driven navigation. This study proposes a DRL-based autonomous drone navigation framework that combines deep neural perception with reinforcement-driven decision-making, enabling UAVs to learn optimal flight policies through interaction with both simulated and real environments. Multiple DRL algorithms, including DQN, PPO, DDPG, SAC, and A3C, were implemented and evaluated using high-fidelity simulators such as AirSim and Gazebo, supported by sensor fusion from LiDAR, RGB-D cameras, IMU, and GPS. A structured methodology was adopted involving environment modelling, state–action space design, reward engineering, actor–critic network optimisation, and Sim2Real transfer techniques. Experimental results demonstrate that DRL-based models significantly outperform traditional navigation approaches in obstacle avoidance, trajectory optimisation, and generalisation to unseen scenarios. PPO achieved a 92% collision-free success rate, while SAC excelled in continuous control and environmental uncertainty. Further analysis confirms the importance of reward shaping, hybrid sensor inputs, and curriculum learning for robust convergence. Although training time and sim-to-real discrepancies pose challenges, the findings establish DRL as a powerful paradigm for next-generation UAV autonomy. The proposed system offers substantial potential for applications in disaster response, surveillance, environmental monitoring, and multi-drone coordination, contributing to more adaptive, intelligent, and safe aerial navigation systems.
References
Pan, Y., Yang, Z., Zhang, Z., & Chen, W. (2022). Deep reinforcement learning–based autonomous navigation for UAVs in complex environments. IEEE Transactions on Vehicular Technology, 71(4), 3891–3904.
Khan, A., Wang, F., & Ahmad, I. (2021). A DRL-enabled collision-free path planning framework for unmanned aerial vehicles. Aerospace Science and Technology, 118, 107058.
Li, X., Xu, F., & Zhao, Y. (2023). Hierarchical deep reinforcement learning for UAV autonomous decision-making in dynamic environments. Robotics and Autonomous Systems, 161, 104356.
Zhang, T., Kahn, G., Levine, S., & Abbeel, P. (2016). Learning deep control policies for autonomous aerial vehicles with MPC-guided policy search. IEEE International Conference on Robotics and Automation (ICRA), 528–535.
Dulac-Arnold, G., Mankowitz, D., & Hester, T. (2021). Challenges of real-world reinforcement learning. Machine Learning, 110(9), 2419–2468.
Sadeghi, F., & Levine, S. (2017). CAD2RL: Real single-image flight without a single real image. Robotics: Science and Systems (RSS), 1–10.
Gama, A., Ribeiro, M., & Oliveira, E. (2020). UAV control and obstacle avoidance using deep reinforcement learning: A review. Journal of Intelligent & Robotic Systems, 100, 1045–1062.
Nguyen, H., La, H., & Le, T. (2019). Deep reinforcement learning for autonomous UAV navigation using raw sensor inputs. IEEE International Symposium on Safety, Security, and Rescue Robotics, 356–362.
Chen, B., Xu, Z., & Lin, Q. (2020). UAV path planning with deep Q-learning in unknown urban environments. Sensors, 20(22), 6482.
Luo, F., Wang, Z., & Chen, M. (2021). Multi-agent deep reinforcement learning for collaborative UAV navigation. IEEE Access, 9, 112335–112346.
Hwangbo, J., Sa, I., Siegwart, R., & Hutter, M. (2017). Control of a quadrotor with reinforcement learning. IEEE Robotics and Automation Letters, 2(4), 2096–2103.
Koch, W., Mancuso, R., West, R., & Bestavros, A. (2019). Reinforcement learning for UAV attitude control. ACM Transactions on Cyber-Physical Systems, 3(2), 1–21.
Hu, J., Zhao, W., & Zhang, L. (2023). Autonomous drone navigation via proximal policy optimization in dynamic obstacle environments. Expert Systems with Applications, 219, 119634.
Li, Y., Feng, T., & Wang, X. (2022). Safe reinforcement learning for UAV navigation under uncertain obstacles. Aerospace Science and Technology, 126, 107004.
Mantegazza, D., Nicoli, M., & Rinaldi, F. (2020). Deep RL-based autonomous navigation of UAVs with partial observability. IFAC-PapersOnLine, 53(2), 6259–6264.
Downloads
How to Cite
Issue
Section
License
Copyright (c) 2025 International Journal of Engineering, Science and Humanities

This work is licensed under a Creative Commons Attribution 4.0 International License.


