TY - GEN
T1 - TD3-Based Trajectory Design for UAV-Assisted Communication in Unknown Environment
AU - Luo, Jihao
AU - Zhao, Le
AU - Yin, Chenhao
AU - Wang, Xinyi
AU - Fei, Zesong
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - With the widespread use of unmanned aerial vehicles (UAVs), leveraging UAVs to provide communication services in unknown environments when terrestrial cellular networks are unavailable has become an increasingly critical issue. In this paper, we investigate the trajectory design for a UAV offers communication services to ground users (GUs) in an unknown environment, while simultaneously using the echoes to characterize the environmental topology for obstacle avoidance. By modeling the trajectory optimization problem as a Markov Decision Process (MDP), we employ deep reinforcement learning (DRL) to simulate the UAV's flight in an unknown environment and identify the optimal strategy through repeated trials. In particular, we propose a twin-delayed deep deterministic policy gradient (TD3)-based trajectory design (TD3BTD) method to minimize the time required to provide communication services to all GUs. Simulation results demonstrate that the proposed TD3BTD significantly reduces the mission execution time.
AB - With the widespread use of unmanned aerial vehicles (UAVs), leveraging UAVs to provide communication services in unknown environments when terrestrial cellular networks are unavailable has become an increasingly critical issue. In this paper, we investigate the trajectory design for a UAV offers communication services to ground users (GUs) in an unknown environment, while simultaneously using the echoes to characterize the environmental topology for obstacle avoidance. By modeling the trajectory optimization problem as a Markov Decision Process (MDP), we employ deep reinforcement learning (DRL) to simulate the UAV's flight in an unknown environment and identify the optimal strategy through repeated trials. In particular, we propose a twin-delayed deep deterministic policy gradient (TD3)-based trajectory design (TD3BTD) method to minimize the time required to provide communication services to all GUs. Simulation results demonstrate that the proposed TD3BTD significantly reduces the mission execution time.
KW - deep reinforcement learning
KW - trajectory design
KW - unknown environment
KW - unmanned aerial vehicle
UR - https://www.scopus.com/pages/publications/105017719375
U2 - 10.1109/ICCC65529.2025.11149303
DO - 10.1109/ICCC65529.2025.11149303
M3 - Conference contribution
AN - SCOPUS:105017719375
T3 - 2025 IEEE/CIC International Conference on Communications in China:Shaping the Future of Integrated Connectivity, ICCC 2025
BT - 2025 IEEE/CIC International Conference on Communications in China:Shaping the Future of Integrated Connectivity, ICCC 2025
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2025 IEEE/CIC International Conference on Communications in China, ICCC 2025
Y2 - 10 August 2025 through 13 August 2025
ER -