UAV Path Planning Based On Deep Reinforcement Learning Aided By Radio Map

Posted on:2023-12-26

Degree:Master

Type:Thesis

Country:China

Candidate:L Yu

Full Text:PDF

GTID:2532306800952989

Subject:Information and Communication Engineering

Abstract/Summary:

PDF Full Text Request

Thanks to the low cost and flexible mobility of UAVs,they have been applied to various complex communication scenarios.The traditional UAV communication trajectory planning is performed by manually modeling the problem mathematically and then solving it optimally,often assuming that the channel model is a simple lineof-sight(Lo S)or probabilistic Lo S,obtaining a functional expression about the UAV position,and then using convex optimization techniques for path planning.Due to the time-varying nature of the channel and the mobility of the UAV,the traditional modeling and description are too simple and the obtained trajectories cannot guide the practical application of the UAV.Deep reinforcement learning is a method for optimal decision making in complex dynamic environments.Optimization of parameters such as UAV trajectories and resources in a dynamic low-altitude network is actually an optimal decision problem in a dynamic environment,which fits with the goal of deep reinforcement learning.Therefore,in this paper,we abandon the traditional modelingbased solution method and use DRL to construct the channel environment of the lowaltitude network,generate radio maps,and then use the radio maps to assist in UAV trajectory optimization.How to use the mobility of UAV to construct radio maps and accomplish communication tasks more efficiently is a problem worth studying.This paper is dedicated to investigate the deep reinforcement learning algorithm based UAV trajectory optimization strategy,and the specific research work is as follows:We consider a terrestrial cellular network-connected UAV communication scenario,where control of the UAV is achieved through a ground base station,and a trajectory is designed for the UAV to achieve minimal mission completion time.To ensure safety,the reliability of the UAV connection to the ground network needs to be ensured as much as possible.And considering the reality that there is a coverage black hole in the coverage of base stations in low altitude,we propose two different metrics for the connection constraint,based on which the UAV’s flight time minimization problem is formulated.First,the duration of a single disruption of the UAV during flight is less than a set threshold.Second,the cumulative time of multiple interruptions of the UAV does not exceed the given threshold.We propose a multistep learning algorithm based on Dueling DDQN,modeling the flight time minimization as a Markov decision problem,rationalizing the basic elements of reinforcement learning: state space,action space,and reward function,and simulation results demonstrate the effectiveness of the designed deep reinforcement learning algorithm,which can well solve the mobile-aware trajectory design problem.Aiming at the scenario of UAV wireless energy supply communication network,where the UAV supplies energy to ground devices while receiving data sent from the base station,we study the problems of maximizing the total energy transmission,maximizing the total data throughput and minimizing the energy consumption of the joint UAV,and propose the UAV flight trajectory design and flight strategy optimization.We use an improved DDPG algorithm to obtain the control strategy of the UAV,transforming the dynamic wireless functional network problem into a Markov sequence decision problem and setting a multidimensional reward function corresponding to the three optimization objectives mentioned above.Numerical results verify the effectiveness of the DDPG algorithm,which can optimize the path to supply more energy and receive more data simultaneously with less energy consumption.The DDPG algorithm is able to obtain the optimal strategy compared with the proposed two benchmark solutions of maximum speed and most energy efficient speed.

Keywords/Search Tags:

UAV-enabled communication, path design, connectivity constraints, deep reinforcement learning, radio map

PDF Full Text Request

Related items

1	Research On Optimizing Cellular-Connected UAV Communication System Based On Machine Learning
2	Research On UAV Communication Transmission Optimization Strategy Based On Deep Reinforcement Learning
3	Research On UAV Communication Traj Ectory Optimization Based On Deep Reinforcement Learning
4	Research On Cellular-connected Unmanned Aerial Vehicle Path Planning Based On Deep Reinforcement Learning
5	Path Planning Based On Reinforcement Learning In UAV-Assisted Communication
6	Design Of Formation Control Algorithm For Autonomous Underwater Vehicles Based On Deep Reinforcement Learning
7	Research On UAV Path Planning Based On Reinforcement Learning In Mobile Edge Computing Syste
8	Research On Reinforcement Learning Algorithm For Mobile Vehicle Path Planning In A Special Traffic Environment
9	Research On Multi-Path Following For Unmanned Surface Vehicle Based On Deep Reinforcement Learning
10	Reliable And Efficient Wireless Communication For Vehicular Network