Research On Model And Algorithm Of Spectrum Resource Sharing Based On Deep Reinforcement Learning In Cognitive Wireless Network

Posted on:2022-01-16

Degree:Master

Type:Thesis

Country:China

Candidate:Z S Fan

Full Text:PDF

GTID:2518306575967139

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

With the rapid development of wireless communication technology,radio spectrum resources become increasingly scarce,and the fixed allocation of spectrum resources can no longer meet the actual needs of today.The spectrum sharing technology in cognitive wireless network provides a solution model for this problem.However,the traditional cognitive spectrum sharing technology needs to obtain the priori information of the environment,which is mainly applicable to the relatively simple and predictable radio environment,but it is not satisfactory in the actual complex wireless environment.Deep reinforcement learning can process complex environmental information and autonomously learn the optimal actions,which provides a great possibility for performance improvement and practical application of cognitive spectrum sharing technology.Therefore,this thesis discusses a dynamic spectrum sharing solution based on deep reinforcement learning in cognitive wireless network.Firstly,in view of the problem of spectrum resource shortage,this thesis considers a spectrum sharing scenario in a cognitive wireless network without cooperation between primary user and secondary user,the primary users and the secondary user update their power according to their own power policies,the goal of the secondary user is to learn the optimal power control policy according to the obtained information,to successfully share the spectrum resources of the primary users for communication,and to ensure the normal communication of the primary users.Then,this thesis carries on the mathematical modeling to the scene,quantifies the model and analyzes the variable and formulas in the model in detail.Aiming at this model,a spectrum sharing scheme based on Proximal Policy Optimization algorithm is proposed.Different from the deep reinforcement learning algorithm based on value function,this deep reinforcement learning algorithm based on policy can not only process the complex and continuous environmental information,but also effectively process the continuous action space.Experimental results show that the proposed algorithm has good performance under different environmental parameters,can help secondary user learn effective continuous power control policy,and shows speed advantage compared with Deep Q Network algorithm.Finally,in order to further improve the training speed of PPO algorithm,a Distributed PPO algorithm based on multi-threading is proposed.The experimental results show that the secondary user trained by this method can perform well in different parameter settings,and it is found in the comparative experiment with the PPO algorithm that the distributed PPO algorithm can train faster under the condition of achieving the same effect.

Keywords/Search Tags:

cognitive wireless network, spectrum sharing, power control, deep reinforcement learning, proximal policy optimization

PDF Full Text Request

Related items

1	The Application Of Deep Learning In Wireless Communication
2	Research On Agent Decision-making And Control Based On Deep Reinforcement Learning
3	Robust Policy Gadient Algorithm Based On Actor-Critic In Deep Reinforcement Learning
4	Self Learning Control Of Mechanical Arm Based On Reinforcement Learning
5	Research On Collaborative Spectrum Sharing Based On Reinforcement Learning For Distributed Cognitive Networks
6	Research On Robotic Arm Grabbing Method Based On Deep Reinforcement Learning
7	Research On Automatic Driving Control Decision Based On Deep Reinforcement Learning
8	Research On Dynamic Spectrum Allocation Methods Based On Deep Reinforcement Learning
9	Research On Resource Optimization Problems In Cognitive Wireless Networks
10	Resource Allocation Of Heterogeneous Wireless Networks Based On Deep Reinforcement Learning