With the increasing demand for data services in the post-5G era, the high operating cost of 5G networks has become an increasingly prominent problem. While networks are being built at high speed, their operating efficiency must also be improved. The number of 5G connections keeps reaching new highs, yet spectrum resources remain limited. An effective resource allocation scheme is therefore needed to improve the utilization of the limited bandwidth, thereby reducing operating costs while ensuring the quality of communication services for users. Intelligent resource allocation and power control schemes are regarded as important means of alleviating the problems caused by the sharp growth in the number of users and in operating costs. Accordingly, based on Multi-Agent Deep Reinforcement Learning (MADRL), this paper studies and explores intelligent schemes for resource allocation and power control in the frequency domain. The main work and contributions of this paper are as follows.

First, this paper proposes a novel algorithm based on multi-agent deep reinforcement learning that jointly optimizes resource block (RB) allocation and power control, with the goal of maximizing the average spectral efficiency (SE) of the system while satisfying quality-of-service constraints. Because centralized training with distributed execution retains the advantages of centralized training while reducing computation and signaling overhead, MADRL technology is adopted. In the proposed MADRL model, the action-value functions of the agents are aggregated through a value decomposition network, which strengthens cooperation between agents and improves the convergence of the algorithm.

Second, this paper innovatively adds a reward discount network to the original MADRL framework to further improve the average spectral efficiency of the proposed algorithm in a multi-cell, multi-user communication environment. The reward discount network adaptively adjusts, in real time, the degree of attention paid to future rewards according to the agent's performance during training, so that the value of the reward discount factor is dynamically tuned to best suit the convergence of the neural network. To prevent the agent from becoming lazy, this paper adds a correction term to the loss function used to train the reward discount network, which pushes the reward discount factor toward larger values and extends the agent's planning horizon. Simulation results show that the proposed algorithm achieves better performance and stability than existing alternatives.
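The value-decomposition step described above can be sketched in a few lines. This is a minimal illustration, not the paper's implementation: it assumes the simple additive aggregation used by value decomposition networks, where the per-agent action values for the selected (RB, power) actions are summed into one joint value for centralized training, while each agent still acts on its own Q-function at execution time. The function name and the example Q-values are hypothetical.

```python
# Minimal sketch of VDN-style value aggregation (assumption: plain
# additive decomposition; the paper's network may differ in detail).

def vdn_joint_q(per_agent_q):
    """Sum per-agent action values for the chosen actions into one
    joint value used by the centralized training loss."""
    return sum(per_agent_q)

# Hypothetical example: three agents' Q-values for their selected
# RB/power actions are combined into a single trainable target.
joint_value = vdn_joint_q([1.2, -0.4, 0.7])
print(joint_value)
```

Because the joint value is a sum, the gradient of the centralized loss flows back to every agent's network, which is one way the aggregation encourages cooperation between agents.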
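The adaptive reward-discount idea can likewise be illustrated with a toy sketch. Everything here is an assumption for illustration: the "network" is reduced to a single sigmoid unit mapping a performance score to a discount factor in (0, 1), and the correction term is written as lambda * (1 - gamma), one simple way to penalize small discount factors and thus discourage the "lazy" short-horizon behavior the abstract mentions. The function names, the linear form, and the weighting constant are all hypothetical.

```python
import math

# Toy stand-in for the reward discount network (assumption: a single
# sigmoid unit; the actual network in the paper is a neural network).
def gamma_from_score(w, b, score):
    """Map a training-performance score to a discount factor in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-(w * score + b)))

# Loss with a correction term that grows as gamma shrinks, so training
# favors larger discount factors and a longer planning horizon.
def discount_loss(td_error, gamma, lam=0.1):
    return td_error ** 2 + lam * (1.0 - gamma)
```

The key property of the correction term is monotonicity: for a fixed TD error, a larger gamma yields a strictly smaller loss, so gradient descent on this loss pushes the discount network toward longer-horizon planning rather than letting gamma collapse.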