Research On Dynamic Resources Management Algorithm Based On Reinforcement Learning In Network Slice

Posted on:2020-07-05

Degree:Master

Type:Thesis

Country:China

Candidate:Y C Yang

Full Text:PDF

GTID:2428330590471561

Subject:Information and Communication Engineering

Abstract/Summary:

PDF Full Text Request

In the future,communication services will be more diverse,and different services have different communication requirements.However,traditional networks could only satisfy a single service demend,so the network slice technology emerges to adapt to diverse services.Network slice technology allows the establishment of multiple logical networks in a physical network to respectively meet the demends of various services.In order to ensure the excellent communication quality of network slices,efficient resources management mechanism is extremely important.Therefore,the target of this thesis is to optimize resource management in access network slice scenario,and the research work of this thesis is summarized as follows:Aiming at the problem that topology perception is not practical in many literatures,a multiresource joint management scheme based on Partial Observation Markov Decision Process(POMDP)is proposed.For the physical network topology,a heartbeat packet detection mechanism is designed to partially observe the current topology condition.Based on complete queue state and partial topology state,POMDP is used to determine the orchestration and mapping of computing and link resources for service function chains in slices.The mingle heuristic value iteration algorithm is used to solve the proposed POMDP model because of its low complexity,and the approximate optimal resource management strategy could be obtained.The simulation results show that the proposed scheme could optimize the slices' resource orchestration and mapping on dynamic topology,and reduce delay and minimize repair cost.As a result,the proposed scheme could realize low-cost and low-delay communication and improve resource utilization.In order to solve the high latency caused by the terminals' moving in the access network slice,a mobility-aware resource management scheme based on reinforcement learning is proposed.Based on Markov Decision Process(MDP),a MDP model is proposed.Especially,it could observe the terminals' mobility and environments' changes,and it could make decisions about the allocation of computing,link and radio resources.The Q-learning method is applied to the MDP model to solve the problem that the transition probability of mobile state and environment state can not be obtained.In order to reduce the complexity of Q-learning method,a Deep Q Network(DQN)is designed and trained to obtain the approximate optimal resource management strategy.The simulation results indicate that the proposed scheme can dynamically optimize the delay of access network slice with mobile terminals,meanwhile it reduce the data loss in the slice.So the proposed scheme could achieve low data loss and low delay,and improve the throughput of slice.

Keywords/Search Tags:

network slice, resources management, partial observation Markov decision process, Q network

PDF Full Text Request

Related items

1	Research On Slice Resources Management And Orchestration Algorithms Based On Reinforcement Learning
2	Research On Intelligent Resource Management Algorithm For Delay Optimization In Network Slicing
3	Multi-domain Resource Management In Network Slicing Based On Markov Decision Process
4	Research On Path Planning Based On Markov Decision Process For AUV
5	Research On Agent Decision Problem Based On Markov Decision Process Theory
6	Research On Path Planning Based On Markov Decision Processes For Palletizing Robot
7	Markov Theory Based Planning And Sensing Under Uncertainty
8	Source Identification And Control For Propagation Of New-type Network Risks Based On Partial Observation Information
9	Research On Spectrum Sensing Scheme And Spectrum Resources Allocation In Cognitive Radio Network
10	Design And Implementation Of Identifier Network Domain Management System