Research On Multi-agent Cooperative Modeling Method Based On Reinforcement Learning

Posted on:2022-05-21

Degree:Master

Type:Thesis

Country:China

Candidate:D P Hu

GTID:2518306323460384

Subject:Software engineering

Abstract/Summary:

Against the background of "Made in China 2025", this thesis aims to realize intelligent manufacturing and to raise the level of manufacturing under the uncertainties of globalization by integrating industrial scenarios with Internet technology. At present, industrial intelligent manufacturing faces common problems such as difficulty in sharing resources, difficulty in coordinating processes, and dependence on domain experts. Drawing on agents, multi-agent systems, reinforcement learning, and related technologies, the thesis constructs a multi-agent model for industrial hybrid intelligent control and studies key issues in the multi-agent system, such as agent cooperation ability and flow shop scheduling policy, in order to achieve intelligent control of the industrial production process.

Based on multi-agent technology, the thesis abstracts the key points and key entities of the industrial process into individual agents and establishes a three-tier hybrid multi-agent intelligent model. The model uses the JADE framework to implement its underlying basic functions. On top of this, two intelligent modules are implemented through interfaces to meet industrial requirements: a multi-agent collaboration method based on state representation learning, and a multi-agent task scheduling method based on reinforcement learning.

The thesis improves the M3DDPG algorithm to address the complex environments that agents encounter in industry, the algorithm's poor robustness, and the agents' limited ability to perceive important features. It also improves how the equipment agent acquires important features, so that the agent's actions achieve the expected effect. The improved algorithm uses state representation learning to help capture features: a deep neural network constructs a mapping between observations and state values, and the Actor and Critic networks in M3DDPG then learn from the output of this new network instead of learning from the initial observations. As a result, the agent's actions can meet expectations and the method can adapt to high-dimensional data.

For the flow shop scheduling problem in a multi-agent system, the thesis proposes the TS_Qlearning algorithm, which combines the tabu search algorithm with the Q-learning algorithm. The tabu table of the tabu search algorithm stores early training experience, which is used to guide the early stage of training and to change the policy that Q-learning follows when solving scheduling problems. Building on the exploration advantages of Q-learning, the method guides the algorithm in the initial training stage, thereby improving the quality of training and better optimizing resource allocation.
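
The abstract does not give the details of the tabu-guided Q-learning scheduler, so the following is only a minimal sketch of the general idea, not the thesis's algorithm. It assumes a permutation flow shop, a Q-table indexed by (last scheduled job, next job), a reward equal to the negative increase in partial makespan, and a tabu list of recently tried job sequences that is consulted only during the first `guided` episodes. All names and parameters (`makespan`, `ts_q_learning`, `guided`, `tabu_len`, and so on) are hypothetical.

```python
import random
from collections import deque


def makespan(seq, proc):
    """Finish time of the last job on the last machine (permutation flow shop).

    proc[j][m] is the processing time of job j on machine m.
    """
    n_machines = len(proc[0])
    finish = [0] * n_machines  # finish[m]: time at which machine m becomes free
    for job in seq:
        for m in range(n_machines):
            ready = finish[m - 1] if m > 0 else 0  # this job done on machine m-1
            finish[m] = max(finish[m], ready) + proc[job][m]
    return finish[-1]


def ts_q_learning(proc, episodes=300, guided=100, tabu_len=20,
                  alpha=0.1, gamma=1.0, epsilon=0.2, seed=0):
    """Illustrative tabu-guided Q-learning; returns (best_sequence, best_makespan)."""
    rng = random.Random(seed)
    n = len(proc)
    q = {}  # q[(last_job, next_job)]; last_job is -1 before any job is scheduled
    tabu = deque(maxlen=tabu_len)  # recently tried sequences (assumed tabu content)
    best_seq, best_cost = None, float("inf")

    for ep in range(episodes):
        # 1. Build a job sequence with an epsilon-greedy policy over the Q-table.
        seq, last, left = [], -1, list(range(n))
        while left:
            if rng.random() < epsilon:
                job = rng.choice(left)
            else:
                job = max(left, key=lambda j: q.get((last, j), 0.0))
            seq.append(job)
            left.remove(job)
            last = job

        # 2. Early training only: if the sequence is tabu, apply random swap
        #    moves (a bounded number of times) to push the search elsewhere.
        if ep < guided:
            tries = 0
            while tuple(seq) in tabu and tries < 10 and n > 1:
                i, j = rng.sample(range(n), 2)
                seq[i], seq[j] = seq[j], seq[i]
                tries += 1
            tabu.append(tuple(seq))

        # 3. Q-learning update along the (possibly modified) sequence.
        last, prev_cost = -1, 0
        for k, job in enumerate(seq):
            cost = makespan(seq[:k + 1], proc)
            reward = -(cost - prev_cost)  # negative growth of partial makespan
            future = max((q.get((job, j), 0.0) for j in seq[k + 1:]), default=0.0)
            old = q.get((last, job), 0.0)
            q[(last, job)] = old + alpha * (reward + gamma * future - old)
            last, prev_cost = job, cost

        if prev_cost < best_cost:
            best_seq, best_cost = list(seq), prev_cost

    return best_seq, best_cost
```

Calling `ts_q_learning(proc)` with a small table of processing times, for example `proc = [[3, 2], [1, 4]]` (two jobs, two machines), returns the best sequence found and its makespan. Using the last scheduled job as the state is a simplification chosen to keep the table small; a fuller state would also encode which jobs remain.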

Keywords/Search Tags:

Multi-Agent system, Reinforcement learning, SRL_M3DDPG algorithm, TS_Qlearning algorithm

Related items

1	Multi-agent Confrontation Algorithm Based On Reinforcement Learning
2	Research On Deep Reinforcement Learning Technology For Multi-agent Collaboration
3	Decentralized Multi-agent Reinforcement Learning Algorithm Research
4	Research On Multi-Agent Cooperative Algorithm Based On Deep Reinforcement Learning
5	Research On Multi-Agent Collaborative Algorithm Based On Reinforcement Learning
6	Research And Application On Reinforcement Learning And Communication Technology In Agent
7	Cooperation Mechanism Of Simulation 2D Soccer Robot Based On Reinforcement Learning
8	Study Of Multi-agent Foraging Based On CE-Q Reinforcement Learning And K-means Clustering Integrated Algorithm
9	Research On Reinforcement Learning Algorithm And Equilibrium Of Multi-Agent Game
10	A Multi-agent Reinforcement Learning Algorithm Based On Sparse Interactions