Study On Optimal Consensus Of Multi-agent Systems Based On Q-learning

Posted on:2020-03-05

Degree:Master

Type:Thesis

Country:China

Candidate:Q Zhao

Full Text:PDF

GTID:2518306518969499

Subject:Control Science and Engineering

Abstract/Summary:

Inspired by the phenomenon of natural biological clusters,multi-agent systems are proposed and the bio-cluster phenomenon is extended to scientific research.Compared to the single agent,multi-agent systems can perform more complex and dangerous tasks in a more efficient manner.Due to its theoretical research significance and wide-ranging applied background,cooperative control of multi-agent systems is still one of the key research directions.Consensus control is one of the basic problems of cooperative control for multi-agent systems.Because it is difficult to obtain the accurate system model in many practical applications,it is necessary to propose a model-free control method to achieve the optimal consensus of the multi-agent systems.Based on the model-free Q-learning method,this paper studies the optimal consensus control of a series of multi-agent systems.The main research contents are as follows:Firstly,the application of Q-learning method in discrete-time linear single-agent system is realized.The feasibility of Q-learning method is verified.The consensus control of single-agent system is the general tracking problem.In order to realize the optimal control of the system without knowing the accurate system model,a model neural network is designed to identify the system.Then based on adaptive dynamic programming and Q-learning,the modeless controller is designed.The optimal control is obtained by reinforcement learning and least square method.The effectiveness of the controller is verified comparatively by simulation examples.Secondly,Q-learning is applied to the multi-agent system,where the Q-learning method is extended to the consensus control problem for multi-agent systems from the tracking control problem for single agent systems.Considering that the system information is not easy or impossible to be obtained in real-world applications,the data-based Q-learning method is adopted for a class of discrete-time multi-agent systems,which are known as graphic game.Reinforcement learning and least square method are used to obtain the analytical solutions of optimal control.Finally,the simulation examples prove the effectiveness of the proposed algorithm.Finally,the implementation of Q-learning method in heterogeneous multi-agent system is studied.Considering the uncertainty of information exchange between the agents,the Q-learning algorithm based on policy iteration is proposed.Firstly,the observers for estimating the leader’s dynamics are designed and trained under the switching topologies.Then the trained observers are applied to the Q-learning algorithm.The effectiveness of the algorithm is illustrated by numerical simulations.

Keywords/Search Tags:

Adaptive Dynamic Programming, Multi-agent Systems, Q-learning, Reinforcement Learning, Policy Iteration, Switching Topologies

Related items

1	Research On Adaptive Learning Control Of Multi-agent Cooperative Formation
2	Model-free Adaptive Dynamic Programming And Its Applications On Multi-agent Systems
3	On Adaptive Optimal Control For Multi-agent Systems Via Integral Reinforcement Learning
4	Research On Motion Control Of Mobile Robots Based On Reinforcement Learning
5	Research On H∞ Distributed Cooperative Control Problem Of Multi-agent Systems With Switching Topologies And Random Delays
6	Research On Goal-Conditioned Hierarchical Multi-Agent Reinforcement Learning For Cooperative Environment
7	Multi-domain Dialog Policy Learning Based On Multi-agent Reinforcement Learning
8	Adaptive Dynamic Programming Based Control Of Linear Systems With Unknown Parameters
9	Tracking Control For Nonlinear Multi-agent Systems Under Switching Topologies
10	Research On Multi-agent Distributed Cooperation Method Based On Deep Reinforcement Learning