Research Of Multi-agent Cooperation Mechanism Based On Reinforcement Learning

Posted on:2016-10-01

Degree:Master

Type:Thesis

Country:China

Candidate:Q Sun

Full Text:PDF

GTID:2308330464969456

Subject:Control Science and Engineering

Abstract/Summary:

PDF Full Text Request

Multi-agent system is composed of multiple interacting autonomous agents, the core issue of the research is to seek to establish an effective cooperation mechanism, making the simple and independent agents complete tasks or solve problems of complex targets through consultation,coordination and cooperation.Reinforcement learning algorithm learns through trial and error with the dynamic environment, which is an important branch in the field of machine learning and artificial intelligence, and is a leap to control agents through rewards and punishments, instead of task implementation.This thesis focuses on multi-agent cooperation mechanism for large-scale, real-time, and uncertainty problems, based on the reinforcement learning algorithm. The main work and results are summarized as follows:1. Based on the Markov properties of the problem, several kinds of Markov decision process models were introduced. Considering the partially observable property of multi-agent problem, the partially observable Markov model was built, and converted to semi-Markov model to meet MAXQ-Q algorithm.2. In view of large scale features of the problem, MAXQ-Q hierarchical reinforcement learning algorithm was introduced to decompose the multi-agent problems whose state-action space is huge, and algorithm complexity is in exponential growth, into small linear semi-Markov decision problems which can be solved recursively.3. In view of the MAXQ reinforcement learning algorithm proposed in this thesis, the and-or graph was applied for strategy representation in large state-action space strategy search problems, to combine with the state abstraction methods for state reduction to meet the real-time demand in soccer robot problems, such as action selection and state transition.4. A new multi-agent cooperation framework was proposed based on the above work, and its real-time performance and stability were tested in Robocup2 D soccer robots simulation platform and a cooperation task with two NAO robots developed by Aldebaran Co., a France company.

Keywords/Search Tags:

Multi-agent Cooperation, Markov Decision Model, Reinforcement Learning, Hierarchical Task, And-or Graph, State Abstract

PDF Full Text Request

Related items

1	Continuous Time Hierarchical Reinforcement Learning Algorithm
2	Multi-Agent Dynamic Hierarchical Reinforcement Learning Based On Hybrid Abstraction
3	Study On The Mothed Of Multi-Robot Cooperation Based On The Multi-Agent System
4	Hierarchical reinforcement learning in continuous state and multi-agent environments
5	Research On Multi-agent Cooperation Based On Task Prediction In RoboCup Rescue Simulation
6	Adaptive Agent Based On Model-free Reinforcement Learning Collaborative Planning
7	Self-Organizing Collaborative Target Search Of Mobile Multi-Agent Based On Reinforcement Learning
8	Research On Multi-agent Cooperation Method Based On Deep Reinforcement Learning
9	Research Of Multi-agent Cooperation Based On Deep Reinforcement Learning
10	Hierarchical Reinforcement Learning