Research And Design Of Soccer Robot Decision-making System Based On Multi Agent Reinforcement Learning

Posted on:2014-10-20

Degree:Master

Type:Thesis

Country:China

Candidate:K L Zhou

Full Text:PDF

GTID:2298330422990418

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

How to make the agent to think and make decisions like the human is theessence of artificial intelligence. To solve the markov decision process describeddecision making problems under uncertain environment, the agent use themethod of trial and error in the environment which called reinforcement learning.RoboCup (Robot World Cup) which is the international robot soccercompetitions and academic activities is aim at to promote the development ofmulti-agent system and distributed artificial intelligence in related areas. Amongall the leagues, Robocup2D simulation competition is a league focus onmulti-agent system decision making problems.The main manner of this paper is based on multi-agent reinforcementlearning, especially, MAXQ hierarchical reinforcement learning method andcollaborative action learning. This paper regards Robocup2D simulation game asthe experimental platform for multi-agent reinforcement learning and decisionmaking problems to do the research and apply the related methods andachievement on players’ privy skills and cooperation actions.Firstly, this article analysises and summerizes the theory of multi-agentsystems decision-making and reinforcement learning. Secondly, in order to sovlethe complex dimension disaster problem of the state of reinforcement learningsystem, this article applies CMAC neural network as the way of generalization toaccurate the speed of learning and improve the accuracy of the learning result.Using simulated annealing strategy, so that the learning process can escape fromthe local optimal solution and obtain the global optimal solution by exploration.In addition, the way of MAXQ hierarchical reinforcemnt learning has beenimplemented through a hierarchical structure of decision-making tasks, whichaccording to the system environment and the complexity of the task to be divided.In this manner, the large dimensions of the state space is divided into some smalldimensional state space. What’s more, heuristic techniques are used tocompletion function. It’s an effective way to reduce the space of statedimensions.Finally, in the Robocup simulation platform, this article designs the player’s decision making and function module, adds in dynamic potential field heuristicmodel to allows the player agent obtained the informatin of the location of thefootball field and the players to make adaptive judgement. Through forward treesearch algorithm, the player can make the plan to achieve collaboration withtheir teammates, this process uses the result of reinforcement learning of players’behavior, make the player to select the optimal strategy and to complete theprocess of decision-making. In this way, the ability of collaboration among theplayers has been improved.All of the proposed mechanisms and methods are applied to the RoboCup2D simulation platform team. In order to analysis the proposed algorithm, thisarticle uses the task of intercept and matches between different version of ourteam with related algorithm. The most important work is compare with otherteams in the form of the game. The results of the experiment were analyzed toverify the multi-agent decision-making methods which proposed in this paper isfeasible.

Keywords/Search Tags:

Multi-Agent System, MAXQ Hierarchical Reinforcement Learning, Soccer Robot

PDF Full Text Request

Related items

1	A Hierarchical Reinforcement Learning Method With Variable Learning Rate For Soccer Robot Multi-agent Adversarial System
2	Research On Decision-Making Strategy Of Soccer Robot Based On Multi-Agent Reinforcement Learning
3	Application Of Reinforcement Learning Based On Robot Soccer Simulation
4	Design, Decision-making System Based On Reinforcement Learning For Soccer Robots
5	Dynamically Tiered Model-based Reinforcement Learning Algorithm Research
6	Research On Soccer Robots Cooperation Mechanism Based On Markov Decision Theory
7	Distributed Soccer Robot Learning And Collaboration
8	Mobile Robot Path Planning Based On Hierarchical Reinforcement Learning
9	Multi-Agent Dynamic Hierarchical Reinforcement Learning Based On Hybrid Abstraction
10	Collaborative Planning In Simulation Soccer Robots