Study On The Improved Average Reward Reinforcement Learning Algorithm Based On Performance Potentials

Posted on:2015-01-03

Degree:Master

Type:Thesis

Country:China

Candidate:W L Yang

Full Text:PDF

GTID:2268330428997077

Subject:Control theory and control engineering

Abstract/Summary:

PDF Full Text Request

Reinforcement learning is an important learning control method for solving the problem of learning control in the field of artificial intelligence. The robot soccer simulation game, with the characteristics of dynamic real-time, distributed control, cooperation and confrontation in uncertain environment, is a benchmark problem for Multi-agent System research. Also it is a significant research for the progress of artificial intelligence, control decision and intelligent robotics. However, towards the disadvantages of conventional reinforcement learning algorithms in solving agents’ strategies, such as slow convergence, leading to environmental uncertainty and parameter sensitivity, this thesis proposes the corresponding improvements. The main contributions and achievements of this thesis are given below:Firstly, we give a fundamental introduction about reinforcement learning and performance potentials with their development progress, basic theory and typical algorithms. After that we will analyze their advantages and disadvantages respectively.Secondly, using conventional average reinforcement learning in solving agent’s personal skills may cause some problems with slow solution speed, local optimal, etc. For this reason, we employ G-learning algorithm for the off-line training of kick skill to improve the performance of agent’s skill. Then we perform a series of experiments and the results which show that G-learning is superior to conventional reinforcement learning both for convergence speed and successful rate of kick skill.Finally, we propose a multi-agent collaboration algorithm based on G-learning which not only need to solve the state space in multi-agent system, but consider about the learning and reward problems. In this part, we apply the improved algorithm to the Keepaway platform which enable the team to get a better performance.In conclusion, this work is achieved based on GDUT TiJi, a2D soccer simulation team of RoboCup. After the code programming, we also take part in RoboCup2013WorldCup and RoboCup China Open2013with the satisfactory results.

Keywords/Search Tags:

RoboCup, Multi-agent, reinforcement learning algorithm, averagereinforcement learning algorithms, performance potentials

PDF Full Text Request

Related items

1	Study On The Learning And Planning Algorithm Of Intelligent Agent Based On Performance Potentials
2	Research On Robocup Simulation System And Program Design Based On Reinforcement Learning
3	Study Of Multi-agent Learning Problem Based On Reinforcement Learning
4	The Design And Realization Of Multi-Agent System Based On Robocup
5	Unified Algorithms For Semi-Markov Decision Processes With Discounted And Average Criteria Based On Performance Potentials By Reinforcement Learning
6	Research On Deep Reinforcement Learning Technology For Multi-agent Collaboration
7	NDP Optimization For Large-scale Markov Systems Based On Performance Potentials-learning
8	A Hierarchical Reinforcement Learning Method With Variable Learning Rate For Soccer Robot Multi-agent Adversarial System
9	Research On Reinforcement Learning Algorithm And Equilibrium Of Multi-Agent Game
10	Decentralized Multi-agent Reinforcement Learning Algorithm Research