A Hierarchical Reinforcement Learning Method With Variable Learning Rate For Soccer Robot Multi-agent Adversarial System

Posted on:2005-09-12

Degree:Master

Type:Thesis

Country:China

Candidate:T B Wei

Full Text:PDF

GTID:2168360125465790

Subject:Signal and Information Processing

Abstract/Summary:

Based on soccer robot simulation as its research platform, this paper studies the learning of high level strategy of multi-agent adversarial system. According to the study and analysis of soccer robot system, fuzzy method is introduced to describe the complex state space of soccer robot. This paper applies MAXQ, a multi-agent hierarchical reinforcement learning method, to the learning of attacking strategy of soccer robot. Moreover, a learning principle with variable learning rate is integrated into the MAXQ algorithm. The ultimate algorithm is validated to be suitable for multi-agent adversarial system.Multi-agent system is one of the research hotspots in the area of Artificial Intelligence. Compared with single agent system, multi-agent system has the properties of collective intelligence and sociality which are more conformable to the characteristics of many applications. Soccer robot system is one of the standard problems of this area. All the hotspots in multi-agent, such as agent, collaboration, communication, etc, are included in soccer robot simulation platform. This paper focuses on the hierarchical learning of high level strategy of soccer robot system. The attacking problem is abstracted for this paper.The description of environment is a precondition of agent learning. Agents in multi-agent system are always in complex, dynamic and not self-contained. The description depends on the specific case. This paper utilizes fuzzy method to describe the state space of the soccer player. With a reasonable state number, the state is effectively described.MAXQ, a hierarchical reinforcement learning method for multi-agent system, is proposed in recent years. It improves the reinforcement learning method to adapt for the multi-agent learning environment. This paper adopts MAXQ algorithm to the attacking strategy of soccer robot system. Experiment shows that this algorithm can learn attacking skills.Soccer robot system is an adversarial system. WOLF (Win or Learn Fast) is a newly proposed learning principle which is proven to be suitable for adversarial system. Using this principle together with MAXQ, a new method is proposed. Experiment result shows that this method has some good characteristics towards adversarial system.

Keywords/Search Tags:

Multi-agent system, adversarial system, MAXQ algorithm, WOLF principle, RoboCup

Related items

1	The Design And Realization Of Multi-Agent System Based On Robocup
2	Collaborative Planning In Simulation Soccer Robots
3	Research On MAS Of Structure And Cooperation Mechanism And It's Application In The Robocup
4	Continuous-Time Unified MAXQ Algorithm And Its Application
5	Cooperation Models For The Multi-agent System And Application To The RoboCup Soccer Simulator
6	The Construction Of MAS In RoboCup Simulation
7	Multi-agent Collaboration Model And Its Application In Robocup
8	The Application Of Multi-agent In RoboCup
9	Research For Multi-Agent Cooperation Based On Formation And Role, Tactics Design
10	Multi-agent Cooperation In RoboCup Rescue Simulation