A Research Of Hierarchical Multi-agents Deep Reinforcement Learning For Action Game

Posted on:2021-01-17

Degree:Master

Type:Thesis

Country:China

Candidate:J L Chen

Full Text:PDF

GTID:2428330623467792

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

Game AI is the combination of artificial intelligence and the game field,and is an important experimental carrier for reinforcement learning(RL).RL focuses on the problem of sequential decision making,which coincides with action selection in games.In this field,many remarkable research results have appeared in recent years,such as DQN which play Atari game near human level,AlphaGO which defeat the world top Go players,OpenAI Five which win a top Dota2 e-sport team.Traditional RL uses low-dimensional data.Also,its state and action space is relatively small.But now it faces more and more challenges when facing high-dimensional state and action space,sparse and delayed rewards,non-stable environment in multi-agent system.Based on a multiplayer football game,this paper proposes a hierarchical multi-agent RL algorithm with inter communication mechanism(HMARL-ICM)to solve the cooperation and competition between football players.Non-stable environment is an unavoidable problem in multi-agent RL.The reason is that other agents will also affect the environment,which may cause the deviation of understanding of environment state.Following the agent-independent method,this paper designs a decentralized training and an inter communication mechanism to make environment stable.And this idea also promotes the cooperative behavior between multiple players.Multiplayer football game is complicated and its episode is a bit long,which makes the rewards are spare and delayed.To solve this problem,this article constructs a hierarchical Actor-Critic network based on temporal abstraction theory,which network is similar to the structure of decision-making by brain and action by trunk.The core algorithm of high and low level is Fast-PPO,which releases the restrictions of proximal policy optimization to get improved.Applying the algorithm HMARL-ICM on a 3v3 multiplayer football game which is made by unity,this paper gets better results and faster convergence than other advanced RL algorithms.Agents can cope with long decision making in complex games based on hierarchical structure.And the inter communication mechanism enables multiple agents to share their information,which leads to a variety of excellent football strategies.

Keywords/Search Tags:

deep reinforcement learning, multi-agent reinforcement learning, hierarchical reinforcement learning, game AI

PDF Full Text Request

Related items

1	Supervised Reinforcement Learning:methods And Applications
2	Research On Deep Reinforcement Learning Technology For Multi-agent Collaboration
3	Research On Group Confrontation Strategies Based On Deep Reinforcement Learning
4	Hierarchical Reinforcement Learning
5	Multi-Agent Dynamic Hierarchical Reinforcement Learning Based On Hybrid Abstraction
6	Research On The Sparse Reward Problem Based On Hierarchical Reinforcement Learning
7	Continuous Time Hierarchical Reinforcement Learning Algorithm
8	Study Of Multi-agent Learning Problem Based On Reinforcement Learning
9	Research And Implementation On Game Control Algorithm Based On Deepening Reinforcement Learning
10	Research On Reinforcement Learning System Based On Audio-Visual Semantic Coherence