Research And Implementation Of Micro Real-time Strategy Game Based On Deep Reinforcement Learning

Posted on:2021-06-01

Degree:Master

Type:Thesis

Country:China

Candidate:Z T Wang

Full Text:PDF

GTID:2506306557489384

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

In future wars,the traditional combat using single equipment will be replaced by multi equipment.Collaborative electronic combat uses computer and communication technology to connect existing electronic combat systems into a network in order to maximize the use of existing technology and tactical resources,improve the system’s perception,attack and protection abilities so that ultimately achieve the goal of cooperation and improve the efficiency of existing electronic combat systems.The centralized learning technology is difficult to adapt to the heterogeneous,complex,dynamic and large-scale real-time systems.However,the multi-agent reinforcement learning technology has the characteristics of autonomy,distribution,coordination,self-organization and self-learning,which is suitable to build a cooperative and competitive multi-agent system with strong robustness and reliability.This thesis is dedicated to using deep reinforcement learning technology to solve problems in multi-agent antagonism systems.This study mainly focuses on the mini game scenarios and the full game scenarios in real-time strategy games:1)In mini game scenarios,the traditional action-value function has been widely used,but action-value function is difficult to adapt to the real-time strategy game systems,because the number of agents can change at any time.In order to make the multi-agent architecture more scalable,we explore a multi-agent reinforcement learning framework for real-time strategy game that avoids the use of action-value functions,i.e.using the state-value function shared globally by all agents and multiple switchable actors in the Actor-Critic framework.We also design a suitable global reward function for this framework.We then propose the algorithm without human knowledge which can work for semi-markov decision process where rewards cannot be received until actions last for a while,which is more suitable for real-world scenarios.2)In full game scenarios,the problem has a higher state and action space than mini game scenarios.Based on the presented reinforcement learning framework in mini game scenarios,this thesis proposes the counterfactual advantage function and the league learning method respectively to solve the two problems,that is,lack of cooperation between agents and training is more likely to fall into local optimum in full game scenarios.3)This thesis gives the details of the design and the implementation of the reinforcement learning system.To evaluate the performance of our method,this thesis conducts mini game and full game experiments on a simplified real-time strategy game,!RTS.The results show that the AI trained by this method is highly competitive against strong baseline AIs.

Keywords/Search Tags:

Markov Decision Process, Multi-Agent Systems, Deep Reinforcement Learning, Real-Time Strategy Games, Game Theory

PDF Full Text Request

Related items

1	The Study Of Multi-Agent Cooperative Reinforcement Learning Methods
2	Research On Air Combat Game Decision Based On Deep Reinforcement Learning
3	Learning In Cooperative Multi-agent Team
4	Research On Multi-Agent Intelligent Decision Support Systems Based On Theory Of Evidence
5	Research On The Recommendation Method Of Community Correction Strategy Based On Reinforcement Learning
6	Research On Group Decision Support Systems Based On Multi-Agent
7	A Study On Distributed Real-time Multi-agent Techonology And It's Application In NCW
8	Path Planning Method Based On Deep Reinforcement Learning And Its Application
9	Reinforcement Learning With Its Application In Coplanar Air Combat
10	Research On Analyzing, Modeling And Applications Of Military Conflict Decision-making Oriented To Human-caused Complexity