Design And Implementation Of Combat Agent Based On Deep Reinforcement Learning

Posted on:2023-11-23

Degree:Master

Type:Thesis

Country:China

Candidate:C G Shi

Full Text:PDF

GTID:2532306914456564

Subject:Computer technology

Abstract/Summary:

With the development of computer technology,the development and design of combat game platforms has gradually become the research focus of major institutions.However,there is less research work on agent modeling on combat platforms,especially the application of deep reinforcement learning technology to mixed units.The accusation is a major difficulty in the modeling of combat agents.This thesis proposes a modeling method of mixed-units command and control agent based on deep reinforcement learning.The specific design and research work includes:First of all,for the problem of homogeneous-units accusation,a deep reinforcement learning agent based homogeneous units is proposed and designed,which solves the problem of complex action commands through a hierarchical action space scheme,and reconstructs the reward function by adding additional rewards such as distance rewards.Through simulation experiments,compares the effectiveness of single-moment and multi-moment representation methods for state representation.Secondly,in response to the problem of heterogeneous-units accusation,a heterogeneous arms accusation agent based on deep reinforcement learning is proposed and designed,which aligns the granularity of action commands by means of knowledge rules and reinforcement learning hierarchical action spaces,and solves the interaction through independent training.Inconsistent frequency problem,and combined with a variety of multi-agent reinforcement learning algorithms to complete the task of heterogeneous arms command and control.Finally,based on the above two accusation problems,a mixed-units accusation agent based on deep reinforcement learning is designed and implemented,and two formation schemes are proposed:first homogeneous division and then heterogeneous division,and first heterogeneous division and then homogeneous division,so as to solve the problem of a large number of combat units and a large action space.In the state space design scheme,the multi-time state design method is used to improve the effectiveness of state representation,and the reward function is reconstructed to improve the training efficiency for dense rewards.The experimental results show that the combat agent designed in this thesis can complete the combat deduction with good effect in the combat platform.

Keywords/Search Tags:

reinforcement learning, intelligent confrontation, Markov decision process, multi-agent

Related items

1	Research On Reinforcement Learning Countermeasures For Heterogeneous Multi-agents With Sparse Rewards
2	Multi-agent Deep Reinforcement Learning In Multi-UAV Confrontation
3	Application Research Of Reinforcement Learning On Multi-agent Competition
4	Research On Three-dimensional Trajectory Design And Resource Scheduling Optimization Algorithm For Complex UAV Network
5	Multi-agent Affective Decision Learning Method With Its Application In Flow Intelligent Transportation
6	Research On Confrontation Of Multiple Space Vehicles Maneuvering Strategy
7	Cooperative Reconnaissance And Attack Assistance Decision Based On Reinforcement Learning
8	Intelligent Anti-jamming Decision-making Scheme Based On Reinforcement Learning
9	The Design And Implementation Of Asymmetric Air Combat Multi-agent System Based On Reinforcement Learning
10	Research On Deep Reinforcement Learning Algorithm For Intelligent Military Decision