Cooperative Reconnaissance And Attack Assistance Decision Based On Reinforcement Learning

Posted on:2023-02-08

Degree:Master

Type:Thesis

Country:China

Candidate:Z Z Shen

Full Text:PDF

GTID:2542307061953589

Subject:Control theory and application

Abstract/Summary:

PDF Full Text Request

With the continuous development of technology,the battlefield situation is becoming more and more complex,and there may be mistakes in using only manual decision-making.Using reinforcement learning to assist in the decision-making of collaborative operations command in complex environments is a current research direction to solve this problem.Aiming at the large-scale reconnaissance decision-making and long-distance cooperative attack assistance decision-making of multi-agent system in large scenes,this paper proposes a training algorithm based on reinforcement learning,designs the corresponding reward value function,and validates it through simulation experiments.This paper mainly completes the following work:1.This part addresses the problem of reinforcement learning formation reconnaissance in large scenes,and a PPO-QMIX multi-agent formation reconnaissance algorithm is designed.The behavior of the multi-agent system is divided into two parts: formation adjustment and synchronous movement reconnaissance.The multi-agent reinforcement learning algorithm QMIX is designed to adjust the formation,and the reinforcement learning algorithm PPO is designed to realize the marching reconnaissance.The effectiveness of the algorithm is verified by several sets of simulation experiments in different scenarios.2.This part addresses the problem of multi-target and long-distance reinforcement learning cooperative attack assistance,and a PPO-QMIX intelligent attack assistance decision-making algorithm is designed.The behavior of the multi-agent system is divided into two parts: the overall path planning and the cooperative operation.The PPO algorithm is designed to realize the shortest path planning decision,and the QMIX algorithm is designed to complete the operational optimal decision.In addition,the state boundary of the multi-agent system of the two-part behavior is set,and the switching conditions of the decision algorithm are given.The effectiveness of the algorithm is verified by several sets of simulation experiments in different scenarios.3.This part addresses the problem of multi-agent reconnaissance and attack assistance decision problem in large scenes,and a two-stage training framework is designed based on the algorithms in the first two chapters.In the first stage,the multi-agent formation investigation reinforcement learning algorithm is used to train a network model for completing the investigation task,and the investigation model is used for investigation in the second stage,Then,according to the detected information,the subsequent cooperative operation model simulation is completed,and a cooperative operation model is trained by using multi-agent cooperative operation reinforcement learning algorithm.The effectiveness of the algorithm is verified by several sets of simulation experiments in different scenarios.

Keywords/Search Tags:

Intelligent decision, Reinforcement learning, Multi-agent system, PPO, QMIX

PDF Full Text Request

Related items

1	The Design And Implementation Of Asymmetric Air Combat Multi-agent System Based On Reinforcement Learning
2	Multi-agent Affective Decision Learning Method With Its Application In Flow Intelligent Transportation
3	Design And Implementation Of Combat Agent Based On Deep Reinforcement Learning
4	Research On Deep Reinforcement Learning Algorithm For Intelligent Military Decision
5	Research On Decision-Making Of Beyond-Visual-Range Air Combat Based On Multi-Agent Reinforcement Learning
6	Application Research Of Multi-agent Reinforcement Learning In Multiple Target-ships Collision Avoidance Decision-making
7	Research On Reinforcement Learning Countermeasures For Heterogeneous Multi-agents With Sparse Rewards
8	Multi-agent Reinforcement Learning Algorithm Design And ZYNQ Implementation For UAV Online Decision Making
9	Cooperative Overtaking Strategy Based On Multi-agent Reinforcement Learning
10	Research On Confronting Policy Generation Method Of Multi-Agent System Based On Reinforcement Learning