Research On Improved Search And Rescue Optimization Algorithm Based On Reinforcement Learning

Posted on:2024-03-06

Degree:Master

Type:Thesis

Country:China

Candidate:W J Zhou

Full Text:PDF

GTID:2568307124486204

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

The search and rescue optimization algorithm(SAR)is a meta-heuristic optimization algorithm proposed in 2020 for solving constrained optimization problems in engineering.As a new type of swarm intelligence optimization algorithm,SAR is flexible,simple,easy to implement and not easily trapped in local optimality.SAR consists two phase,namely social phase and individual phase.The former is used for local search,while the latter is used for global search.The two phases alternate.In this update strategy,individuals are unable to adaptively select the appropriate operator according to their own state,which affects the convergence performance of SAR.In this dissertation,the shortcomings of SAR are analyzed and the algorithm is further improved so as to extend its applications.The main work is as follows:(1)To solve the problem of slow convergence of SAR,an improved SAR named i SAR is proposed.A total of 25 functions,including single-peak function,multi-peak function,multi-modal function and composite function,are selected to test the performance of i SAR.The experimental results show that i SAR outperforms SAR,particle swarm optimization,gray wolf optimization,squirrel search and whale optimization in 25 function tests.(2)To solve the problem that SAR cannot adaptively select operators,an improved SAR named RLSAR is proposed based on reinforcement learning,which combines reinforcement learning with SAR.Using individuals of SAR as agents,a reinforcement learning model is established and the A3 C network is used for training.The training results show that the convergence speed of RLSAR is better than that of SAR.In addition,RLSAR is used to solve the rotating round table balancing pendulum problem,and the experimental results show that RLSAR can plan a better minimum envelope circle radius than SAR,which verifies the good convergence ability of RLSAR.(3)To solve the unmanned aerial vehicles path planning problem,the RLSARPP algorithm is proposed.A path adjustment phase is proposed on the basis of RLSAR.The three actions of RLSAR-PP are the path adjustment phase and the social and personal phases of SAR.RLSAR-PP is trained in randomly generated environments.The experimental results show that RLSAR-PP plans a more economical and safer path than SAR,the squirrel search algorithm and the whale optimization algorithm.The rationality of the above three action designs is verified through ablation experiments and particle execution sequence analysis,which also indicate that they have a positive impact on RLSAR-PP and enable it to effectively solve the path planning problem of unmanned aerial vehicles.

Keywords/Search Tags:

reinforcement learning, search and rescue optimization algorithm, path planning, balanced performance constraints, packing layout optimization problem, path adjustment, unmanned aerial vehicle

PDF Full Text Request

Related items

1	Research On Online Path Planning Algorithm For Unmanned Equipment With Energy Constraints
2	Research And Application Of A Novel Swarm Intelligence Optimization Technique
3	Research On UAV Path Planning Strategy Based On Deep Reinforcement Learning For Data Collection
4	The Research On Heuristic Ant Colony Optimization For The Circle And Rectangle Packing Problem With Equilibrium Constraints And Its Application
5	Path Planning For Mobile Robot And Rescue Based On Improved Ant Colony Optimization Algorithm
6	Research On TSP-D Optimization Algorithm Based On Drone And Distribution Vehicle
7	Research On Path Planning Of Reconnaissance Unmanned Vehicle In Extremely Dangerous Environment
8	Research And Application Of Hybrid Flower Pollination Algorithm Based On Improved Search Strategy
9	Path Allocation And Planning For Multiple Unmanned Land Vehicle System
10	Research On Robot Path Planning Based On Fusion Of Reinforcement Learning And Heuristic Search Algorithms