The Research Of Improved Q-Learning Based On Physarum Polycephalum Algorithm On Shortest Path Problem

Posted on:2020-05-11

Degree:Master

Type:Thesis

Country:China

Candidate:J Zhu

Full Text:PDF

GTID:2428330575996931

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

The shortest path problem is a classic problem in theoretical research such as graph theory analysis and algorithm design.It is also a basic problem in practical application fields such as logistics planning and driving navigation.Due to features of model free and offline learning,Q-Learning Algorithm can solve shortest path problem without establishing a specific environment model and combining with other optimization algorithm easily,and the result shows Q-Learning Algorithm performed well in solving shortest path problem.This thesis proposed improved Q-Learning Algorithm to solve shortest path problem aimed at shortcomings of Q-Learning Algorithm contains large initial search space,low searching efficiency in middle stage and unstable convergence in late stage.To solve Q-Learning Algorithm's drawbacks,an Adaptive Choosing Q-learning based on Physarum Polycephalum Algorithm(ACQPPA)is proposed.ACQPPA preprocesses network through Physarum Polycephalum Algorithm to reduce size of initial search space by replace initial Q matrix with local optimal path matrix.ACQPPA changes exploration rate dynamically to ensure diversity of solutions by giving exploration rate high initial value and converge stably by reducing exploration rate to low value with iteration of algorithm through Simulated Annealing Algorithm.Simulation results show that the success rate of the ACQPPA algorithm converges to the optimal path is 100% when the number of network nodes is 128,which is higher than 80% of the ACO algorithm and Q(?)algorithm,and the number of iterations is lower than 57.2% of Q-Learning,32.9% of ACO algorithm and 35.1% of Q(?)algorithmAiming at the problem of slow searching in middle stage,and accelerate efficiency of preprocessing of network by Physarum Polycephalum Algorithm,this thesis proposes a Bidirectional ACQPPA based on Network Compression(B-ACQPPANC).B-ACQPPANC compresses size of network by CH Algorithm to save time which cost by preprocessing of network through Physarum Polycephalum Algorithm,and starts a reverse direction exploration by combining bidirectional thoughts with Q-Learning to accelerate efficiency in middle stage through store experience in early stage.Simulation results show that average success frequencies of converging to global shortest path of B-ACQPANC lower 43.26% than ACQPPA,and the success frequencies of B-QNC Algorithm and B-Q(?)NC Algorithm reduced 39.40% and 29.76% respectively than Q-Learning Algorithm and Q(?)Algorithm.The results also show that the success rate of B-ACQPPA converges to the optimal path is 100%,which is higher than 80% of B-QNC Algorithm and 20% of B-Q(?)NC Algorithm.

Keywords/Search Tags:

Shortest path problem, Q-Learning Algorithm, Physarum Polycephalum Algorithm, Simulated Annealing Method, CH Algorithm, Bidirectional exploration

PDF Full Text Request

Related items

1	Studies Of Bio-inspired Optimization Algorithm Based On Positive Feedback Mechanism
2	Optimization And Application Of The Physarum-Inspired Algorithm
3	The Application Of Heuristic Algorithm In VRP Based On The Simulated Annealing Algorithm
4	The Research Of Router Nodes Placement Problem Based On Simulated Annealing--Genetic Algorithm In Wireless Mesh Networks
5	Applied Research Of Hybrid Genetic Algorithm And Simulated Annealing Algorithm In Traveling Salesman Problem
6	Research On Path Planning Of Mobile Robot Based On Genetic Algorithm
7	Research On Dynamic Route Guidance System Based On Improved Genetic Algorithm
8	The Research On The Optimization Of Multimodal Trans-Port In Mix Genetic Algorithm Based On Simulated Annealing
9	Path Planning In A Static Environment Based On Genetic Simulated Annealing Algorithm
10	Application Research Of Adaptive Simulated Annealing Polymorphic Ant Colony Optimization Algorithm To Mobile Robot Path Planning