Research On Self-driving Decision Algorithm Based On DDPG Algorithm

Posted on:2022-03-22

Degree:Master

Type:Thesis

Country:China

Candidate:K Z Jiang

Full Text:PDF

GTID:2492306572960479

Subject:Control Engineering

Abstract/Summary:

PDF Full Text Request

With the rapid development of computer technology,unmanned driving technology is gradually entering people’s field of vision and becoming possible.However,the realization of unmanned driving still needs many ways to go,such as safety,stability,versatility and other issues.Deep reinforcement learning algorithms combine the strategy generation ability of reinforcement learning with the approximation and fitting ability of deep learning to the model.,Can efficiently handle various complex and high-dimensional problems,and provide a way forward for unmanned driving.Many scholars have invested in the research of deep reinforcement learning algorithms.As a result,many excellent deep reinforcement learning algorithms have emerged.This article focuses on improving the DDPG algorithms that have been excellent in recent years,and conducted three sets of experiments based on this.The first set of experiments is the demonstration and simulation of the DDPG algorithm.First,the theoretical basis of the DDPG algorithm is discussed,and then the simulation platform TORCS that will be used in this subject is introduced,and some environment settings and mode settings of the simulation platform are given.The state data was analyzed,and finally the DDPG algorithm was simulated and analyzed on the platform,and improved ideas were put forward for the problem of too long training time and not many laps in the final completion of the DDPG algorithm.Next,we introduced the concept of entropy,designed an improved DDPG algorithm based on maximum entropy reinforcement learning,realized the breakthrough of the original algorithm from a single-step strategy to multiple distribution strategies,improved the problem of insufficient environmental exploration by deterministic strategies,and improved After the second set of experiments is completed,the data is sorted in the form of graphs and tables,and the algorithms before and after the improvement are rationally compared and analyzed,verifying the improved DDPG algorithm based on maximum entropy reinforcement learning.Effectiveness.Finally,in the second set of experiments,the problem of continuous collisions in certain corners in the initial training of the trolley was proposed.The idea of combining discrete and continuous was proposed.The original Actor network was modified and added to the original output layer.A set of parallel output layers of 6neurons,and the introduction of Gumbel-Softmax Trick method,designed an unmanned driving decision method based on gear control,and conducted a third set of experiments,which proved that the method can effectively improve The early exploration efficiency of the trolley has realized the diversification of algorithm control.

Keywords/Search Tags:

Deep Deterministic Policy Gradient Algorithm, Maximum entropy, Discrete and continuous form, Gear control, Neural Networks

PDF Full Text Request

Related items

1	Deep Deterministic Policy Gradient-based Smart Generation Control And Coordination Technology
2	Research On Real-time Trajectory Planning Of Parafoil Based On Deep Reinforcement Learning Algorithm Under Complex Constraints
3	Research Of Unmanned Driving Policy Based On Aggregated Multiple Deep Deterministic Policy Gradient
4	Operational Optimization Of Regional Integrated Energy System Based On State Enhanced Deep Deterministic Policy Gradient Algorithm
5	Research On Intelligent Operation Ticket System Based On Shaping-deep Deterministic Policy Gradient
6	Research On Driverless Control Policy Based On Deep Reinforcement Learning
7	Research Of Decision-making Technology For Intelligent Driving Based On Deep Reinforcement Learning Algorithm
8	Resource Allocation And Trajectory Optimization In Multi-UAV Assisted Vehicular Networks
9	Path Planning Algorithm For UAV Based On Intelligent And Cognition
10	Research On Reinforcement Learning Algorithm For Longitudinal Control Of Autonomous Vehicles