Research On Reinforcement Learning Based Control Method Of Magnetic Navigation AGV

Posted on:2019-01-10

Degree:Master

Type:Thesis

Country:China

Candidate:S T Zhao

Full Text:PDF

GTID:2428330548476316

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

With the continuous development of the flexible manufacturing systems,automated warehouse system and other advanced production system,AGV(Automatic Guided Vehicle),as a wheeled mobile robot,plays an increasingly important role in logistics and advanced manufacturing systems.The path following control is a key technology to achieve AGV high precision guidance control.In order to avoid a significant amount of manual parameter tuning for the design of the magnetic navigation AGV path following controller when the system parameters are unknown and apply the previously learned knowledge to speed up training when the algorithm is applied to another AGV with different parameters,this thesis proposed two novel methods.The main content includes the following two aspects:(1)This thesis designs a path following controller using an improved model-free deep reinforcement learning algorithm.Based on the kinematic and dynamics model of magnetic navigation AGV derived in the discrete-time domain,the path following problem is formulated as continuous-state,continuous-action Markov decision process.Two neural networks are used to implement an Actor-Critic architecture off-policy model-free reinforcement learning algorithm as a controller implementation.The temporal-difference algorithm and the deterministic policy gradient algorithm are respectively used to update the parameters of Critic and Actor network to approach the optimal action value function and strategy function respectively.(2)Introduce representation learning method to speed up the training process when algorithm is applied to a new AGV with different parameters.By using auto-encoder technique to learn a general mapping from input signals to state representations to disentangles the state representation learning and the task-related strategy learning.When algorithm is applied to a new AGV,the pre-trained encoder replaces the first hidden layer of the network to implement the process of encoding the input signal into a state vector,thereby shortening the training process of the algorithm.Besides,this thesis presents a synchronous training auto-encoder method.The auto-encoder is trained synchronously by using the data generated during the training process of the tracking control algorithm,so that the training process of the auto-encoder is accelerated and the utilization rate of the algorithm is improved.

Keywords/Search Tags:

Automatic Guided Vehicle, Path Following, Reinforcement Learning, Deep Reinforcement Learning, Transfer Learning

PDF Full Text Request

Related items

1	Research On Laser Navigation AGV Control Method Based On Reinforcement Learning
2	Research And Implementation Of Reinforcement Learning Method About Transport Strategy Between Carrier-based Aircraft Station
3	Supervised Reinforcement Learning:methods And Applications
4	Research On Multi-AGVs Path Planning And Scheduling Technology Based On Reinforcement Learning
5	Study On Emergency Escape Route Planning Based On Reinforcement Learning
6	Path Planning For Mobile Platforms In Known Environments Based On Deep Reinforcement Learning
7	Reinforcement Learning Based On Spectral Graph Theory
8	Research On Security Deep Reinforcement Learning Based On Experiences
9	Research On Group Confrontation Strategies Based On Deep Reinforcement Learning
10	Application Of Artificial Potential Field Algorithm Optimized By Deep Reinforcement Learning In Indoor Escape Path Planning