Improved Q-learning-based Algorithm Research On Navigation Knowledge Acquisition

Posted on:2007-06-07

Degree:Master

Type:Thesis

Country:China

Candidate:Z D He

Full Text:PDF

GTID:2132360212985901

Subject:Control theory and control engineering

Abstract/Summary:

Based on an analysis of mobile robot navigation control and of the similarity between reactive navigation and the reinforcement learning model, this thesis applies reinforcement learning to intelligent navigation and focuses on navigation knowledge acquisition based on the Q-learning algorithm.

The main reinforcement learning algorithms are studied: the Temporal Difference (TD) algorithm, the Adaptive Heuristic Critic (AHC) algorithm, and the Q-learning algorithm. The key problems in reinforcement learning are analyzed and some solutions are presented: the tradeoff between exploration and exploitation, continuous states and actions, credit assignment, and partially observable Markov decision processes. Distributed reinforcement learning is introduced: its algorithms are briefly presented, and its problems and their solutions are discussed.

The action selection strategy in the standard Q-learning algorithm is greedy, i.e., pure exploitation, which tends to get trapped in local optima. Some solutions have been proposed, but they suffer from blind exploration and from repeated learning after the optimal path has been found. An improved Q-learning algorithm based on exploration region expansion is proposed to avoid both local optima and blind exploration. In addition, when the agent encounters an obstacle it seeks another feasible path, which makes the algorithm easier to implement on a real robot. An automatic termination condition is also put forward, so that redundant learning after the optimal path is found is avoided and learning time is reduced. The validity of the algorithm is demonstrated by simulation experiments.

Generalization methods for reinforcement learning with continuous states and actions are analyzed. Because previous solutions have difficulty with continuous actions, a neural-network-based Q-learning method for continuous states and actions is proposed. This resolves the curse of dimensionality in reinforcement learning and makes the implementation of reinforcement learning on robots feasible.
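For readers unfamiliar with the baseline the abstract builds on, the following is a minimal sketch of standard tabular Q-learning for grid navigation. It is not the thesis's improved algorithm: exploration here is plain epsilon-greedy rather than exploration region expansion, and the stopping rule (halt once the greedy path reaches the goal and stays unchanged for `patience` consecutive episodes) is only an assumed stand-in for the automatic termination condition mentioned above. The grid layout, the reward values, and every name and parameter (`GRID`, `step`, `greedy_path`, `train`, `alpha`, `gamma`, `epsilon`, `patience`) are hypothetical choices made for illustration.

```python
import random

# Hypothetical 5x5 grid: 0 = free cell, 1 = obstacle. Not taken from the thesis.
GRID = [
    [0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 0, 1, 0],
    [1, 1, 0, 1, 0],
    [0, 0, 0, 0, 0],
]
START, GOAL = (0, 0), (4, 4)
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right


def step(state, action):
    """Apply an action; moves into walls or obstacles leave the agent in place."""
    r, c = state[0] + action[0], state[1] + action[1]
    if not (0 <= r < len(GRID) and 0 <= c < len(GRID[0])) or GRID[r][c] == 1:
        return state, -5.0, False   # blocked: penalty, agent stays put
    if (r, c) == GOAL:
        return (r, c), 100.0, True  # goal reached: episode ends
    return (r, c), -1.0, False      # step cost favours short paths


def greedy_path(q, max_len=50):
    """Follow the highest-valued action from START; return the visited cells."""
    state, path = START, [START]
    for _ in range(max_len):
        a = max(range(len(ACTIONS)), key=lambda i: q[state][i])
        state, _, done = step(state, ACTIONS[a])
        path.append(state)
        if done:
            break
    return path


def train(alpha=0.5, gamma=0.9, epsilon=0.2, patience=20,
          max_episodes=5000, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration and an assumed stop rule."""
    rng = random.Random(seed)
    q = {(r, c): [0.0] * len(ACTIONS)
         for r in range(len(GRID)) for c in range(len(GRID[0]))}
    last_path, stable = None, 0
    for episode in range(1, max_episodes + 1):
        state = START
        for _ in range(200):  # cap on steps per episode
            if rng.random() < epsilon:
                a = rng.randrange(len(ACTIONS))  # explore
            else:
                a = max(range(len(ACTIONS)), key=lambda i: q[state][i])  # exploit
            nxt, reward, done = step(state, ACTIONS[a])
            # Q-learning update: move Q(s, a) toward r + gamma * max_a' Q(s', a').
            target = reward if done else reward + gamma * max(q[nxt])
            q[state][a] += alpha * (target - q[state][a])
            state = nxt
            if done:
                break
        # Assumed termination test: greedy path reaches the goal and is unchanged.
        path = greedy_path(q)
        stable = stable + 1 if (path[-1] == GOAL and path == last_path) else 0
        last_path = path
        if stable >= patience:
            return q, path, episode
    return q, greedy_path(q), max_episodes
```

Calling `q, path, episodes = train()` returns the learned table, the greedy path from `START`, and the number of episodes run. Stopping on a stable greedy path, instead of running a fixed episode budget, is one simple way to avoid the redundant learning the abstract describes; the criterion actually used in the thesis may differ.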

Keywords/Search Tags:

Reinforcement learning, Q-learning, exploration region expansion, simulated annealing, neural network

Related items

1	Investigation Of The Span Design Of High Speed Railway Bridge Based On Reinforcement Learning
2	Research On Routing Technology In UAV Communication Network
3	Research On Unmanned Strategy Learning Method Based On TORCS Simulation Platform
4	Research On Traffic Signal Control Based On Deep Reinforcement Learning
5	Collision Avoidance For Indoor UAV Based On Deep Reinforcement Learning
6	Object Recognition And Learning Control Based On Deep Neural Networks For Intelligent Vehicles
7	Research On Intelligent Control Technology Of Hydraulic Excavator Based On Reinforcement Learning
8	Research And Implementation Of Unmanned Vehicle Path Planning Based On Reinforcement Learning
9	Resource Allocation And Path Planning For UAV Network Based On Reinforcement Learning
10	Autonomous Behavior-learning And Planning Of AUV Space Motion