Navigation Control For Autonomous Mobile Robots Based On Reinforcement Learning

Posted on:2011-12-19

Degree:Master

Type:Thesis

Country:China

Candidate:J G Ren

Full Text:PDF

GTID:2178330338489636

Subject:Control Science and Engineering

Abstract/Summary:

PDF Full Text Request

Autonomous mobile robot can work in the complex and unstructured environment without human intervention. It has a high degree of self-planning and adaptability in uncertain environments. The navigation technology is a key to the intelligence of mobile robots. Robots need to learn quickly in order to improve the ability of adapting unknown environments, and resolve complex problems without complete knowledge. Reinforcement learning algorithm provides the self-studying ability, and has been received extensive attention in the field of the navigation of autonomous mobile robot systems.Reinforcement learning imitates the process of animal learning, and seeks the optimal mapping from states to actions based on trial and error method. Traditional learning algorithms always have several problems, such as slow convergence, time constraints when estimating delayed reward, memory-consuming and time-consuming during calculation. In this thesis, relative value iteration reinforcement learning (RVI- RL) algorithm is proposed with reasonable deformation, which is based on the traditional theory of reinforcement learning, and combined with the relative value iterarion theory and the optimization theory. This algorithm is based on the Markov decision process (MDP) environments, and does not need to estimate the average reward.RVI-RL algorithm is a model-free algorithm, which not only eliminates the estimation of the average reward in the task, but also eliminates the bias due to the constant modification of estimation error of the average reward constantly. The classical taxi problem is taken as our experiment model. The environment is initialized as a grid map. However, the information is unknown to the taxi. The taxi drives autonomously through trial and error techniques to obtain the state information of the environment, and ultimately gets the optimal mapping from states to the actions.Experiments show that RVI-RL algorithm could converges much faster and stable than R-learning algorithm and Q-learning algorithm to the optimal strategy in the navigation of autonomous mobile robots. The new algorithm shows the ability of inducing cooperation between two robots, which is more appropriate than the traditional Q-learning algorithm in the collaborative planning problem. Although RVI-RL algorithm eliminates the estimation of the average reward, there still exists "Dimension Disaster" problem in the large state space. This thesis also discusses the feasibility of introducing the stratification into the RVI-RL algorithm, and experiments show the better convergence to the optimal strategic than MAXQ algorithm.

Keywords/Search Tags:

autonomous mobile robot, navigation, reinforcement learning, relative value iteration

PDF Full Text Request

Related items

1	The Research On Autonomous Mobile Robot Navigation Based On Reinforcement Learning
2	Research On Reinforcement Learning Methods For Navigation And Control Of Autonoumous Mobile Robots
3	Reinforcement Learning Based Mobile Robot Autonomous Navigation
4	Autonomous Learning And Navigation Control For Mobile Robots Based On Reinforcement Learning
5	Research On Indoor Localization And Navigation Technology Of Mobile Robot
6	Research On Autonomous Navigation Of Mobile Robot Based On Reinforcement Learning
7	Research On Autonomous Robot Navigation In Unknown Dynamic Environment Based On Safe Reinforcement Learning
8	Research On Autonomous Navigation Of Omnidirectional Mode Mobile Robot For Environmental Detection
9	A Mobile Robot Control Method In Complex Scenes Based On Deep Reinforcement Learning
10	Reinforcement Learning Based Research On Robot Foraging Problem