Research On Reinforcement Learning Methods For Navigation And Control Of Autonoumous Mobile Robots

Posted on:2011-10-31

Degree:Master

Type:Thesis

Country:China

Candidate:Z B Li

Full Text:PDF

GTID:2178330338490047

Subject:Control Science and Engineering

Abstract/Summary:

PDF Full Text Request

To improve the control performance and adaptive ability in unknown ervironments with machine learning, especially reinforcement learning, is an imorptant research topic in the navigation and control of mobile robots. Under the support of the National Natural Science Foundation Project -- Research on kernel-based reinfrocement learning and approximate dynamic planning method,this paper studied the performance evaluation of approximate policy iteration (API) algorithms, parameters optimization of kernel function in kernel-based least-squares policy iteration (KLSPI), autonomous obstacle avoidance with API and learning control of longitudinal velocity of autonomous vehicles.The main contributions and innovations of this paper can be summarized as follows:1,API algorithms were tested in their performances. By carrying out experiments and analyzing their results, the much better performance of API algorithm was validated. It is demonstratred that KLSPI can have better performance when solving sequential decision-making problems with smooth value functions. It is verified that whether is the sequential decision-making problems with smooth value functions or not will play an important role in the performance of approximate policy iteration. Then, a new sample sparseness method which is analysed byε-neigthbor is proposed based on KLSPI, and furthermore, a width optimization method of kernel function based on the Bellman error gradient descent was proposed. Simulation results indicated the efficiency of the method.2,A Markov Decision Processe (MDP) model was given for autonomous obstacle avoidance of mobile robots. As a result, a new autonomous obstacle avoidance method, combining the API algorithm with the rolling window planning method, was proposed for mobile robots in unknown environment. Furthermore, the generalization and adaptation of the proposed method was tested in simulation, and the reliability of the two API-based control methods for autonomous obstacle avoidance was analyzed comparatively. It was indicated that the KLSPI-based autonomous obstacle avoidance method converged to the near optimal policy more quickly than other methods.3,An analysis was given on the key difficulties of autonomous vehicles, and the development of control systems for autonomous vehicles with learning ability. The motion control of vehicles on high way was modeled as MDPs, and then, an API learning and contrl method was designed to control the longitudinal velocity of vehicles on high way. The simulation results show that the API learning and contrl method can realize high-precision control for the expected velocity,what's more, it lay the foundation for further research on learning control of autonomous vehicles.

Keywords/Search Tags:

Reinforcement Learning, Markov Decision Processes, Approximate policy iteration, Rolling Windows, Autonomous obstacle avoidance, Autonomous Vehicle

PDF Full Text Request

Related items

1	Research On Motion Control Of Mobile Robots Based On Reinforcement Learning
2	Obstacle Avoidance For Autonomous Vehicle Driving On Highways Based On MPC
3	Theories, Algortihms And Applications Of Policy Gradient Reinforcement Learning
4	Research On Reinforcement Learning Based On Value Function Approximation And State Space Decomposition
5	Obstacle Avoidance Of Autonomous Underwater Vehicle
6	Efficient approximate policy iteration methods for sequential decision making in reinforcement learning
7	Obstacle avoidance for AUVs (autonomous underwater vehicles)
8	Obstacle Avoidance And Path Planning Of Autonomous Ground Vehicle
9	Research On System Architecture And Obstacle Avoidance Control Technology Of Autonomous Underwater Vehicle
10	Implementation of autonomous navigation and obstacle avoidance on an unmanned ground vehicle