Reinforcement learning acquires information through interaction with the environment and then uses that information to improve the policy. In recent years, reinforcement learning has received extensive attention in academia and has become an important branch of machine learning. Traditional reinforcement learning in large-scale state spaces usually represents the value function with parametric function approximation, which suffers from slow convergence and low policy precision. Nonparametric function approximation is a flexible, fully sample-based form of function approximation with high accuracy and fast convergence, and it is consistent with the fundamental principle of reinforcement learning.

Building on existing algorithms, this paper combines nonparametric function approximation with reinforcement learning and proposes:

(1) To address the low sample efficiency of reinforcement learning, a Prioritized Sweeping Based Nonparametric LSPI algorithm, which combines prioritized sweeping with nonparametric function approximation and reduces exploration through model learning;

(2) To address the high time complexity of algorithms based on nonparametric function approximation, a Sparse Sample-based Gaussian Process Policy Iteration algorithm, which models the reinforcement learning problem with Gaussian processes and uses kernel sparsification to reduce redundancy in the sample space (illustrated in the sketch below).

These algorithms improve accuracy and convergence speed by increasing sample efficiency and reducing redundant exploration.
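To make the kernel sparsification step in contribution (2) concrete, the following is a minimal sketch of one common approach, approximate linear dependence (ALD), for discarding samples that are nearly linear combinations of already-kept samples in feature space. The function names, the RBF kernel choice, and the threshold `nu` are illustrative assumptions, not the thesis's actual implementation.

```python
# Hypothetical sketch of ALD-based kernel sparsification (not the thesis's code).
import numpy as np


def rbf_kernel(x, y, sigma=1.0):
    """Gaussian (RBF) kernel between two state vectors."""
    diff = np.asarray(x, dtype=float) - np.asarray(y, dtype=float)
    return np.exp(-np.dot(diff, diff) / (2.0 * sigma ** 2))


def build_sparse_dictionary(samples, nu=0.1, sigma=1.0):
    """Greedily select a sparse dictionary of samples.

    A candidate is kept only if its ALD residual exceeds `nu`, i.e. it cannot
    be well approximated in feature space by the samples already kept.
    """
    dictionary = []
    K_inv = None  # inverse of the kernel matrix over the current dictionary
    for x in samples:
        kxx = rbf_kernel(x, x, sigma)
        if not dictionary:
            dictionary.append(x)
            K_inv = np.array([[1.0 / kxx]])
            continue
        # kernel vector between the candidate and the current dictionary
        k_vec = np.array([rbf_kernel(d, x, sigma) for d in dictionary])
        a = K_inv @ k_vec                # best linear reconstruction weights
        delta = kxx - k_vec @ a          # ALD residual (novelty measure)
        if delta > nu:
            # grow the dictionary; update the inverse kernel matrix incrementally
            n = len(dictionary)
            K_inv_new = np.zeros((n + 1, n + 1))
            K_inv_new[:n, :n] = K_inv + np.outer(a, a) / delta
            K_inv_new[:n, n] = -a / delta
            K_inv_new[n, :n] = -a / delta
            K_inv_new[n, n] = 1.0 / delta
            K_inv = K_inv_new
            dictionary.append(x)
    return dictionary


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    states = rng.uniform(-1.0, 1.0, size=(500, 2))  # redundant 2-D state samples
    sparse_set = build_sparse_dictionary(states, nu=0.1, sigma=0.5)
    print(f"kept {len(sparse_set)} of {len(states)} samples")
```

The incremental block-inverse update avoids refactorizing the kernel matrix for every candidate, which is one way such a sparsification step can keep the cost of fitting the subsequent Gaussian process model manageable.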