On Reinforcement Learning

Posted on:2010-10-06

Degree:Master

Type:Thesis

Country:China

Candidate:Y Zhao

Full Text:PDF

GTID:2208360275998375

Subject:Control theory and control engineering

Abstract/Summary:

PDF Full Text Request

Reinforcement learning is an important machine learning method. For gaining maximum return, reinforcement learning system can learn the optimal mapping policy through repeatedly interact with environment. Compared to the other learning technologies, the most remarkable advantage is that it can learn without any prior knowledge about the environment. Especially in unknown environment, reinforcement learning can also keep well adaptability and robustness.In real world, some learning problems have successive state space and action space. Some are difficult to be modeled and some have not only one objectives, etc. At the same time, they require quick learning speed and good adaptability. Therefore, for expanding application field, this thesis made systematically research on reinforcement learning and the primary solutions of some existing problems are presented.The basic idea, may steps and existing problems of reinforcement learning areintroduced based on a classical reinforcement learning algorithm——Q-learning algorithm.The experience storage and exploitation in reinforcement learning are researched. Based on the analytical on three existing experience storage methods, a new combinative method is researched. Following the dynamic character, combinative method has been improved for enhance utilization rate of experience and learning performance.The state space compression technologies are investigated. Considering discrete and successive state space separately, a entropy based reinforcement learning algorithm and an auto-generating neural network function approximator method for reinforcement learning are researched. The former makes use of the interrelated degree between state and learning target to make further compression to state space. The latter makes use of evolutionary algorithm to search fit structure and parameters of function approximator, which will ease decision burden of decision maker. Both of two algorithms can save computation and storage source. Thus, they can improve the efficiency of reinforcement learning.A multi-objective reinforcement learning algorithm is researched. Considering the uncommensurability among objectives, combining preference of decision maker with a fuzzy reasoning system can produce a scale to compare two actions. Thus, this method can guide the learning direction and make the learning speed more quickly.

Keywords/Search Tags:

Reinforcement Learning, Q-Learning, State Space, Function Approximation, Multi-Objective

PDF Full Text Request

Related items

1	Research On Reinforcement Learning Based On Value Function Approximation And State Space Decomposition
2	Research On Value Function Approximation Methods In Reinforcement Learning
3	Research On Basis Function Construction Methods In Reinforcement Learning
4	Study Of Reinforcement Learning Algorithms Based On Value Function Approximation
5	Research On The Reinforcement Learning Method And Its Application
6	Research On Learning Of The Optimal Policy In Largescale State Space
7	Research On Reinforcement Learning In Continuous Spaces
8	Multi-step Unified Approaches With Function Approximation In Reinforcement Learning
9	Research On Reinforcement Learning Methods Based On Fuzzy Approximation
10	Research On Nonparametric Value Function Approximation Reinforcement Learning