The Analysis And Research Of Exploration Strategies And Algorithms In Reinforcement Learning

Posted on:2017-05-18

Degree:Master

Type:Thesis

Country:China

Candidate:Z H Chen

Full Text:PDF

GTID:2308330485466188

Subject:Control Science and Engineering

Abstract/Summary:

PDF Full Text Request

Reinforcement learning is a kind of important Robot Learning. Unlike most of machine learning, Reinforcement Learning is not order agent to know how to pick up the correct action, but through Trial-and-Error to find out what actions will produce maximum reward. Such imprecise information feedback form, improve the control requirements as lacking of information environment. Reinforcement Learning is not only in artificial intelligence, also in certain areas, such as motion control, mobile robot path planning with fairly strong applicability.In an unknown environment, Reinforcement Learning basic idea is to mimic the natural human way of learning. The goal oriented, in the process of interaction with the environment, through the heuristic learning on convergence to the optimal strategy. Reinforcement Learning is also the process of interaction with the environment necessary knowledge completeness, and action selection mechanism both need to exploit learned knowledge and explore of the unknown knowledge. Exploitation and Exploration both have advantages and disadvantages. The balance between them affected agent, which is vital to understand the environment and the optimization of algorithm performance.After clarifying the basic theories of Reinforcement Learning, there is a detailed analysis of the Reinforcement Learning classic exploration strategies including: Random Walk,ε-Greedy Strategy, Softmax Method and Probabilistic Action Selection Mechanism. Reinforcement Learning is aimed at reducing the uncertainty of the selection operation. Due to the lack of tools analyzing search strategies, introducing of Exploration Entropy concept to quantitatively describe and analyze the exploration strategy. And based on Exploration Entropy, the conditions deciding to terminate the algorithm are presented.In the experimental part, this paper verified the applicability of the Exploration Entropy in both macro and micro aspects. We adopted different search strategies to simulate a Markovian state transition problem and a more complex Learning Control problem of micro 1/2 spin quantum systems. Further proved, the Exploration Entropy as a tool can provide a reference index for analyzing the algorithm performance.

Keywords/Search Tags:

Reinforcement Learning, Exploration Strategy, Exploration Entropy

PDF Full Text Request

Related items

1	Research On Active Exploration Reinforcement Learning Algorithm
2	Research On Exploration Strategy In Deep Reinforcement Learning
3	Mapless Exploration Navigation Based On Reinforcement Learning
4	Exploration Strategy Of Deterministic Policy In Deep Reinforcement Learning
5	Research On Intelligent Exploration Algorithm Of Reinforcement Learning
6	Research On Exploration Enhancement Deep Reinforcement Learning Methods
7	Researches On Efficient Exploration Driven By Reward Function
8	Using Task Prior In Reinforcement Learning Exploration
9	Hierarchical reinforcement learning using automatic task decomposition and exploration shaping
10	Research On Model-based Deep Reinforcement Learning With Active Exploration