Research On Feature Selection In Reinforcement Learning

Posted on:2022-07-31

Degree:Master

Type:Thesis

Country:China

Candidate:J Lin

Full Text:PDF

GTID:2518306557468234

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

The problem of dimensional disasters in large-scale state and action spaces is currently a difficult problem in reinforcement learning research.The widely used technique is the method of value function estimation.However,there is a gap between the estimated value function and the true value function.In addition to the fitting ability of the learning method itself,the source of the error also has a great influence on the error due to the quality of the selected feature of the value function.Therefore,this paper focuses on the feature selection problem of reinforcement learning,and designs some algorithms for feature selection in value function estimation.The main work is as follows:(1)Aiming at the problem of the error between the value function estimation and the true value function,this paper proposes a greedy-based wrapper feature selection method to construct a good feature for the value function.And use the method of piecewise value function to deal with the unstable characteristics of feature selection.Experimental results show that this method improves the stability of feature selection and also enhances the accuracy of value function estimation.(2)Aiming at the problem of slow feature selection in reinforcement learning,and the greedy strategy that only considers the current optimal and short-sighted issues,this paper proposes a distributed top-k greedy method to select features based on the nature of the weak submodular function.Experimental results show that this method improves the speed of feature selection and explores better features.(3)The wrapper feature selection is based on the effect of the subsequent algorithm as the criterion for feature selection,and the long training of reinforcement learning leads to the difficulty of solving the time-consuming problem even if the distributed method is used.This paper proposes a filtering feature selection method based on experience replay,which generates a reinforcement learning data set through experience replay,and then directly completes the feature selection work on the data set,and finally only needs one reinforcement learning training to test the effect of feature selection.The experimental results show that the feature selection speed of the filtering method based on experience replay is significantly faster than that of the wrapper method.

Keywords/Search Tags:

Reinforcement Learning, Value Function Approximation, Feature Selection, Submodular Function, Distributed Optimization

PDF Full Text Request

Related items

1	Research On The Reinforcement Learning Method And Its Application
2	Study Of Reinforcement Learning Algorithms Based On Value Function Approximation
3	Submodular Maximization Problems In Combinatorial Optimization
4	Research On Value Function Approximation Methods In Reinforcement Learning
5	Research On Reinforcement Learning Methods Based On Fuzzy Approximation
6	Research On Nonparametric Value Function Approximation Reinforcement Learning
7	Researches On Reinforcement Learning Based On TileCoding Function Approximation
8	Sparse Value Function Approximation for Reinforcement Learning
9	Multi-step Unified Approaches With Function Approximation In Reinforcement Learning
10	Research On Basis Function Construction Methods In Reinforcement Learning