Hyperparameter Optimization Of Identification Algorithm Based On Reinforcement Learning

Posted on:2022-02-26

Degree:Master

Type:Thesis

Country:China

Candidate:X Qi

Full Text:PDF

GTID:2480306572451324

Subject:Control Science and Engineering

Abstract/Summary:

PDF Full Text Request

In the system identification theory,there are many identification algorithms with parameters that need to be set manually,which can be called hyperparameters,such as the decaying weight of fading memory least square method and the hyperparameters of neural networks,etc.The configuration of hyperparameters in the identification algorithm significantly affects the identification accuracy.And the identification of different systems often require different hyperparameter configurations.Therefore,setting the hyperparameters reasonably is the key to achieving high-precision system identification.In this paper,a series of optimization methods for the hyperparameters of identification algorithms based on reinforcement learning was proposed.Hyperparameters optimization in identification algorithms is a typical black box optimization problem.So model-free reinforcement learning algorithms can be used to solve this problem.Since the decaying weight of fading memory least square need to be selected in a continuous space,DDPG algorithm was chosen to solve the decaying weight optimization problem.In order to improve the search efficiency,this method is modified as follows:First,process the instant reward to limit its upper and lower limits.Then,select the value of the current decaying weight with the best performance as the initial state at the beginning of each episode.These improvements can ensure that the reinforcement learning agent learns effectively in each step and searches more in the high-performance values.In order to verify the performance of the forgetting factor optimization method based on DDPG,conduct several optimization experiments on the forgetting factor in the fading memory least square algorithm.The experimental results shows that this method can effectively optimize the decaying weight for different identification objects,making sure the fading memory least square method to realize high-precision system identification.In order to optimize the hyperparameter configuration of a neural network,select the hyperparameter to be optimized and its candidate values at first.Then,use the Q-learning algorithm to solve this hyperparameter optimization problem.In order to speed up the progress,do not set fixed initial state and terminal state for each episode in Q-learning algorithm.To verify the performance of the neural network hyperparameter optimization method based on Q-learning,this method is used to conduct several optimization experiments on the hyperparameters of a BP neural network.The experimental results show this hyperparameter optimization method can effectively optimize the network hyperparameters for the nonlinear systems identification,so that it can achieve highprecision system identification.

Keywords/Search Tags:

Optimization Problem, Reinforcement Learning, System Identification, Fading Memory Least Square, BP Neural Networks

PDF Full Text Request

Related items

1	Solution And Optimization Of TSP Problem On Reinforcement Pointer Network
2	Study On The Model Of Action Selection Based On Reinforcement Learning
3	Time Series Forecasting Based On Gated Neural Networks And Reinforcement Learning
4	Long-time Dynamical Behavior Of Global Solutions For The Equation With Fading Memory
5	Research On Geophysical Inversion Based On Reinforcement Learning
6	Reinforcement Learning-based Black-box Evasion Attacks To Link Prediction In Dynamic Graphs
7	Method Development For Predicting Protein Subcellular Localization Based On Deep Learning
8	Partial Observation Of Memory-based Reinforcement Learning Problems In Markov Decision Process
9	SgRNA Activity Prediction Method Based On Reinforcement Learning
10	Research And Application Of Biomedical Named Entity Recognition Based On Reinforcement Learning