On-line Human Simulated Intelligent Control Tuning Using Reinforcement Learning

Posted on:2011-10-05

Degree:Master

Type:Thesis

Country:China

Candidate:X Q Gan

GTID:2178360308458854

Subject:Pattern Recognition and Intelligent Systems

Abstract/Summary:

Human Simulated Intelligent Control (HSIC) is based on the recognition of multiple controllers and a multi-mode control structure. A notable characteristic of this control is the switching between the proportional (closed-loop) mode and the holding (open-loop) mode. HSIC breaks with the single-mapping information processing structure of traditional control theory and resolves the conflict among stability, accuracy, and speed of response, which makes the control of complex systems feasible.

However, because of its multi-mode controller and multi-control structure, HSIC has many feature parameters and control parameters, so the controller is hard to design. In addition, the system changes with its environment and is subject to uncertainty, so the control parameters cannot keep the same values throughout the process and must be adjusted in a timely manner. For the AVR system, learning the HSIC parameters can improve response speed and the quality of real-time control.

Online learning and optimization of parameters play an important role in control. The biggest difference between online and offline parameter learning is that online learning can adapt to a dynamic environment, whereas offline learning suits only a static one. Methods for online parameter learning include Simulated Annealing (SA), Particle Swarm Optimization (PSO), the simplex method, and reinforcement learning. This thesis studies an AVR system and uses reinforcement learning to learn the HSIC parameters online.

Continuous action reinforcement learning automata (CARLA) is a reinforcement learning method that searches for parameter values in a continuous space. The algorithm associates a continuous probability density function (CPDF) with each decision variable. Over successive iterations it modifies the parameters, which eventually converge to stable values. Each modification is determined by the value of a reinforcement signal.

This thesis uses CARLA to learn online the parameters of an HSIC controller with a multi-mode, hierarchical control structure. Finally, the algorithm is tested on one system, where CARLA is used for online optimization of both the HSIC parameters and the PID control parameters. The parameters of this system are also optimized with a genetic algorithm (GA). Across controllers, HSIC performs better than the PID controller on this system. For the same controller, optimizing the parameters with CARLA gives much better results than using GA.
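
The CARLA loop described in the abstract (one probability density per decision variable, reshaped at each iteration by a reinforcement signal) can be illustrated with a minimal Python sketch. It follows a commonly described formulation of CARLA, not necessarily the one used in the thesis. The Gaussian reinforcement bump, the median/minimum form of the reinforcement signal, the constants `g_w` and `g_h`, the grid size, the history length, and the toy cost function are all assumptions made for illustration.

```python
import math
import random


class CarlaAutomaton:
    """One automaton per tuned parameter; its density is stored on a grid."""

    def __init__(self, x_min, x_max, n_grid=201, g_w=0.02, g_h=0.3):
        self.x_min, self.x_max = x_min, x_max
        self.span = x_max - x_min
        self.dx = self.span / (n_grid - 1)
        self.grid = [x_min + i * self.dx for i in range(n_grid)]
        self.pdf = [1.0 / self.span] * n_grid  # start from a uniform density
        self.g_w, self.g_h = g_w, g_h          # assumed bump width and height

    def _area(self):
        # Trapezoidal integral of the density over [x_min, x_max].
        return self.dx * (sum(self.pdf) - 0.5 * (self.pdf[0] + self.pdf[-1]))

    def sample(self, rng):
        """Draw an action by inverting the cumulative distribution."""
        target = rng.random() * self._area()
        acc = 0.0
        for i in range(len(self.grid) - 1):
            step = 0.5 * (self.pdf[i] + self.pdf[i + 1]) * self.dx
            if acc + step >= target:
                frac = (target - acc) / step if step > 0 else 0.0
                return self.grid[i] + frac * self.dx
            acc += step
        return self.x_max

    def update(self, action, beta):
        """Add a Gaussian bump at the chosen action, scaled by beta, then renormalize."""
        sigma = self.g_w * self.span
        peak = self.g_h / self.span
        self.pdf = [
            p + beta * peak * math.exp(-((x - action) ** 2) / (2 * sigma ** 2))
            for p, x in zip(self.pdf, self.grid)
        ]
        area = self._area()
        self.pdf = [p / area for p in self.pdf]

    def mode(self):
        """Grid point where the density is highest."""
        return self.grid[max(range(len(self.pdf)), key=self.pdf.__getitem__)]


def reinforcement(cost, history):
    """Reward in [0, 1]: 1 for the best recent cost, 0 at or above the median."""
    j_min = min(history)
    j_med = sorted(history)[len(history) // 2]
    if j_med <= j_min:
        return 0.0
    return min(1.0, max(0.0, (j_med - cost) / (j_med - j_min)))


def tune(cost_fn, bounds, iterations=500, memory=100, seed=0):
    """Run one automaton per parameter against a shared cost function."""
    rng = random.Random(seed)
    automata = [CarlaAutomaton(lo, hi) for lo, hi in bounds]
    history = []
    for _ in range(iterations):
        actions = [a.sample(rng) for a in automata]
        cost = cost_fn(actions)
        history = (history + [cost])[-memory:]  # window includes the current cost
        beta = reinforcement(cost, history)
        for automaton, action in zip(automata, actions):
            automaton.update(action, beta)
    return [a.mode() for a in automata]


def hypothetical_cost(params):
    # Stand-in for a closed-loop performance index; not the thesis's plant.
    return (params[0] - 2.0) ** 2 + (params[1] - 0.5) ** 2


if __name__ == "__main__":
    print(tune(hypothetical_cost, [(0.0, 4.0), (0.0, 1.0)]))
```

In an online setting, `cost_fn` would be replaced by a performance measure taken from the running control loop, so each iteration corresponds to one trial of the controller with the sampled parameters. Storing the density on a fixed grid is a simplification chosen here for brevity.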

Keywords/Search Tags:

HSIC, CARLA, CPDF, Online learning, Genetic algorithm

Related items

1	Based On Genetic Algorithm Personalized Data Structure Courses Online Learning System
2	Research On Personalized Online Learning System Model Based On Genetic Algorithm
3	Design And Implementation Of Online Learning Platform Based On Automatic Test Paper Grouping
4	Based On Improved Genetic Algorithm Of Online Test Intelligent Test Paper Generation System
5	Research On NN-HSIC Strategy Based On The MEA In The Inverted Pendulum System
6	Study On Application Of Motor Schema Based HSIC Theory On Motion Control For Soccer Robot
7	Application Of Optimization Of HSIC Parameter Based On Particle Swarm Optimization
8	Research On Evolution Analysis Methods Of Online Social Networks
9	Trajectory Tracking Control Of Three-wheel Omni-directional Mobile Robot Based On HSIC
10	Research And Implementation Of Online Examination System Based On Genetic Algorithm