Discrete-time control algorithms and adaptive intelligent systems designs

Posted on:2008-05-15

Degree:Ph.D.

Type:Dissertation

University:The University of Texas at Arlington

Candidate:Al-Tamimi, Asma Azmi

GTID:1448390005954694

Subject:Engineering

Abstract/Summary:

In this work, approximate dynamic programming (ADP) designs based on adaptive critic structures are developed to solve discrete-time H2/H∞ optimal control problems in which the state and action spaces are continuous. The work considers linear discrete-time systems as well as nonlinear discrete-time systems that are affine in the input. For linear systems, the research yields forward-in-time reinforcement learning algorithms that converge to the solution of the Generalized Algebraic Riccati Equation (GARE). For the nonlinear case, a forward-in-time reinforcement learning algorithm is presented that converges to the solution of the associated Hamilton-Jacobi-Bellman (HJB) equation.

The results in the linear case can be thought of as a way to solve the GARE of the well-known discrete-time H∞ optimal control problem forward in time. Four design algorithms are developed: Heuristic Dynamic Programming (HDP), Dual Heuristic Dynamic Programming (DHP), Action-Dependent Heuristic Dynamic Programming (ADHDP), and Action-Dependent Dual Heuristic Dynamic Programming (ADDHP). The significance of these algorithms is that some of them, particularly ADHDP, do not require a priori knowledge of the plant model to solve the dynamic programming problem.

Another major outcome of this work is a convergent policy iteration scheme based on the HDP algorithm that allows neural networks to be used to approximate the value function of the discrete-time HJB equation arbitrarily closely. This online algorithm may be implemented in a way that requires only partial knowledge of the model of the nonlinear dynamical system.

The dissertation includes detailed proofs of convergence for the proposed algorithms: HDP, DHP, ADHDP, ADDHP, and the nonlinear HDP. Practical numerical examples are provided to show the effectiveness of the developed optimization algorithms. For nonlinear systems, a comparison with methods based on the State-Dependent Riccati Equation (SDRE) is also presented. In all the provided examples, parametric structures such as neural networks are used to find compact representations of the value function and optimal policies for the corresponding optimal control problems.
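
To make the idea of solving the Riccati equation forward in time more concrete, the following is a minimal sketch for a scalar system. It is not the dissertation's algorithm, which tunes critic and action networks online. It only shows the model-based value recursion that an HDP-style iteration would be expected to track. Everything in it is an assumption made for illustration: the scalar plant x[k+1] = a*x[k] + b*u[k] + e*w[k], the stage cost q*x^2 + r*u^2 - gamma^2*w^2, the function name `hdp_scalar_game`, and the numeric values.

```python
def hdp_scalar_game(a, b, e, q, r, gamma, iters=500, tol=1e-12):
    """Value iteration on the kernel p of V(x) = p * x**2, starting from p = 0.

    Hypothetical scalar zero-sum game (illustrative assumption):
        x[k+1] = a*x[k] + b*u[k] + e*w[k]
        stage cost = q*x**2 + r*u**2 - gamma**2 * w**2
    The control u minimizes and the disturbance w maximizes the cost.
    """
    g2 = gamma ** 2
    c = b * b / r - e * e / g2  # combined effect of control and disturbance
    p = 0.0
    for _ in range(iters):
        # The maximization over w is only well posed while gamma^2 - e^2 * p > 0.
        if g2 - e * e * p <= 0.0:
            raise ValueError("gamma is too small for a saddle point to exist")
        # One backup V_{j+1}(x) = min_u max_w [cost + V_j(x_next)], in closed form.
        p_next = q + a * a * p / (1.0 + c * p)
        if abs(p_next - p) < tol:
            return p_next
        p = p_next
    return p


def saddle_gains(p, a, b, e, r, gamma):
    """Linear feedback gains (u = k_u * x, w = k_w * x) implied by the kernel p."""
    d = 1.0 + p * (b * b / r - e * e / gamma ** 2)
    z = a / d  # closed-loop next state per unit of current state
    return -p * b * z / r, p * e * z / gamma ** 2


# Illustrative values only; they are not taken from the dissertation.
p_star = hdp_scalar_game(a=0.9, b=1.0, e=0.5, q=1.0, r=1.0, gamma=2.0)
k_u, k_w = saddle_gains(p_star, a=0.9, b=1.0, e=0.5, r=1.0, gamma=2.0)
print(p_star, k_u, k_w)
```

At convergence, `p_star` is a fixed point of the backup, which is the scalar form of the game Riccati equation under the assumptions above. Setting `e = 0` removes the disturbance and reduces the recursion to the familiar H2 (linear quadratic) case.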

Keywords/Search Tags:

Discrete-time, Dynamic programming, Algorithms, Systems, Optimal control, HDP

Related items

1	Researches On Optimal Tracking Control For A Class Of Discrete-Time Nonlinear Systems With Time Delays Based On Heuristic Dynamic Programming
2	Research On Adaptive Dynamic Programming Optimal Control Strategy For Data-Driven Discrete Time-Delay Systems
3	Research On Optimal Disturbances Rejection Control Approaches For Discrete-Time Systems With Time-Delays
4	Research On The Consensus Control Algorithms For Multi-agent Systems Based On Adaptive Dynamic Programming
5	Studies On The Approximate Approaches Of Optimal Control For Nonlinear Discrete Systems
6	Adaptive Dynamic Programming Theory On Optimal Control Scheme For Several Classes Of Nonlinear Time-delay Systems
7	Adaptive Optimal Control For Continuous Nonlinear Systems Based On Adaptive Dynamic Programming
8	Optimal Disturbance Rejection Methods For Systems With Time-delays In High-speed Networks
9	Approximate Design To Optimal Output Tracking Controllers For Discrete-time Systems With Time-delay
10	Sub-optimal Control Of A Class Of Nonlinear Singularly Perturbed Systems Based On Adaptive Dynamic Programming