Robust Control For Uncertain Semi-Markov Decision Processes Based On Performance Potentials

Posted on:2008-03-16

Degree:Master

Type:Thesis

Country:China

Candidate:Y Cheng

Full Text:PDF

GTID:2178360215451391

Subject:Computer application technology

Abstract/Summary:

PDF Full Text Request

As a basic model of discrete event dynamic systems (DEDS), semi-Markov decision process is aiming to address the problem of stochastic sequential decisions. Its performance analysis and optimization may be very helpful to most real systems. In SMDPs, transition probabilities and instant performance are crucial to the system's performance accumulation. However, due to the difficulty of modeling and external disturbance, the exact transition probabilities are always hard to get in practical, and the instant performance may varies with the change of outside environment as well. On behalf of the need in such uncertain systems, this paper is concerned with the robust control for a class of SMDPs with uncertain parameters, while finding a policy that generates optimal performance in the worst case is a major task.According to the structure of embedded chain, SMDP can be categorized as recurrent chain, unichain and multichain. Consider the simplest model at first. A policy-iteration algorithm is provided to search the optimal robust control policy for the recurrent chain in the case of independent parameters. Then we discuss the convergence of policy iteration. As for the case of dependant parameters, we introduce the application of genetic algorithm in detail. For multichain, given the existence of multiple recurrent classes and transient states, the optimal robust control policy is hard or even impossible to obtain. With the constraint of some assumptions, we study the optimal equation for multichain SMDPs under the average criteria, and the corresponding policy iteration is also provided. Since the states number is increasing, in order to improve the efficiency, we introduce a parallel genetic algorithm for multichain under the case of dependant parameters. With the parallel search on multiple processors, the optimization can be accelerated greatly. Here two implementation of migration operation are suggested for the parallel genetic algorithms. Since the unichain can be treated as a special multichain with only one recurrent class, so that all the algorithms developed for multichain can be applied directly to the unichain models.Several numerical examples are presented to illustrate the application of these algorithms, and we analyze the effectiveness of these algorithms by our experiment results.

Keywords/Search Tags:

Semi-Markov decision process, Robust control, Performance potential, Policy iteration, Genetic algorithm

PDF Full Text Request

Related items

1	Continuous-Time Unified MAXQ Algorithm And Its Application
2	Optimal Robust Decision Problems For Uncertain Semi-Markov Systems
3	Asynchronous Optimization Algorithms For SMDP Based On Performance Potential
4	The Design And Implementation Of Point-based POMDP Policy Iteration Algorithm
5	Performance Potential-based NDP Optimization Approaches And Application Research For SMDP
6	Robust Control Of Single Conveyor-Serviced Production Station
7	Parallel Algorithms For Large-Scale Markov Decision Processes Based On Performance Potentials
8	Look-ahead Control For CSPS Model Based On Learning
9	Research On Power Control And Performance Optimization For Large Scale Network Service Systems
10	Research On Motion Control Of Mobile Robots Based On Reinforcement Learning