A binary dynamic programming problem with affine transition and reward functions: Properties and algorithm

Posted on:2004-07-17

Degree:Ph.D

Type:Dissertation

University:Georgia Institute of Technology

Candidate:Gatica, Ricardo Antonio

Full Text:PDF

GTID:1450390011453374

Subject:Engineering

Abstract/Summary:

We consider a deterministic, binary-decision, dynamic programming problem with affine transition and reward functions (BATR). We limit our analysis to stationary threshold policies, which we prove to be optimal for the problem. The distinctive feature of our approach to the problem is that, instead of focusing directly on the set of threshold values, we base our analysis on a set

J

of decision sequences that serve as unique representatives of threshold policies. More specifically, we show that a decision sequence J is in

J

if and only if it is a unique representative of some threshold policy. This suggests that instead of solving the continuous optimization problem of finding an optimal threshold value, we may solve an equivalent discrete optimization problem of finding an optimal decision sequence in

J

. Two main results support that conclusion: First, we show that under a lexicographic order

≺

, the average reward function w is unimodal on (

J

≺

). Second, we show that the average reward of every non-periodic sequence can be approximated to any desired level of precision by the average reward of a periodic decision sequence. Therefore the search for an optimal decision sequence can be restricted to the subset of periodic decision sequences in

J

, which can easily be shown to be countable. Based on these results we develop an approximation scheme for BATR that resembles a binary search over the set

K

that contains the periods of the periodic sequences in

J

. Given a value &egr; > 0, the algorithm finds an &egr;-optimal periodic sequence and provides an associated &egr;-optimal threshold value.

Keywords/Search Tags:

Problem, Reward, Sequence, Decision, Threshold, Optimal, Periodic

Related items

1	Optimal Control Of Discrete-Time Systems:Average-Reward-Based Reinforcement Learning Methods
2	Periodic Dividend Optimization Problem In A Markovian Environment Model Under The Control Of Survival Probability
3	Discrete Time Markov Decision Processes Based On Variance Constraint
4	Variance Optimization For Continuous-time Markov Decision Processes
5	Study On The Solving Of Optimal Assignment Problem Based On Intelligent Computation And Its Application
6	Optimal Threshold Class Algorithms For Sparse Split Feasible Problem
7	Pigment Dispersal Factor Signaling Regulates Threat-Reward Decision-Making In Caenorhabditis Elegans
8	The Study On Optimization Method For Sequential Decision-making In Supply Chain
9	The Optimal Control Problem Of State-dependent Impulse Model Under Random Influence
10	Some Results Of Almost Periodic Type Sequences