
Bayesian exploration in Markov decision processes

Posted on: 2008-09-11
Degree: M.Sc
Type: Thesis
University: McGill University (Canada)
Candidate: Castro, Pablo Samuel
Full Text: PDF
GTID: 2448390005471014
Subject: Computer Science
Abstract/Summary:
Markov Decision Processes are a mathematical framework widely used for stochastic optimization and control problems. Reinforcement Learning is a branch of Artificial Intelligence that deals with stochastic environments in which the dynamics of the system are unknown. A major issue for learning algorithms is the need to balance the exploration of new experiences with the exploitation of existing knowledge. We present three methods for handling this exploration-exploitation tradeoff in Markov Decision Processes. The approach taken is Bayesian, in that we maintain and update a model estimate. The existence of an optimal policy for Bayesian exploration has been shown, but its computation is infeasible. We present three approximations to the optimal policy based on statistical sampling.

The first approach uses a combination of Linear Programming and Q-learning, and we present empirical results demonstrating its performance. The second approach extends this idea, and we prove theoretical guarantees along with empirical evidence of its performance. Finally, we present an algorithm that adapts efficiently to the amount of computation time available. This algorithm is derived as an approximation to an infinite-dimensional linear program, for which we guarantee convergence and prove strong duality.
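To make the Bayesian model-based setting concrete, the sketch below illustrates one standard sampling-based approximation in this family: maintain a Dirichlet posterior over each state-action pair's transition distribution, sample a model from the posterior, plan in the sampled model, and update the posterior from observed transitions (posterior, or Thompson, sampling). This is an illustrative assumption, not the thesis's specific algorithms; the toy MDP, rewards, and all parameter values are invented for the example.

```python
import random

# Toy 2-state, 2-action MDP with unknown transitions. The agent keeps a
# Dirichlet posterior per (state, action) and explores by posterior sampling:
# sample one model, act greedily in it, observe, update.
# (Sketch only; the thesis's LP/Q-learning approximations are not shown.)

N_STATES, N_ACTIONS = 2, 2
GAMMA = 0.9

# True (hidden) dynamics and rewards, used only to simulate the environment.
TRUE_P = {(0, 0): [0.9, 0.1], (0, 1): [0.2, 0.8],
          (1, 0): [0.7, 0.3], (1, 1): [0.1, 0.9]}
REWARD = {(0, 0): 0.0, (0, 1): 0.1, (1, 0): 0.0, (1, 1): 1.0}

def sample_dirichlet(alpha):
    """Draw one distribution from Dirichlet(alpha) via Gamma variates."""
    draws = [random.gammavariate(a, 1.0) for a in alpha]
    total = sum(draws)
    return [d / total for d in draws]

def value_iteration(P, n_iters=200):
    """Compute Q-values for the sampled transition model P."""
    Q = {(s, a): 0.0 for s in range(N_STATES) for a in range(N_ACTIONS)}
    for _ in range(n_iters):
        V = [max(Q[(s, a)] for a in range(N_ACTIONS)) for s in range(N_STATES)]
        for (s, a), probs in P.items():
            Q[(s, a)] = REWARD[(s, a)] + GAMMA * sum(
                p * V[s2] for s2, p in enumerate(probs))
    return Q

def run(episodes=300, horizon=20, seed=0):
    random.seed(seed)
    # Dirichlet(1, ..., 1) prior: one pseudo-count per successor state.
    counts = {(s, a): [1.0] * N_STATES
              for s in range(N_STATES) for a in range(N_ACTIONS)}
    s = 0
    for _ in range(episodes):
        # Sample one transition model from the posterior, then plan in it.
        P = {sa: sample_dirichlet(alpha) for sa, alpha in counts.items()}
        Q = value_iteration(P)
        for _ in range(horizon):
            a = max(range(N_ACTIONS), key=lambda act: Q[(s, act)])
            s2 = random.choices(range(N_STATES), weights=TRUE_P[(s, a)])[0]
            counts[(s, a)][s2] += 1.0  # posterior update from the observation
            s = s2
    return counts

counts = run()
```

Randomness in the sampled model drives exploration early on, when the posterior is diffuse; as transition counts accumulate, sampled models concentrate around the true dynamics and the policy converges to exploitation.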
Keywords/Search Tags: Decision, Bayesian, Exploration