Reinforcement learning in environments with independent delayed-sense dynamics

Posted on:2009-10-16

Degree:M.Sc

Type:Thesis

University:University of Alberta (Canada)

Candidate:Shahamiri, Masoud

Full Text:PDF

GTID:2448390002997949

Subject:Computer Science

Abstract/Summary:

This thesis is a detailed investigation into applying reinforcement learning to environments with independent delayed-sense dynamics (IDSD), where some of state variables evolve independently of both agent's actions and other state variables, and can be sensed only after a delay. These independent state variables are analogous to disturbances, since they are independent of control actions and are not observable before the agent commits a course of action.;In this thesis, we first formalize IDSD problems and then develop four reinforcement learning algorithms that exploit the structure of IDSD problems to achieve better efficiency. Two of the algorithms are partially model-based and two are model-free. We discuss that for the same amount of experiments the quality of the policy learned by the proposed algorithms is better than that of learned by conventional reinforcement learning algorithms.;We demonstrate the effectiveness of our algorithms by applying them to traffic grid-world problems and to a hybrid vehicle problem, in which the traffic and driver acceleration play the role of the independent state variable respectively. We show experimentally that our algorithms evaluate a given policy more accurately than the corresponding TD(0). We also show that in the case of control, the learning speeds of our algorithms are substantially higher than the learning speed of conventional reinforcement learning algorithms that do not use the knowledge of the IDSD structure.

Keywords/Search Tags:

Reinforcement learning, Environments with independent delayed-sense dynamics, IDSD problems

Related items

1	Reinforcement learning control with approximation of time-dependent agent dynamics
2	Research On Reinforcement Learning Methods Towards Unfixed Tasks And Non-static Environments
3	Research On Agent Decision-making And Control Based On Deep Reinforcement Learning
4	Research On Reinforcement Learning Algorithms For Complex Problems
5	Path Planning For Mobile Platforms In Known Environments Based On Deep Reinforcement Learning
6	On the convergence of model -free policy iteration algorithms for reinforcement learning: Stochastic approximation under discontinuous mean dynamics
7	A Learning Based Adaptive Network Selection Strategy In Dynamic Hetnet Environments
8	Reputation-oriented reinforcement learning strategies for economically-motivated agents in electronic market environments
9	Supervised Reinforcement Learning:methods And Applications
10	Research On Platform Independent Adaptive Streaming Media Transmission Based On Reinforcement Learning