
Learning state and action space hierarchies for reinforcement learning using action-dependent partitioning

Posted on: 2007-02-12    Degree: Ph.D    Type: Dissertation
University: The University of Texas at Arlington    Candidate: Asadi, Mehran    Full Text: PDF
GTID: 1458390005983226    Subject: Computer Science
Abstract/Summary:
Autonomous systems are often difficult to program. Reinforcement learning (RL) is an attractive alternative, as it allows the agent to learn behavior on the basis of sparse, delayed reward signals provided only when the agent reaches desired goals. Recent attempts to address the curse of dimensionality in RL have turned to principled ways of exploiting temporal abstraction, where decisions are not required at each step but instead invoke temporally extended activities that follow their own policies until termination. This leads naturally to hierarchical control architectures and associated learning algorithms. This dissertation reviews several approaches to temporal abstraction and hierarchical organization that machine learning researchers have recently developed, and presents a new method for the autonomous construction of hierarchical action and state representations in reinforcement learning, aimed at accelerating learning and extending the scope of such systems.

In this approach, the agent uses information acquired while learning one task to discover subgoals for similar tasks. The agent is able to transfer knowledge to subsequent tasks and to accelerate learning by creating useful new subgoals and by learning the corresponding subtask policies off-line as abstract actions (options). At the same time, the subgoal actions are used to construct a more abstract state representation through action-dependent state space partitioning. This representation forms a new level in the state space hierarchy and serves as the initial representation for new learning tasks (the decision layer). To ensure that tasks remain learnable, value functions are built simultaneously at different levels of the hierarchy, and inconsistencies between them are used to identify the actions with which relevant portions of the abstract state space are refined.

This representation serves as the first layer of the hierarchy. To estimate the structure of the state space for learning future tasks, the decision layer is constructed from an estimate of the expected time to learn a new task and from the system's experience with previously learned tasks. Together, these techniques permit the agent to form increasingly abstract action and state representations over time. Experiments in deterministic and stochastic domains show that the presented method can significantly outperform learning on a flat state space representation.
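The options-and-subgoals mechanism summarized above can be illustrated with a small sketch. The following Python example is a minimal, hypothetical illustration and not the dissertation's actual algorithm or experimental domains: it runs tabular, SMDP-style Q-learning on a toy deterministic corridor, once with primitive actions only and once with an added "reach subgoal" option whose policy is assumed to have already been learned off-line. The corridor environment, the fixed subgoal state, and all function and variable names are illustrative assumptions.

import random
from collections import defaultdict

# Minimal, hypothetical sketch (not the dissertation's algorithm or domains):
# a 1-D corridor task used to show how a "reach subgoal" option, learned for
# an earlier task, can be invoked as a single abstract action inside
# SMDP-style Q-learning.

N_STATES = 10                 # states 0..9, goal at state 9
GOAL = N_STATES - 1
PRIMITIVES = [-1, +1]         # move left / move right
GAMMA, ALPHA, EPS = 0.95, 0.1, 0.1

def step(state, action):
    """Deterministic transition; reward 1 only on reaching the goal."""
    nxt = min(max(state + action, 0), GOAL)
    return nxt, (1.0 if nxt == GOAL else 0.0), nxt == GOAL

def greedy(q, s, actions):
    best = max(q[(s, a)] for a in actions)
    return random.choice([a for a in actions if q[(s, a)] == best])

def q_learning(episodes, actions, execute):
    """Tabular Q-learning; `execute` runs one (possibly temporally extended) action."""
    q = defaultdict(float)
    for _ in range(episodes):
        s, done = 0, False
        while not done:
            a = random.choice(actions) if random.random() < EPS else greedy(q, s, actions)
            s2, r, done, k = execute(s, a)            # k = number of primitive steps taken
            target = r + (0.0 if done else (GAMMA ** k) * max(q[(s2, b)] for b in actions))
            q[(s, a)] += ALPHA * (target - q[(s, a)])
            s = s2
    return q

def run_primitive(s, a):
    s2, r, done = step(s, a)
    return s2, r, done, 1

# Assume a subgoal (e.g. a useful intermediate state found while learning an
# earlier task) has been identified; here it is simply fixed at state 5.
SUBGOAL = 5
# In the dissertation the subtask policy is learned off-line; here it is given.
option_policy = {s: (+1 if s < SUBGOAL else -1) for s in range(N_STATES)}

def run_option(s, a):
    """Execute a primitive action, or run the 'to_subgoal' option until it terminates."""
    if a != "to_subgoal":
        return run_primitive(s, a)
    total_r, k, done = 0.0, 0, False
    while s != SUBGOAL and not done:
        s, r, done = step(s, option_policy[s])
        total_r += (GAMMA ** k) * r
        k += 1
    return s, total_r, done, max(k, 1)

if __name__ == "__main__":
    flat = q_learning(200, PRIMITIVES, run_primitive)
    hier = q_learning(200, PRIMITIVES + ["to_subgoal"], run_option)
    print("flat:        V(0) =", round(max(flat[(0, a)] for a in PRIMITIVES), 3))
    print("with option: V(0) =", round(max(hier[(0, a)] for a in PRIMITIVES + ["to_subgoal"]), 3))

The sketch deliberately omits the dissertation's central contributions, namely how subgoals are discovered from earlier tasks, how action-dependent partitioning builds the abstract state representation, and how value-function inconsistencies drive refinement; it only shows the basic mechanism by which a subgoal option acts as a single abstract action during learning.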
Keywords/Search Tags: State, Reinforcement learning, Space, Action, Representation, Agent