Apprenticeship learning and reinforcement learning with application to robotic control

Posted on:2009-11-17

Degree:Ph.D

Type:Dissertation

University:Stanford University

Candidate:Abbeel, Pieter

Full Text:PDF

GTID:1448390002992529

Subject:Engineering

Abstract/Summary:

Many problems in robotics have unknown, stochastic, high-dimensional, and highly nonlinear dynamics, and offer significant challenges to both traditional control methods and reinforcement learning algorithms. Some of the key difficulties that arise in these problems are: (i) It is often difficult to write down, in closed form, a formal specification of the control task. For example, what is the objective function for "flying well"? (ii) It is often difficult to build a good dynamics model because of both data collection and data modeling challenges (similar to the "exploration problem" in reinforcement learning). (iii) It is often computationally expensive to find closed-loop controllers for high dimensional, stochastic domains.;We describe learning algorithms with formal performance guarantees which show that these problems can be efficiently addressed in the apprenticeship learning setting---the setting when expert demonstrations of the task are available. Our algorithms are guaranteed to return a control policy with performance comparable to the expert's. We evaluate performance on the same task and in the same (typically stochastic, high-dimensional and non-linear) environment as the expert.;Besides having theoretical guarantees, our algorithms have also enabled us to solve some previously unsolved real-world control problems: They have enabled a quadruped robot to traverse challenging, previously unseen terrain. They have significantly extended the state-of-the-art in autonomous helicopter flight. Our helicopter has performed by far the most challenging aerobatic maneuvers performed by any autonomous helicopter to date, including maneuvers such as continuous in-place flips, rolls and tic-tocs, which only exceptional expert human pilots can fly. Our aerobatic flight performance is comparable to that of the best human pilots.

Keywords/Search Tags:

Reinforcement learning, Performance

Related items

1	Study On The Improved Average Reward Reinforcement Learning Algorithm Based On Performance Potentials
2	Apprenticeship learning and reinforcement learning with application to robotic control
3	Continuous Time Hierarchical Reinforcement Learning Algorithm
4	Research On Reinforcement Learning Based Control Method Of Magnetic Navigation AGV
5	High-Performance IEEE 802.15.4 MAC Protocol Based On Reinforcement Learning
6	Reinforcement Learning Based On Spectral Graph Theory
7	Study Of Multi-agent Learning Problem Based On Reinforcement Learning
8	Research On Sample-efficient Reinforcement Learning Methods
9	Research And Implementation Of Reinforcement Learning Method About Transport Strategy Between Carrier-based Aircraft Station
10	Research On Reinforcement Learning Algorithm And Equilibrium Of Multi-Agent Game