Teaching old dogs new tricks: Incremental multimap regression for interactive robot learning from demonstration

Posted on: 2011-06-04
Degree: Ph.D.
Type: Dissertation
University: Brown University
Candidate: Grollman, Daniel H.
Full Text: PDF
GTID: 1448390002967177
Subject: Engineering
Abstract/Summary:
We consider autonomous robots as having associated control policies that determine their actions in response to perceptions of the environment. Often, these controllers are explicitly transferred from a human via programmatic description or physical instantiation. Alternatively, Robot Learning from Demonstration (RLfD) can enable a robot to learn a policy from observing only demonstrations of the task itself. We focus on interactive, teleoperative teaching, where the user manually controls the robot and provides demonstrations while receiving learner feedback. With regression, the collected perception-actuation pairs are used to directly estimate the underlying policy mapping.

This dissertation contributes an RLfD methodology for interactive, mixed-initiative learning of unknown tasks. The goal of the technique is to enable users to implicitly instantiate autonomous robot controllers that perform desired tasks as well as the demonstrator, as measured by task-specific metrics. We show that with standard regression techniques, such "on-par" learning is restricted to policies typified by a many-to-one mapping (a unimap) from perception to actuation. Thus, controllers that are representable as multi-state Finite State Machines (FSMs), and that therefore exhibit a one-to-many mapping (a multimap), cannot be learned. To learn them, we must address three issues: model selection (how many subtasks or FSM states), policy learning (for each subtask), and transitioning (between subtasks). Previous work in RLfD has assumed knowledge of the task decomposition and learned the subtask policies or the transitions between them in isolation.

We instead address both model selection and policy learning simultaneously. Our technique uses an infinite mixture of experts and treats the multimap data from an FSM controller as being generated from overlapping unimaps. The algorithm automatically determines the number of unimap experts (model selection) and learns a unimap for each one (policy learning). On both synthetic and robot-soccer multimap data, we show that the discovered subtasks can be switched between to re-perform the original task. While not at the same level of skill as the demonstrator, the resulting approximations represent a significant improvement over approximations of the same tasks learned with unimap regression.
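To illustrate the core idea, the following is a toy sketch in Python with NumPy, not the dissertation's algorithm: a finite mixture of K = 2 linear experts fit by EM stands in for the infinite mixture of experts (which infers the number of experts from the data), and the two-branch synthetic multimap is an invented example. It shows why a single unimap regressor fails on one-to-many data, averaging the branches, while separate experts recover each underlying unimap.

    import numpy as np

    rng = np.random.default_rng(0)

    # Synthetic multimap: each perception x maps to one of two actuation
    # branches (y = x or y = 1 - x), as if two FSM states overlapped.
    n = 200
    x = rng.uniform(0.0, 1.0, n)
    branch = rng.integers(0, 2, n)
    y = np.where(branch == 0, x, 1.0 - x) + rng.normal(0.0, 0.02, n)

    X = np.column_stack([x, np.ones(n)])  # linear features with a bias term

    # Unimap baseline: one least-squares fit over all pairs averages the
    # two branches and represents neither.
    w_uni, *_ = np.linalg.lstsq(X, y, rcond=None)

    # Mixture of K linear experts fit by EM.
    K = 2
    W = rng.normal(0.0, 1.0, (K, 2))   # per-expert regression weights
    sigma2 = np.full(K, 0.1)           # per-expert noise variances
    pi = np.full(K, 1.0 / K)           # mixing proportions

    for _ in range(50):
        # E-step: responsibility of each expert for each datum.
        resid = y[None, :] - W @ X.T   # shape (K, n)
        logp = (np.log(pi)[:, None]
                - 0.5 * np.log(2.0 * np.pi * sigma2)[:, None]
                - 0.5 * resid**2 / sigma2[:, None])
        logp -= logp.max(axis=0, keepdims=True)
        r = np.exp(logp)
        r /= r.sum(axis=0, keepdims=True)

        # M-step: weighted least squares per expert.
        for k in range(K):
            Rk = r[k]
            A = X.T @ (Rk[:, None] * X)
            b = X.T @ (Rk * y)
            W[k] = np.linalg.solve(A, b)
            sigma2[k] = (Rk * (y - X @ W[k])**2).sum() / Rk.sum()
            pi[k] = Rk.mean()

    print("unimap fit (slope, bias): ", np.round(w_uni, 2))
    print("expert fits (slope, bias):", np.round(W, 2))

On this data the single fit collapses toward a flat line near y = 0.5, while the two experts converge to roughly (slope, bias) = (1, 0) and (-1, 1), recovering the individual unimaps that can then be switched between.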
Keywords/Search Tags: Robot, Regression, Multimap, Interactive, Unimap