Learning by imitation and exploration: Bayesian models and applications in humanoid robotics

Posted on:2008-09-07

Degree:Ph.D

Type:Thesis

University:University of Washington

Candidate:Grimes, David B

Full Text:PDF

GTID:2448390005974038

Subject:Engineering

Abstract/Summary:

PDF Full Text Request

Learning by imitation is an important mechanism for rapid acquisition of new skills in humans and robots. A critical requirement for learning by imitation is the ability to reason under uncertainty. Uncertainty arises during the process of observing the teacher as well as from the imitator's own dynamics and interactions with the environment. This dissertation introduces new probabilistic methods for learning novel, generalizable behaviors in a humanoid robot via imitation. At the heart of my approach is a proposed algorithm for selecting actions based on probabilistic inference in a learned Bayesian network. This inference-based action selection technique affords an efficient, straightforward method of exploiting rich, yet uncertain, sensory data gathered by the robot.; Many existing methods for planning robotic actions require an engineer to explicitly model the complex physics of the robot and its environment. This process can be costly, tedious, error-prone, brittle, and inflexible to changes in the environment or the robot. The method I propose involves learning a predictive model of the robot's dynamics, represented directly in terms of sensor measurements, solely from exploration. Experiments are performed with a Fujitsu HOAP-2 25-degrees-of-freedom humanoid robot and the Webots dynamic simulation software. I present results demonstrating that the robot can learn dynamically stable, full-body imitative motions simply by observing a human demonstrator and performing explorative learning. Additional results show how the inference-based action selection technique can be used for policy learning, where sensory feedback can be used to adapt behavior online. I present policy learning results for a lifting behavior (learned via imitation) that generalizes to a wide range of objects of novel, unknown density. Besides imitation-based learning, this dissertation makes other contributions to the emerging area of robotic learning. First, intractability due to very high-dimensional state and control spaces is tackled using dimensionality reduction techniques. Second, nonparametric techniques are introduced to handle the problem of learning and inference with continuous-valued random variables.; Ultimately, this thesis seeks to contribute novel ideas which one day may form the basis for a powerful human-robot interface which allows people to quickly and effortlessly train robots to perform new skills.

Keywords/Search Tags:

Robot, Imitation, New, Humanoid

PDF Full Text Request

Related items

1	Research On The Balance Control Of Motion Posture In Imitation Learning Of Humanoid Robot
2	Research On Imitation Posture Judgment Strategy Of Humanoid Robot Based On Machine Learning
3	Research On HMM-based Humanoid Robot Motion Imitation Learning Method
4	Research On Imitation Learning Based On Trajectory Matching In Motion Behavior Of Humanoid Robot
5	Study On Humanoid Action Imitation Of Robot Based On Deep Reinforcement Learning
6	Research On Imitation Of NAO Robot Based On Kinect Sensor
7	Research On Kinect-Based NAO Robot Motion Imitation
8	Imitation Of Simple Finger Design And Motion Control Of Humanoid Robot
9	Research On Facial Expression Recognition And Representation Method For Humanoid Robot
10	Research Of Robot Arm System With Imitation Learning Mechanism