Koolio: Path-planning using reinforcement learning on a real robot in a real environment

Posted on: 2010-03-02

Degree: Ph.D.

Type: Dissertation

University: University of Florida

Candidate: Zamstein, Lavi Michael

GTID: 1448390002470602

Subject: Engineering

Abstract/Summary:

There are many cases where it is not possible to program a robot with precise instructions. The environment may be unknown, or the programmer may not even know the best way to solve a problem. In such cases, machine learning is useful for providing the robot, or agent, with a policy: a set schema for determining choices based on inputs.

The two primary groups of machine learning methods are Supervised Learning, in which a supervisor provides training data to help the agent learn, and Reinforcement Learning, which requires only a set of rewards for certain choices. Of the three categories of Reinforcement Learning (Dynamic Programming, Monte Carlo, and Temporal Difference), the Temporal Difference method known as Q-Learning was chosen.

Q-Learning is a Markov method which uses a weighted decision table to determine the best choice for any given set of sensor inputs. The values in this Q-table are calculated using the Q-formula, which weighs the expected value of a decision based on the known reward and uses a discounting factor to give more recent choices a greater effect on the values than older choices. The Q-table also allowed the learning to be modular, as a learning agent would need only the file containing the table to use the learned policy generated by a different agent.

Because of the large number of iterations required for Q-Learning to reach an optimal policy, a simulator was required. This simulator provided a means by which the agent could learn behaviors without concern for parts wearing down or an untrained robot colliding with a wall.

After a policy was found in simulation, the Q-table was transferred into Koolio, a refrigerator robot, to allow it to navigate the hallways with the experience gathered in simulation. This Q-table was then further refined through more learning on the real robot.
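
The Q-table, Q-formula, and file-based transfer described in the abstract can be illustrated with a minimal sketch of standard tabular Q-Learning. This is not the dissertation's code: the action names, the string-valued states, the learning rate `alpha`, the discount factor `gamma`, the epsilon-greedy exploration, and the JSON file format are all assumptions chosen for illustration.

```python
import json
import random
from collections import defaultdict

# Hypothetical action set; the real robot's actions are not given in the abstract.
ACTIONS = ["forward", "left", "right"]


def make_q_table():
    """Q-table mapping a state label to one value per action, all starting at 0."""
    return defaultdict(lambda: {a: 0.0 for a in ACTIONS})


def choose_action(q_table, state, epsilon, rng=random):
    """Epsilon-greedy choice: explore with probability epsilon, else act greedily."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    values = q_table[state]
    return max(ACTIONS, key=lambda a: values[a])


def q_update(q_table, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """Standard Q-Learning update:
    Q(s, a) += alpha * (reward + gamma * max_a' Q(s', a') - Q(s, a))
    """
    best_next = max(q_table[next_state].values())
    td_target = reward + gamma * best_next
    q_table[state][action] += alpha * (td_target - q_table[state][action])
    return q_table[state][action]


def save_q_table(q_table, path):
    """Write the table to a file so that another agent can reuse the policy."""
    with open(path, "w") as f:
        json.dump({state: values for state, values in q_table.items()}, f)


def load_q_table(path):
    """Read a table written by save_q_table (state labels must be strings)."""
    q_table = make_q_table()
    with open(path) as f:
        for state, values in json.load(f).items():
            q_table[state].update(values)
    return q_table
```

Under these assumptions, a table trained in simulation would be written out with `save_q_table`, read on the robot with `load_q_table`, and then refined by continuing to call `q_update` on real sensor readings.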

Keywords/Search Tags:

Robot, Reinforcement learning, Real, Q-table

Related items

1	Making reinforcement learning work on real robots
2	Research On Locomotion Control And Sim-to-real Of Quadruped Robot Based On Deep Reinforcement Learning
3	Highly-efficient Robot Self-learning With Deep Reinforcement Learning
4	Reinforcement Learning And Its Application In Robot System
5	Reinforcement learning for mobile robot controllers: Theory and experiments
6	Learning To Return Table Tennis Ball For Robots
7	Research On Multi-Robot Cooperation Control Method Based On Reinforcement Learning
8	Reinforcement Learning Based Research On Robot Foraging Problem
9	Research On Real-time Data Transmission Scheduling Algorithm Based On Reinforcement Learning For WSNs
10	Detection Based On Reinforcement Learning For Mobile Robot Navigation And State Of The Environment