Reinforcement learning for robots using neural networks

Posted on:1993-02-11

Degree:Ph.D

Type:Dissertation

University:Carnegie Mellon University

Candidate:Lin, Long-Ji

Full Text:PDF

GTID:1478390014995330

Subject:Artificial Intelligence

Abstract/Summary:

Reinforcement learning agents are adaptive, reactive, and self-supervised. The aim of this dissertation is to extend the state of the art of reinforcement learning and enable its applications to complex robot-learning problems. In particular, it focuses on two issues. First, learning from sparse and delayed reinforcement signals is hard and in general a slow process. Techniques for reducing learning time must be devised. Second, most existing reinforcement learning methods assume that the world is a Markov decision process. This assumption is too strong for many robot tasks of interest.;This dissertation demonstrates how we can possibly overcome the slow learning problem and tackle non-Markovian environments, making reinforcement learning more practical for realistic robot tasks: (1) Reinforcement learning can be naturally integrated with artificial neural networks to obtain high-quality generalization, resulting in a significant learning speedup. Neural networks are used in this dissertation, and they generalize effectively even in the presence of noise and a large of binary and real-valued inputs. (2) Reinforcement learning agents can save many learning trials by using an action model, which can be learned on-line. With a model, an agent can mentally experience the effects of its actions without actually executing them. Experience replay is a simple technique that implements this idea, and is shown to be effective in reducing the number of action executions required. (3) Reinforcement learning agents can take advantage of instructive training instances provided by human teachers, resulting in a significant learning speedup. Teaching can also help learning agents avoid local optima during the search for optimal control. Simulation experiments indicate that even a small amount of teaching can save agents many learning trials. (4) Reinforcement learning agents can significantly reduce learning time by hierarchical learning--they first solve elementary learning problems and then combine solutions to the elementary problems to solve a complex problem. Simulation experiments indicate that a robot with hierarchical learning can solve a complex problem, which otherwise is hardly solvable within a reasonable time. (5) Reinforcement learning agents can deal with a wide range of non-Markovian environments by having a memory of their past. Three memory architectures are discussed. They work reasonably well for a variety of simple problems. One of them is also successfully applied to a nontrivial non-Markovian robot task.;The results of this dissertation rely on computer simulation, including (1) an agent operating in a dynamic and hostile environment and (2) a mobile robot operating in a noisy and non-Markovian environment. The robot simulator is physically realistic. This dissertation concludes that it is possible to build artificial agents than can acquire complex control policies effectively by reinforcement learning.

Keywords/Search Tags:

Reinforcement learning, Agents, Dissertation, Robot, Neural, Complex

Related items

1	Integrating complexity science and artificial intelligence: GIS, agents and reinforcement learning for modeling forest cover change
2	Reputation-oriented reinforcement learning strategies for economically-motivated agents in electronic market environments
3	Reinforcement Learning And Its Application In Robot System
4	A Mobile Robot Control Method In Complex Scenes Based On Deep Reinforcement Learning
5	Research On Adaptive Software Model Of Mobile Robot Based On Reinforcement Learning
6	The Research On Autonomous Mobile Robot Navigation Based On Reinforcement Learning
7	Theorization, implementation, system architecture, and analysis of fast reinforcement learning techniques, with application to autonomous agents
8	Control Design And Research Of Flapping Wing Flight Robot Based On Reinforcement Learning
9	Research On Robot Grasping Based On Reinforcement Learning With Dynamic Motion Primitive
10	Obstacle Avoidance Skill Learning Algorithm For Home Service Robot Based On Improved Deep Reinforcement Learning