Imitation Learning Based On Generative Adversarial Network

Posted on:2020-02-25

Degree:Master

Type:Thesis

Country:China

Candidate:Y T Li

Full Text:PDF

GTID:2428330590473975

Subject:Control Science and Engineering

Abstract/Summary:

PDF Full Text Request

Machine learning uses data or past experience to improve the performance of computer programs.It is considered as an important way to achieve artificial intelligence,and has received a lot of attention and extensive applications in the fields of computer vision,data mining,and natural language processing.With data sampled from expert guidance,imitating learning can get a direct reference to each step of decision and learn the end-to-end strategy model.Thus,imitation learning has received extensive attention from researchers in recent years.The generative adversarial networks(GAN)has become a hot research direction in the field of artificial intelligence.We will use the generative adversarial networks to realize the imitation learning,avoid the acquisition of reward function and realize the end-to-end learning from expert trajectory to strategy.We mainly analyze the implementation ideas of traditional imitation learning algorithms,exploit the metrics that can uniquely determine the strategy,and give the feasibility of implementing imitation learning through the metrics.We exploit the consistency of the generated model objective function and the simulated learning objective function in the generative adversarial networks,realize the optimization process of the imitation learning algorithm by the combination of the adversarial network and the generative network.Thereby imitation learning is linked with the generative adversarial networks.The traditional generation of GAN has certain defects.We analyze the characteristics and reasons of defects,then uses the WGAN to realize imitation learning.By determining the new objective function,the imitation learning algorithm can avoid the defects caused by traditional GAN.We discusses the update process of the strategy model in the process of training network,adopting the algorithm that can update the strategy model.We uses OpenAI's simulation physics engine Mujoco as a simulation platform.By performing simulation learning algorithms on multiple tasks provided in Mujoco,this paper verifies the feasibility of using the GAN idea to realize imitation learning and realize the end-to-end learning from expert trajectory to strategy.

Keywords/Search Tags:

generative adversarial networks, imitation learning, reinforcement learning, generative models

PDF Full Text Request

Related items

1	Study On The Generative Adversarial Imitation Learning Based On State Features
2	End-To-End Active Tracking System Via Deep Reinforcement Learning
3	Optimization For Generative Modeling And Its Applications In Imitation Learning
4	Research And Implementation Of Imitation Learning For Complex Tasks In Large-scale Environments
5	Research On Robotic Generation Method Of Chinese Calligraphy Strokes Based On Generative Adversarial Networks And Inverse Reinforcement Learning
6	Research On Image Captioning By The Method Of Generative Adversarial Networks
7	The Research Of Generative Algorithms Based On Adversarial
8	Research On Transfer Learning Based On Generative Adversarial Networks
9	Data Augmentation For Chinese Language Models Based On Generative Adversarial Networks
10	Research On Generative Adversarial Networks-Based Terrain Mapping