Imitation Learning Based On Generative Adversarial Nets With Multiple Kinds Of Demonstrations

Posted on:2020-08-03

Degree:Master

Type:Thesis

Country:China

Candidate:J H Lin

Full Text:PDF

GTID:2428330578979393

Subject:Management Science and Engineering

Abstract/Summary:

PDF Full Text Request

In recent years,the field of artificial intelligence has paid more and more attention to how to learn decision models that are similar to or even better than human.Imitation learning is a feasible method to solve decision-making problems.Imitation learning refers to learning from expert decision data to obtain decision models that close to expert.Generative adversarial imitation learning(GAIL)is an emerging imitation learning method.It achieves better robustness,representation capability and computation efficiency,and is able to handle complicated,large-scale problems and applicable in realistic tasks.However,GAIL has strong limitations on the assumption of expert demonstrations.It assumes that the expert sample is simplex and perfect.Due to the different preferences of individual experts and the possibility of making error,this assumption is difficult to be satisfied in practical application.In order to extend GAIL to more practical applications,this paper relaxes the limitation on the assumption of demonstrations,and proposes two imitation learning method based on GAIL with multiple kinds of demonstrations.The main research includes the following two parts:i.Generative adversarial imitation learning with auxiliary classifier is proposed.To deal with the situation where there are multiple kinds of demonstrations,this research adds an auxiliary classifier to the original generative adversarial imitation learning method,and proposes the algorithm of generative adversarial imitation learning with auxiliary classifier.The experimental results on the simulation environment show that the algorithm is able to learn the category of the demonstrations by leveraging the auxiliary classifier.Thus it can achieve imitation learning with multiple kinds of demonstrations.Moreover,eomparing to an existing unsupervised method,it achieves better accuracy and effectiveness.ii.Generative adversarial imitation learning with failure demonstrations.The existence of failure demonstrations is a special situation of multiple kinds of expert demonstrations.To solve this problem,this research proposes to eonstruct a memory pool to store and roll back failed samples,and reuses the failed samples by means of resampling.On this basis,a training algorithm based on the generative adversarial imitation learning with failure demonstrations is proposed.By reusing failure demonstrations,the method not only achieves better action success rates than experts,but also improves sample efficiency.Experiments show that this method can deal with the special multi-class sample imitation learning problem of expert samples with both successful and failed samples.

Keywords/Search Tags:

Imitation learning, Generative adversarial nets, Multiple kinds of demonstrations, Failure demonstrations

PDF Full Text Request

Related items

1	Research On Data Efficient Third-person Imitation Learning Methods
2	Imitation Learning Based On Generative Adversarial Network
3	Robot life-long task learning from human demonstrations: A Bayesian approach
4	Study On The Generative Adversarial Imitation Learning Based On State Features
5	Research On Learning From Demonstrations And Intelligent Control Methods For Robotic Manipulation
6	Demonstrations With Dynamic Bonus For Deep Reinforcement Learning
7	Research And Implementation Of Imitation Learning For Complex Tasks In Large-scale Environments
8	Motion Imitation Learning And Execution For Robot Manipulators
9	Signal Reconstruction Based On Generative Adversarial Networks
10	Single And Multiple Modal Image Transformation Based On Generative Adversarial Learning