Research On Image Captioning By The Method Of Generative Adversarial Networks

Posted on:2021-02-15

Degree:Master

Type:Thesis

Country:China

Candidate:H Li

Full Text:PDF

GTID:2428330623467789

Subject:Computer Science and Technology

Abstract/Summary:

Image captioning is an important task in the area of computer vision and aitificial intelligence.Depending on both visual and linguistic understanding,it generates descriptive sentences for the related image.The generated sentences should not only to accurately describe the image,but also be more natural for human reading.The traditional models only focus on the accuracy and fidelity of generated sentences,but lack the diversity and disctinctiveness,so the generated sentences are monotonous.In this paper,we use the method based on Generative Adversarial Networks(GAN)to solving the problem caused by Maximizing Likelihood Estimation(MLE).Because of the property of randomness in generating of GAN,our proposed model could generate more diverse and distinctive image captions.Moreover,to guarantee the accuracy of the generated captions,we use some external text data to train the Discriminator in our model.In our method,the external text data are captions that in the same semantic but in other language.Therefore,captions generated by our model are diverse and accurate.Our contribution are as follows: 1.We propose a novel model based on GAN,which use some external text data to train the discriminator,so the generated captions are diverse and accurate.2.Our model yields a new evaluation metric,which is stronger than other metrics in a comprehensive way.3.The resulst on various experiments show that our model consistently outperforms other traditional models.

Keywords/Search Tags:

Image Captioning, Maximizing Likelihood Estimation (MLE), Generative Adversarial Networks(GAN), Reinforcement Learning(RL), Deep Learning

Related items

1	Research On Image Captioning Algorithms Based On Deep Learning
2	Research And Application Of Deep Reinforcement Learning Based On Generative Adversarial Networks
3	Image Feature Understanding And Semantic Representation Based On Deep Learning
4	Image Captioning Based On Generative Adversarial Network With Temporal Attention
5	Research On Image Aesthetic Description Method Combined With Image Captio
6	Image Caption Generation Based On Generative Adversarial Networks
7	Video Captioning With Adversarial Reinforcement Learning
8	Research On Text Generation Of An Image Based On Generative Adversarial Networks
9	End-To-End Active Tracking System Via Deep Reinforcement Learning
10	Automatic Auido Captioning Based On Reinforcement Learning