Research On Image Captioning Algorithm Based On Scene Graphs

Posted on:2023-03-20

Degree:Master

Type:Thesis

Country:China

Candidate:C H Zhang

Full Text:PDF

GTID:2568306827475354

Subject:Software engineering

Abstract/Summary:

Taking an image as input,automatically generating meaningful text description by computer,is called image captioning.Because of its location at the intersection of computer vision and natural language processing and its wide application prospect,more and more researchers are working to it.Image captioning is one of the research hotspots in recent years.The scene graphs annotate the semantic relationships between objects in the image.By generating the scene graphs of the image,we can introduce the guidance of the relationships between objects into the image captioning model to enhance the region-level features,which is conducive to reasoning out the correct text description.However,the existing scene graph generation models inevitably predict a large number of redundant and noisy relations,which has a great negative impact on the image captioning task.In order to effectively utilize the semantic relations that play a positive role in the generated description in the scene diagram and reduce the interference of noise relations,after constructing the scene semantic graphs of the image,a gated graph attention encoder is proposed in this paper,which combines the attention mechanism and the gated mechanism to automatically focus on the relations useful for generating descriptions and aggregate these relations to generate region-level features of relation perception.Specifically,the attentional mechanism assigns weights to a set of relationships in the input to distinguish useful and useless relationships.Gating mechanism re-evaluates the exploitable value of the relationship after attention so as to reduce the impact of redundant relationship on description generation.In addition,at the decoder for generating descriptions,a global adaptive attention module is designed,which makes comprehensive use of both global and region-level features to guide description generation.Finally,extensive experiments are carried out on the popular m S-COCO benchmark of image description generation dataset.Experimental results show that the proposed model is superior to the latest methods that introduce semantic relations to guide image description generation.The validity of each module in the model was verified by ablation experiments.

Keywords/Search Tags:

Image Captioning, Scene Graph, Gated Graph Attention Network

Related items

1	Knowledge Scene Graph And Topic Correlation Graph For Image Captioning
2	Scene Graph With 3D Information For Change Captioning
3	Research On Visual Semantic Graph Construction And Its Application In Image Captioning
4	Research On Method Of Robot Vision Scene Understanding
5	Research And Implementation Of Scene Graph Retrieval Method Based On Graph Theory
6	Research And Application Of Recommendation Algorithm Based On Graph Neural Network
7	Research On Image Caption Algorithm Based On Graph Convolution Network
8	Research Of Scene Graph Generation Method Based On Object Relation Enhancement
9	Fine-grained Image Generation Model Based On Scene Graph
10	Research And Implementation Of Scene Graph Generation Algorithm Based On Attention Mechanism