Research On Image Description Algorithm Fusion With Local Semantic Information

Posted on:2020-01-21

Degree:Master

Type:Thesis

Country:China

Candidate:X Liu

Full Text:PDF

GTID:2428330575495220

Subject:Signal and Information Processing

Abstract/Summary:

Image description technology converts images to text, realizing cross-modal conversion of information. It is widely used in human-computer dialogue, image-text search, children's education, and life support for visually impaired people. With the advancement of communication technology, image data has come to be widely distributed and disseminated on the Internet, and how to describe image content automatically in natural language has become a hot research topic. This thesis addresses the automatic generation of image descriptions from three angles: a method based on sentiment representation, a method based on local spatial semantic information, and a sentiment analysis method based on image description. The research work mainly includes:

(1) An image description method based on sentiment representation is proposed. The method builds on the encoder-decoder model, using a convolutional neural network for image feature extraction and an LSTM for sentence generation. First, existing tools are used to extract the sentiment representations in the image (including visual semantics and expressions) and their corresponding rectangular bounding boxes. The visual semantic information and the expression information are then represented as vectors, mapped to specific dimensions, and fed to the LSTM as additional inputs during training and prediction. In this way, the generated sentences carry emotional coloring. The experimental results show that the method effectively improves the accuracy of image description and makes the generated sentences more emotional.

(2) An image description method based on local spatial semantic information and global information is proposed. First, an existing object detection model is used to extract the objects in the image and their corresponding rectangular bounding boxes. An attention model then assigns a different weight to each bounding box, so that the input of the bidirectional grid LSTM (bi-Grid LSTM) is dynamically weighted and the model focuses on different regions at different time steps. The experimental results show that this method effectively alleviates the tendency of the model to miss small-area targets in image description, and that it performs better than current methods.

(3) A sentiment analysis method for social network data based on image description is proposed. First, an image description model is trained as an image feature extractor, and the description sentences it generates are passed through a single convolution to serve as image features. The text feature vector is extracted by multi-layer convolution. The image feature vector is then concatenated with the text feature vector and passed into a fully connected layer for prediction, so that the sentiment tendency of social network data is recognized automatically. Experiments show that this method achieves better performance than similar methods in the field.
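The region weighting described in (2) can be illustrated with a minimal sketch. The abstract does not state how the attention scores are computed, so the dot-product scoring below is an assumption, and the names `softmax`, `attend`, `region_features`, and `hidden_state` are hypothetical. The bi-Grid LSTM itself is not implemented here; the sketch only shows how one time step could turn per-box features into a single weighted input.

```python
import math


def softmax(scores):
    """Normalize raw scores into weights that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(region_features, hidden_state):
    """Weight each bounding-box feature by its relevance to the decoder state.

    Assumption: relevance is a plain dot product between the region feature
    and the current hidden state. The thesis may use a different scorer.
    """
    scores = [
        sum(f * h for f, h in zip(feature, hidden_state))
        for feature in region_features
    ]
    weights = softmax(scores)
    dim = len(region_features[0])
    # The context vector is the weighted sum of the region features.
    context = [
        sum(w * feature[d] for w, feature in zip(weights, region_features))
        for d in range(dim)
    ]
    return weights, context


# Three toy region features (one per detected box) and one decoder state.
regions = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
weights, context = attend(regions, [2.0, 0.0])
```

Recomputing `weights` at every decoding step with the updated hidden state is what lets the focus move between regions over time; a small box can receive a large weight when it is relevant, regardless of its area.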

Keywords/Search Tags:

Image Description, Image Understanding, Local Semantic Features, Sentiment Analysis, LSTM

Related items

1	Image Sentiment Analysis Integrating Global And Local Features
2	Research On Image Semantic Classification Techniques In Content-Based Image Retrieval
3	Study On Local Invariant Features In Image Classification
4	Research On Image Feature Extraction Method For Design Patent Image Retrieval
5	Image Scene Understanding Based On Deep Learning Fusion Model
6	Research On Semantic Description Of Commodity Image Based On Target Word Vector
7	A Study On Image Retrieval Based On Semantic Understanding
8	The Study Of Image Objects Description And Its Completeness Based On Local Features
9	Image Semantic Segmentation Algorithms Based On Feature Fusion And Non-local Features
10	Research And Application Of Sentiment Analysis Based On Image-text Fusion