Zero-shot Learning Image Target Recognition Based On Deep Learning

Posted on:2020-05-29

Degree:Master

Type:Thesis

Country:China

Candidate:Y J Du

Full Text:PDF

GTID:2428330575456506

Subject:Information and Communication Engineering

Abstract/Summary:

PDF Full Text Request

In recent years,with the rapid development of deep convolutional neural networks,image recognition methods based on supervised learning have made great progress.However,in order to achieve excellent classification accuracy,a large number of manual annotated data are needed,and the learnt classifier cannot effectively migrate to scenes with no annotated pictures.Therefore,how to reduce the manual work on data and how to make the model adapt to the emerging new samples quickly,become an urgent problem to be solved.Zero-shot learning is an effective solution to this problem,which aims to make the model have the ability to identify data categories that have never been seen before.At present,the resear-ch on zero-shot learning image recognition has made some progress in classification model and category definition,but there are still some shortcomings.On the one hand,the classification model itself is not expressive enough.On the other hand,the category definition does not make full use of the auxiliary information such as external description or knowledge.In order to solve these problems,this paper will make some improvements on classification model and category definition,mainly including the following innovations:(1)To solve the problem of insufficient constraints on mapping space in mapping-based classification model,a semantic mapping model SEMIIR which based simultaneously on inter-class and intra-class constraints is proposed in this paper,which optimizes the distance within and between classes at the same time.(2)In order to solve the ambiguity and non-visual problems in category definition based on word vectors,this paper proposes a semantic embedding algorithm VMSE based on visual modification,which modifies category vectors by introducing external descriptions.(3)Facing the problem of underutilization of prior knowledge,this paper proposes a residual graph convolutional network model ARGCN-DKGs with attention mechanism based on different types of knowledge graphs.By introducing residual mechanism and attention mechanism,and integrating different knowledge graphs,knowledge transfer between different categories can be realized.The three improved models proposed in this paper are applied to many standard datasets,and the experiment results verify the effectiveness and superiority of the proposed models in a variety of evaluation indicators.

Keywords/Search Tags:

image recognition, zero-shot learning, deep learning, cross-modal mapping, prior knowledges

PDF Full Text Request

Related items

1	Deep Ranking For Zero-Shot Multi-Label Image Classification
2	Zero-shot Image Classification Based On Cross-modal Semantic Alignment
3	A Study On Few-shot Learning Of Non-independent Identically Distributed Data
4	Zero-Shot Learning For Deep Cross-Modal Hashing
5	Zero-Shot Learning Based On Cross-Modal Feature Synthesis
6	Zero-shot Image Classification Based On Deep Learning And Knowledge Mining
7	Research On The Method Of Few-shot Deep Learning For Image Recognition
8	Research On Few Shot Image Recognition Based On Semantic Prior
9	Study On Few-shot Learning Based On Deep Learning For Image Classification
10	Research On Several Issues Of Image Generation And Recognition Based On Deep Learning