Zero Shot Latent Space Mapping Method Based On Bimodal Dictionary And Zero Shot Adversarial Generation

Posted on:2020-04-20

Degree:Master

Type:Thesis

Country:China

Candidate:X S Tan

Full Text:PDF

GTID:2428330590960937

Subject:Electronics and communications engineering

Abstract/Summary:

PDF Full Text Request

Image classification algorithms based on supervised learning require sufficient training samples with labels,but it is impractical to collect sufficient samples for each category.ZeroShot Learning(ZSL)uses modal information shared by categories to transfer discriminant knowledge,which can provide classification basis for new categories.In this dissertation,we focus on visual and semantic bimodal dictionary mapping and zero shot generation method in ZSL image classification.Main works are as follows:Ainming at the problem that visual and semantic features have structural dismatch,which makes the direct mapping difficult.A ZSL image classification method based on local sensitive double dictionary learning is proposed.The visual and semantic association is established by jointly optimizing the latent feature learning and the cross-modal mapping.Since dictionaries and cross-modal mapping functions are learned under image visual and semantic features respectively,the complementary information of visual and semantic is compatible in latent space,which reduces the structural differences.In the double dictionary learning,the local structure information is vital,a local sensitive regularization is proposed to preserve local structure information.In addition,to alleviate the problem of projection domain shift,a domain adaptation based on self-learning is proposed,which uses the structural information of training data and test data to improve the generalization of the model.Experimental results in AWA databases show that,compared with the SJE algorithm,the proposed method is increased by 12.4% in classification accuracy.Aiming at the problem that the classification effect is biased due to the missing data of new classes,a visual feature generation method based on conditional variational autoencoder and adversarial learning is proposed.Firstly,the latent probability distribution of the visual features conditioned on the semantic imformation is modeled by conditional variational autoencoder.Then adversarial training is adapted to enhance the similarity between the generated data and real data.To further improve the discriminative of generated data,a classification loss is adopted to classify the generated data into corresponding class,thus increasing the separability between different classes.Experimental results in AWA database show that compared with the ALE algorithm,the proposed method is increased by 7.7% in classification accuracy.

Keywords/Search Tags:

zero-shot learning, knowledge transfer, double dictionary, locality sensitive, advertising learning

PDF Full Text Request

Related items

1	Research On Key Technologies For Zero-Shot Learning
2	Research On Zero-Shot Image Learning
3	Deep Learning-based Few-shot Learning In Computer Vision
4	Research On Relation Classification Based On Few-Shot Learning
5	Research On Approaches To Knowledge Transfer In Domain Adaptation And Zero-shot Learning
6	Research On Zero-shot Learning Techniques And Their Applications In Semantic Segmentation
7	Robust Classification And Ranking Methods Based On Metric Learning And Knowledge Transfer
8	Research On Transfer Learning Based Relation Extraction Methods
9	Zero-Shot Learning Based On Reverse Prediction
10	Discriminant Self-adapted Locality-sensitive Sparse Representation For Video Semantic Analysis