Study Of Zero-Shot Object Detection Algorithm

Posted on:2021-05-10

Degree:Master

Type:Thesis

Country:China

Candidate:Y L Shao

Full Text:PDF

GTID:2428330623469210

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

Fast-developing deep learning techniques and large-scale labeled datasets have promoted the advancement of computer vision tasks,such as image classification,object detection and image segmentation.The performance of sophisticated CNN models relies heavily on labeled training data.However,the difficulty in collecting and annotating images of all the concepts that beyond daily objects hinders their application in a larger scale.To address this issue,some work has recently been done on zero-shot recognition(ZSR),which can identify completely unseen classes by using the intermediate semantic knowledge of class labels,such as the typical attributes and word vectors.Taking the classification a step further,the closely related object detection problem has remained almost unexploited in zero-shot setting,namely zero-shot detection(ZSD).Its goal is to simultaneously detect and locate each individual instance of unseen object class,even in the absence of any visual examples of those classes during the training phase.At present,several researchers have proposed some methods,trying to tackle the problem by directly using ZSR algorithms in object detection model.It works but the results are not satisfactory.Actually,researchers used to pay their attention on unseen object recognition and ignore the significance of transferability in object proposal.As a detection problem,the first to solve is how to localize the target,that is,to distinguish regions of unseen objects from backgrounds.This is the key difference between ZSD and ZSR and one of the difficulties of ZSD task.Being a challenging realistic problem,ZSD is the topic of increased interest within the community.The main work of this paper is devoted to improving the performance of zero-shot detection.In the early stage,basic research is conducted to get an overview of this field so as to grasp the core of this problem.By analyzing the internal mechanism of generic detector,we promote two novel algorithms to enhance the transferability of object proposal and optimize category prediction,respectively.The contributions of this paper are threefold as follows:i.Summarize the research on zero-shot recognition and object detection,as wellas the recent progress on zero-shot detection,and systematically define ZSDproblem and describe its process.We also highlight the critical technologies inalgorithms,such as visual feature extraction,region proposal and visual-semantic mapping,where our new algorithms are based.ii.Propose the algorithm of confidence distribution to enhance the transferabilityof zero-shot object proposal.Confidence distribution encourages the model toevaluate the likelihood of all the categories by associating the occurrence ofseen and unseen objects,making it transferable in object proposal.The methodincreases the recall of unseen objects during testing,so that zero-shot detectorsachieve better performance.iii.Propose the algorithm to optimize category prediction in zero-shot detectionmodels based on the superclass guidance.Contextual features extracted withdilated convolution layers are leveraged to predict superclass,which is aneffective guidance for final classification.Compared with predicting unseenclasses with merely semantic embeddings,this method gains improvements onthe category prediction step.

Keywords/Search Tags:

zero-shot learning, object detection, zero-shot detection, transfer learning, confidence distribution, semantic knowledge, context, superclass

PDF Full Text Request

Related items

1	Research On Few-shot Object Detection Based On Deep Learning
2	Research On Few-shot Object Detection Algorithm
3	Research On Key Technologies For Zero-Shot Learning
4	Research On Few Shot Object Detection And Recognition
5	Zero-Shot Learning Based On Reverse Prediction
6	Zero Shot Latent Space Mapping Method Based On Bimodal Dictionary And Zero Shot Adversarial Generation
7	The Research And Implementation Of One-shot Learning In Object Detection And Visual Tracking
8	Research On Zero-shot Learning Techniques And Their Applications In Semantic Segmentation
9	Few Shot Learning Algorithm Based On Semantic Information
10	Research On Few-shot Semantic Segmentation And Multi-object Tracking