Research On Multi-index Combined Active Learning For Image Classification

Posted on:2022-10-07

Degree:Master

Type:Thesis

Country:China

Candidate:X S Jiang

Full Text:PDF

GTID:2518306722992669

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

The progress of deep learning applications is partly due to the large amount of annotated data.However,it is not easy to obtain a large amount of labeled data in many application scenarios,and active learning is one of the effective methods to alleviate this problem.Active learning methods based on pre-designed selection strategies can obtain higher-performance models with less labeled data,but a single selection strategy cannot always maintain high efficiency throughout the learning stage.It is more reasonable to use complementary multi-index sampling strategies.This paper combines the classic uncertainty sampling strategy and the source domain difference sampling strategy to construct a dynamic combination strategy,and proposes a new active learning method CDU(active learning method based on Combining source-domain Difference and target-domain Uncertainty,referred to as CDU).The source domain differential sampling strategy,which does not require the source task sample data,but only the source task model and the source task sample feature Gaussian mixture model,can solve the problem of low learning efficiency of the target task model in the initial stage of transfer learning(in real application of transfer learning,it is usually easy to obtain the source task model,but difficult to obtain the source task data for comparison with the target task samples,which makes it possible to repeatedly label and learn samples that is similar or even the same as the source task samples during the process of fine-tuning the target task model,resulting in waste of labeling costs and low learning efficiency).The CDU method first uses source task model and source task sample feature Gaussian mixture model to calculate the difference between target task samples and source task samples(source domain difference);then the source task model is used as the target task initial model to start active learning iteration;In the process of active learning iteration,the scores based on the dynamic integration of source domain differences and target domain uncertainty are used to screen out the most valuable target task samples for labeling and learning of the model.In addition,in view of the problem of unbalanced samples among categories when using actively learning to select samples in multi-category classification,the CDU method introduces high-score-and-random hybrid sampling to select samples.To prove the validity and rationality of the CDU method,experiments were carried out on the cross-domain transfer data sets of cat and dog recognition and handwritten digit recognition.The experimental results show that,compared with baseline methods,the CDU method can improve the learning efficiency of the target model in the initial stage of transfer learning,make the model more efficient in the entire learning stage,and effectively reduce the cost of labeling.

Keywords/Search Tags:

deep learning, active learning, uncertainty, source domain difference

PDF Full Text Request

Related items

1	Research On Multi-index Combined Active Learning For Image Classification
2	Research And Applications Of Uncertainty In Bayesian Deep Label Distribution Learning
3	A Study On The Theory And Application Of Active Learning For Small Size Samples
4	Collaborative Query Of Adversarial And Uncertain Samples For Deep Active Learning
5	Research On Active Learning Algorithm Based On Reinforcement Learning
6	Research On Deep Active Learning Technology Based On Unsupervised Feature Matching
7	Transductive Transfer Classification With Active Learning From Source Domain
8	Research On Domain Adaptation Method Based On Deep Learning
9	Study Of Active Learning Algorithms Based On Uncertainty
10	A Remote Sensing Image Classification Based On Active Deep Learning