Noisy Label Learning Methods Via Sample Selection

Posted on:2023-12-31

Degree:Master

Type:Thesis

Country:China

Candidate:H Yang

Full Text:PDF

GTID:2568307061953889

Subject:Software engineering

Abstract/Summary:

PDF Full Text Request

In recent years,deep neural networks have achieved impressive success in image classification,semantic segmentation and many other tasks.At present,the success of deep neural networks generally relies on a large number of high-quality labeled samples.In other words,the labeling information of training example is assumed to be accurate and complete.However,it is extremely expensive and time-consuming to label high-quality datasets,thus deep models are usually trained on data with lots of corrupted labels.Noisy label learning via sample selection aims to distinguish clean samples and noise samples from noisy datasets,and then utilizes them separately to improve the robustness of the model.In this paper,we investigate the problem of noisy label learning via sample selection and propose two novel methods accordingly.On one hand,numerous researches have proved that deep neural networks can fit all training data in the end even given data with noisy labels,and thus result in poor generalization performance.In this paper,a novel noisy label learning method via Dynamic Loss Thresholding(DLT)is proposed.During the training process,through recording the loss value of each sample and calculating dynamic loss thresholds,DLT compares the loss value of each sample with the current loss threshold and extracts clean samples for the next step of training.Experiments on several image classification datasets demonstrate substantial improvements over recent state-of-the-art methods.On the other hand,by introducing self-training semi-supervised learning techniques,a novel noisy label learning method named Semi DLT is proposed to accommodate unlabeled data exploitation.The proposed method discards the corrupted labels of potentially noisy samples and regards them as unlabeled data.After that,Semi DLT generates pseudo-labels for the unlabeled data,and utilizes the Mixup techniques for data augmentation.The loss function is instantiated by integrating cross entropy and mean square error.Experimental results on several benchmark datasets show that the proposed method can take advantage of unlabeled data effectively and improve the classification performance.This paper consists of four chapters.The first chapter introduces the background,related work and the problems to be solved.The second chapter introduces the noisy label learning method named DLT via dynamic loss thresholding.The third chapter introduces the noisy label learning method Semi DLT via unlabeled data exploitation.The fourth chapter summarizes the thesis.

Keywords/Search Tags:

Weakly Supervised Learning, Noisy Label Learning, Sample Selection, Semi-Supervised Learning

PDF Full Text Request

Related items

1	Research On Weakly-supervised Learning Based On Sample Selection Strategy And Contrastive Learning
2	Research On Weakly Supervised Learning Based On Controlled Random Walk Model
3	Research On The Scheme Of Optimizing The Performance Of Classification Model In Weakly Supervised Learning
4	Research On Partially Labeled Problem Based On Active Learning And Semi-supervised Mechanism
5	Research On Weakly-supervised Classification Methods Based On Samples And Labels Modeling
6	Robust Image Classification Algorithms With Weakly Supervised Learning
7	Partial Label Learning Algorithms Based On Maximum Margin And Semi-supervised Learning
8	Research On Weakly Supervised Label Distribution Learning Metho
9	Research And Implementation Of General Weakly Supervised Object Detection Algorithm Based On Deep Learning
10	Research On The Utilization Techniques Of Partial Label Data