Supervised Contrastive Learning-Based Long-Tailed Image Recognition

Posted on:2023-08-04

Degree:Master

Type:Thesis

Country:China

Candidate:X B Ma

Full Text:PDF

GTID:2568306827474834

Subject:Computational Mathematics

Abstract/Summary:

PDF Full Text Request

Deep learning is a branch of machine learning,which acquires the inherent laws and representation levels of data through hierarchical learning.At present,deep learning is widely used in computer vision,natural language processing,speech recognition,quantum chemistry,medicine and other fields,especially in image recognition.This is inseparable from largescale,high-quality datasets and GPU performance improvements.In traditional image recognition tasks,the class distribution of training data is often artificially influenced,that is,the number of samples of different classes is basically the same.However,datasets in practical application scenarios usually present a long-tailed distribution,which poses a great challenge for image recognition tasks,that is,deep models trained on long-tailed datasets perform poorly on tailed categories.How to use long-tailed data to train classification models that perform well in various categories has become one of the research hotspots in the field of computer vision and deep learning.This paper studies an improved method for long-tailed image classification based on deep learning.The main work is as follows:1.This paper designs a convolutional neural network loss function for long-tailed image classification.The most commonly used loss function in image classification neural networks is the cross-entropy loss.While this loss function performs well on balanced datasets,on long-tailed datasets,this loss function causes model performance to be biased towards the head category,while the tail category performs extremely poorly.To solve this problem,this paper proposes a new loss function.This loss function not only improves the representation ability of the feature extraction network during training,but also balances the performance of the classifier on each class.Specifically,the loss function not only enables the model to learn the representative features of each class of images,that is,the features of the same class are similar and the features of different classes are far apart,but also makes the model pay more attention to the tail class.2.This paper adopts the post-hoc tuning strategy of convolutional neural network for long-tailed image classification.At present,most of the datasets used in long-tailed recognition work are training sets with long-tailed distribution and validation and test sets that satisfy uniform distribution.Although the existing resampling and reweighting strategies improve the model performance to a certain extent,they do not solve the contradiction between the training phase and testing phase objectives of long-tailed learning.In order to solve this problem,according to the idea and method of optimal transport,we redistributed the labels of the network prediction results in the testing phase,so that the predicted label distribution is close to a uniform distribution.Finally,this paper evaluates the proposed method on four long-tailed datasets,and the evaluation results show that our method is more effective than most long-tail classification methods.

Keywords/Search Tags:

Long-Tailed Recognition, Supervised Contrastive Learning, Deep Learning

PDF Full Text Request

Related items

1	The Research On Long-tailed Visual Recognition Algorithms
2	Research On Semi-Supervised Expression Recognition Algorithm Based On Contrastive Learning
3	Research On Long-tail Recognition Based On Deep Learning
4	Research On Image Classification For Long-Tailed Data
5	Research On Self-supervised Action Recognition Based On Contrast Learning
6	Research On Distantly-Supervised Long-Tailed Relation Extraction
7	Basic Algorithm And Framework Research On Contrastive Learning
8	Research On Self-Supervised Contrastive Learning Video Representation Model
9	Research On Self-Supervised Contrastive Learning Based Text Recognition
10	Research On Causal Representation Learning For Long-Tailed Visual Recognition Task