Research On Multi-Label Learning Under The Limitation Of Labeling

Posted on:2021-05-15

Degree:Master

Type:Thesis

Country:China

Candidate:N C Sun

Full Text:PDF

GTID:2518306548490824

Subject:Master of Applied Statistics

Abstract/Summary:

As data collection and label acquisition grow rapidly, it is increasingly common for an instance to be associated with more than one label. This differs from the traditional single-label setting, in which each sample has exactly one label. To address this setting, a learning paradigm known as multi-label learning has been widely investigated. However, as both the amount of data to be labeled and the number of labels to be acquired increase, it becomes more difficult to provide a complete set of labels for every instance. At the same time, experts can be asked to make targeted annotations, and prior knowledge drawn from historical observations, such as label proportion information, is often available. For the settings of limited labels and known label proportions, this thesis proposes corresponding multi-label learning algorithms. The main work is as follows:

(1) For the case where labeled data is limited, we combine the ECOC (Error Correcting Output Codes) mechanism with active learning and propose the MAOC (Multi-label Active Learning with Error Correcting Output Codes) algorithm. MAOC uses an ECOC classification model to predict labels and combines two strategies, prediction uncertainty and label base inconsistency, to select the most valuable unlabeled samples. Experts then annotate these selected samples, and the classification model is retrained on the enlarged labeled data set, which improves classification performance and efficiency. Experiments verify the effectiveness of the algorithm.

(2) For the case where the labeled data has missing labels and a label proportion constraint is available, and considering that label proportions are effective in limiting model flexibility, we propose the IMLLP (Incomplete Multi-label Learning with Label Proportion) algorithm, which is based on prior information about the data labels. IMLLP labels the unlabeled data and trains the classifier simultaneously. The labels of the unlabeled data are reconstructed through label consistency and the label proportion constraint. When the reconstructed complete labels are used to train the classifier, low-rank and regularization constraints are added to improve robustness. The approach handles the labeling of unlabeled data more effectively than traditional methods. Experimental results show that the algorithm outperforms the related methods it is compared with.
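
The abstract does not give the MAOC selection rule in detail, so the following is only a minimal sketch of the prediction-uncertainty part of such a strategy, not the thesis's algorithm. It assumes a classifier has already produced per-label probabilities for each unlabeled sample, scores each sample by its mean binary entropy across labels, and returns the highest-scoring samples for expert annotation. The function names `binary_entropy` and `select_most_uncertain` are hypothetical, and the ECOC model and the label base inconsistency term are deliberately left out.

```python
import numpy as np


def binary_entropy(p, eps=1e-12):
    """Entropy (in bits) of a Bernoulli variable with success probability p."""
    # Clip to avoid log(0) for probabilities of exactly 0 or 1.
    p = np.clip(p, eps, 1.0 - eps)
    return -(p * np.log2(p) + (1.0 - p) * np.log2(1.0 - p))


def select_most_uncertain(probs, k):
    """Pick the k unlabeled samples whose label predictions are least certain.

    probs: array of shape (n_samples, n_labels) holding predicted
           probabilities that each label is relevant (an assumed input).
    Returns (indices of the k selected samples, per-sample uncertainty scores).
    """
    probs = np.asarray(probs, dtype=float)
    # Average the per-label entropies so every sample gets one score.
    scores = binary_entropy(probs).mean(axis=1)
    # Sort by descending score; a stable sort keeps ties in input order.
    order = np.argsort(-scores, kind="stable")
    return order[:k], scores


if __name__ == "__main__":
    # Three unlabeled samples, two labels (illustrative values only).
    probs = [[0.5, 0.5], [0.99, 0.01], [0.6, 0.9]]
    chosen, scores = select_most_uncertain(probs, k=2)
    print("query these samples:", chosen.tolist())
```

In a full active learning loop, the selected samples would be sent to an expert, added to the labeled set, and the classifier retrained before the next round of selection.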

Keywords/Search Tags:

Multi-label, Classifications, Active Learning, Missing Labels

Related items

1	Research On Multi-Label Learning Under The Limitation Of Labeling
2	Missing Multi-label Learning For Label Semantic Space Mining
3	Research On Multi-label Learning With Missing Labels For Image Classification
4	Research On Multi-label Classification Algorithm With Label Correlations
5	Learning Label Correlations For Multi-label Classification
6	Multi-label Feature Selection Method In The Context Of Missing Labels
7	Robust Multi-Label Learning With Missing Label
8	Research On Multi-label Learning With Inaccurate Labels
9	Research On Weakly-supervised Classification Methods Based On Samples And Labels Modeling
10	Multi-Instance Multi-Label Learning Based On Neighborhood Consensus