
Research On Semi-Supervised Multi-Label Feature Selection Algorithm

Posted on: 2020-08-14
Degree: Master
Type: Thesis
Country: China
Candidate: L Q Wang
Full Text: PDF
GTID: 2428330590986881
Subject: Computer application technology
Abstract/Summary:
In the field of machine learning, traditional supervised learning assumes that each learning object corresponds to a single concept label. In real life, however, an object may be associated with multiple concept labels at the same time: a movie can be tagged as sci-fi, action, and American simultaneously, and a picture may be tagged with a log house, a tree, a lawn, a path, and so on. Multi-label learning is the learning framework for studying such tasks and has attracted a great deal of research interest. However, existing multi-label learning algorithms face two problems. On the one hand, the number of labels is large and their semantic information is complex, so annotating multi-label data costs substantial manpower and time, and it is difficult to obtain a large amount of labeled data. On the other hand, the feature set of multi-label data is high-dimensional, and irrelevant and redundant features damage the generalization performance of the classification model. It is therefore necessary to reduce the dimensionality of high-dimensional multi-label data.

To address these two problems, this thesis proposes a semi-supervised multi-label feature selection (SSMLFS) algorithm. The basic idea is to evaluate features, within a semi-supervised learning framework, by both the dependency between the original feature description and the related labels and the ability to preserve the local structure of the data. The main contents are as follows. First, based on HSIC (the Hilbert-Schmidt Independence Criterion), we compute the norm of the Hilbert-Schmidt cross-covariance operator on an RKHS (Reproducing Kernel Hilbert Space) to obtain the independence criterion, namely the empirical HSIC estimate, and we take its maximization as an optimization objective. Second, we consider labeled and unlabeled data simultaneously: we first construct a sample-based adjacency graph and then maximize the preservation of the samples' local structure. Finally, we compute the importance of each feature. Experimental results under six different evaluation criteria on six datasets verify the effectiveness of the SSMLFS algorithm.
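To make the first ingredient concrete, the empirical HSIC estimate mentioned above is commonly computed as tr(KHLH)/(n-1)^2, where K and L are kernel matrices over the features and the labels and H is the centering matrix. The sketch below is an illustrative NumPy implementation under that standard formulation, not the thesis's exact code; the Gaussian kernel and its bandwidth `sigma` are assumptions chosen only for the example.

```python
import numpy as np

def rbf_kernel(X, sigma=1.0):
    # Gaussian (RBF) kernel matrix from pairwise squared Euclidean distances.
    sq = np.sum(X ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * X @ X.T
    return np.exp(-d2 / (2.0 * sigma ** 2))

def hsic_empirical(K, L):
    # Empirical HSIC estimate: tr(K H L H) / (n - 1)^2,
    # where H = I - (1/n) 1 1^T centers both kernel matrices.
    n = K.shape[0]
    H = np.eye(n) - np.ones((n, n)) / n
    return np.trace(K @ H @ L @ H) / (n - 1) ** 2
```

A feature-selection score in this spirit evaluates each candidate feature subset by the HSIC between its kernel matrix and the label kernel matrix; a constant (uninformative) feature yields an all-ones kernel, which the centering matrix annihilates, so its HSIC score is zero.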
Keywords/Search Tags: multi-label learning, multi-label feature selection, semi-supervised learning, Hilbert-Schmidt independence criterion, locality preserving projection