Feature Selection Algorithm For Multi-label Learning

Posted on:2018-12-11

Degree:Master

Type:Thesis

Country:China

Candidate:J Y Ma

GTID:2348330542468725

Subject:Software engineering

Abstract/Summary:

Classification is one of the most active research topics in data mining. Multi-label classification was proposed to handle the growing volume of multi-label data, and it has been widely used in gene function detection, automatic annotation of multimedia content, and other fields. Multi-label feature selection eliminates the many redundant and irrelevant features in a classification task, reducing the number of features and improving the performance of the multi-label classifier. In this thesis, we design two feature selection algorithms from different angles: FI-ARML and Mult-ReliefF.

FI-ARML is a feature selection algorithm based on frequent itemsets; it uses association rules to find associations between attributes in the data. The algorithm improves on the multi-label feature selection algorithm based on neighborhood rough sets. It proceeds in four steps: first, construct frequent k-itemsets from the class labels; second, divide the training samples according to the label sets; third, compute a feature subset for each sub-sample; fourth, combine all the feature subsets to obtain the final feature set. Experiments show that FI-ARML greatly shortens feature selection time while achieving equivalent classification performance.

To address the limitation of the ReliefF algorithm to single-label data, we propose a multi-label feature selection algorithm named Mult-ReliefF. Mult-ReliefF redefines how in-class and out-of-class nearest neighbors are searched and updates the feature weight formula by adding the contribution value of the labels. Experiments show that Mult-ReliefF improves classification accuracy and obtains better feature subsets.
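
The four FI-ARML steps described above can be illustrated with a minimal sketch. This is not the thesis implementation: the function names, the `min_support` threshold, the brute-force itemset enumeration, and the rule that a sample joins a group when its label set contains the frequent itemset are all assumptions made for illustration. In particular, the per-group selector below is a trivial placeholder (it keeps features that vary within the group) standing in for the neighborhood rough set reduction that the abstract refers to.

```python
from itertools import combinations


def frequent_label_itemsets(Y, k, min_support):
    """Step 1 (assumed form): label k-itemsets contained in at least
    a min_support fraction of the samples' label sets."""
    n = len(Y)
    labels = sorted(set().union(*Y))
    frequent = []
    for itemset in combinations(labels, k):
        count = sum(1 for y in Y if set(itemset) <= y)
        if count / n >= min_support:
            frequent.append(frozenset(itemset))
    return frequent


def split_by_itemsets(Y, itemsets):
    """Step 2 (assumed form): map each frequent itemset to the indices
    of the samples whose label set contains it."""
    return {s: [i for i, y in enumerate(Y) if s <= y] for s in itemsets}


def non_constant_features(X, rows):
    """Step 3 placeholder: keep features that take more than one value
    within the sub-sample. A real selector would go here."""
    n_features = len(X[0])
    return {j for j in range(n_features) if len({X[i][j] for i in rows}) > 1}


def fi_pipeline(X, Y, k=2, min_support=0.3, selector=non_constant_features):
    """Run steps 1-3, then step 4: take the union of all feature subsets."""
    itemsets = frequent_label_itemsets(Y, k, min_support)
    groups = split_by_itemsets(Y, itemsets)
    selected = set()
    for rows in groups.values():
        selected |= selector(X, rows)
    return sorted(selected)


# Toy data: 4 samples, 3 features, label sets over {"a", "b", "c"}.
X = [
    [1, 0, 5],
    [2, 0, 5],
    [3, 1, 5],
    [4, 1, 5],
]
Y = [{"a", "b"}, {"a", "b"}, {"b", "c"}, {"a", "b", "c"}]

print(fi_pipeline(X, Y, k=2, min_support=0.5))
```

Passing the selector as a parameter keeps the pipeline independent of how each sub-sample is reduced, so a neighborhood rough set reduction could be substituted for the placeholder without changing the other steps.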

Keywords/Search Tags:

Multi-label feature selection algorithm, Frequent itemset, ReliefF algorithm, Neighborhood rough sets, Attribute reduction

Related items

1	Research On Feature Selection Based On F-neighborhood Rough Sets
2	Feature Selection Method Based On ReliefF Algorithm And Rough Set
3	Attribute Reduction Algorithm Of Neighborhood Rough Sets And Its Application In Classifier
4	Attribute Reduction Algorithm Based On Neighborhood Rough Sets
5	Research On Accelerated Algorithm Of Attribute Reduction In Rough Sets And Its Neighborhood Model
6	Attribute Reduction Algorithm For Neighborhood Rough Sets And Its Application In Classifiers
7	A Study On Attribute Reduction Based On Neighborhood And Fuzzy Rough Sets
8	Research On Attribute Reduction Algorithms Based On Extended Rough Set Model
9	Research Of Attributes Reduction And Samples Reducing Algorithm Based On Neighborhood Rough Sets And Application In Text Categorization
10	The Research On Label Enhancement-Based Multi-Label Feature Selection Algorithm With Fuzzy Rough Sets