A Research On An Improved ML-KNN Multi-label Classification Method

Posted on:2018-05-19

Degree:Master

Type:Thesis

Country:China

Candidate:H M Fu

Full Text:PDF

GTID:2348330512484881

Subject:Engineering

Abstract/Summary:

Multi-label classification is an important branch to research data classification in data mining field.In the era of large data,the surge in the amount of data and the annotation structure of data are becoming more and more complicated,which makes the multi-label learning problem exist in the real world very widely.How to find a fast and effective multi-label classification algorithm with high classification accuracy has become a hot topic in the field of data mining.The research of multi-label data mining is becoming more and more prominent.This paper focuses on the problem of multi-label classification.According to the characteristics of multi-label classification,the main work of this paper is as follows:First,the existing multi-label classification algorithm is summarized and classified.In this paper,the algorithms that have been applied to multi-label classification learning are divided into problem-based transformation strategy and algorithm-based algorithm.For each kind of algorithm,the classification principle,classification step,the advantages and disadvantages of the algorithm and the adaptation conditions are expounded in detail,and several algorithms are simulated on the data set.Second,we propose an improved algorithm:IML-KNN,which is based on the multi-label classification algorithm ML-KNN.The improved points of IML-KNN are described in detail.The classification experiments are carried out on four multi-label data sets,and compared with the other two algorithms.Finally,some factors influencing the algorithm are analyzed and discussed.Finally,A new multi-label classification algorithm is proposed to apply the idea of pseudo-nearest neighbor and penalty function to IML-KNN algorithm.The new algorithm uses the pseudo nearest neighbor(PNN)instead of the nearest neighbor to find the nearest neighbor of the sample x more effectively,and adds the penalty function to improve the posterior probability.Then the principle of pseudo-nearest neighbor and penalty function,the steps of the improved algorithm,the result and analysis of classification experiment are described in detail.

Keywords/Search Tags:

data mining field, multi-label classification, KNN, ML-KNN, pseudo nearest neighbor, penalty function

Related items

1	Study On Generalized Nearest Neighbor Pattern Classification
2	Research On Multi-label Learning Algorithm Based On K Nearest Neighbor
3	Recognition Of Applicable Laws Based On Hierarchical Multi-label Classification
4	Research On Several Pattern Classification Methods Based On K-nearest Neighbor Criterion
5	Research Of Nearest Neighbor Classification Algorithm Based On Sample Selection
6	Research On Multi-label Learning And Algorithms Based On Data And Label Correlations
7	Optimization Of Nearest Neighbor Preserving Feature Selection In Multi-label Classification
8	Parallel Multi-label K-nearest Neighbor With Local Dependency
9	Improvement And Research Of Multi-label Learning Algorithm
10	Mining Research, Based On The Integration Algorithm Of The K-nearest Neighbor Classification