Research On Classification Algorithm Based On Representation Learning

Posted on:2022-06-24

Degree:Master

Type:Thesis

Country:China

Candidate:L Tan

Full Text:PDF

GTID:2518306338470104

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

The explosive growth of data generated by various ways such as the Internet makes data analysis extremely difficult.As a basic problem of data mining,classification has become a hot topic.With the continuous development of machine learning and deep learning,the performance of classification algorithms based on representation learning has been greatly improved.However,the current classification algorithm based on representation learning has the following problems:the spatial relationship between nodes is not considered when the training set is expanded,the performance of classification is poor when the data labels are unbalanced,and the relationship between nodes cannot be considered when multi-label classification is implemented.Therefore,the focus of this paper is to put forward effective solutions to the above mentioned problems,so as to get better classification performance.The main research contents of this paper are as follows:(1)This paper studies and analyzes the spatial relationship between nodes without considering the expansion of the training set,and proposes the ET-GCN algorithm.Firstly,the nodes in the network are pre-classified to get the pseudo-labels of nodes,and then the centers of different categories are obtained by clustering the pseudo-labels of nodes by using the distribution density relationship of nodes.K nodes that are closest to the category center are selected to be added into the training set for training.Experimental results show that the proposed method is more effective than the existing methods.(2)In this paper,the semi-supervised classification algorithm of graph convolutional neural network is analyzed and studied.In order to solve the problem that the label data imbalance is not considered when the graph convolutional neural network is doing the classification task,an enhanced classification GCNBoost algorithm is proposed in this paper.The algorithm is based on the classification error to increase the attention to the classification error node and improve the loss function of graph convolutional neural network.The experimental results show that the proposed method is more effective than the existing methods.(3)The multi-label classification algorithm based on feature aggregation is studied and analyzed,and the MLCAA algorithm is proposed.Firstly,the attribute graph is constructed according to the attribute relationship,and then the feature is aggregated according to the constructed attribute graph.In addition,in order to avoid the interference of noise,the features of nodes are extracted by the denoising autoencoder,and the aggregated features are input into the label prediction of data.The experimental results show that the proposed method is more effective than the existing methods.

Keywords/Search Tags:

classification, representation learning, graph convolutional neural network, multi-label, label imbalance

PDF Full Text Request

Related items

1	New Label Learning For Multi-Label Image Classification
2	Research On Multi-label Classification Algorithm With Label Correlations
3	Class-imbalance Issue In Applying Multi-label Learning To The Study Of Parkinson In Traditional Chinese Medicine Diagnosis
4	Contributions To Several Issues Of Multi-Label Learning
5	Research On Multi-label Learning With Missing Labels For Image Classification
6	Multi-label Prediction Model Based On Ontology Database And Data Mining In Bio-medicine
7	Research Of Multi-label Clothing Image Classification Based On Deep Learning
8	Research On Normalized Label Propagation Algorithm Dealing With Label Imbalance
9	Imbalanced Multi-label Learning Algorithm Based On Density Label Space
10	Graph-structured Description Of Multi-label Classification Problems And Researches On Several Learning Methods