The Research Of Incremental Learning Methods For Large-scale Multi-class Data Classification

Posted on:2018-11-21

Degree:Master

Type:Thesis

Country:China

Candidate:T T Xie

Full Text:PDF

GTID:2428330569999063

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

In recent years,dynamically growing data and incrementally growing number of classes pose new challenges to large-scale data classification research.Thus,training a new model with all the data is so time-consuming that it doesnot make sense.Besides,data is not arriving in a regular way,while some data are unlabeled,which hinders the development of incremental learning.To remedy it,we propose methods about incremental learning in both supervised learning and unsupervised learning.For incremental supervised learning,Most traditional methods struggled to balance the precision and computational burden when data and its number of classes increased.However,some methods are with weak precision,and the others are time-consuming.In this paper,we propose an incremental learning method,namely,heterogeneous incremental Nearest Class Mean Random Forest(hi-RF),to handle this issue.It is a heterogeneous method that either replaces trees or updates leaves of trees in the random forest adaptively,to reduce the computational time in comparable performance,when data of new classes arrive.Particularly,to keep the accuracy,one proportion of trees are replaced by new NCM decision trees;to reduce the computational load,the rest trees are updated their leaves probabilities only.Most of all,out-of-bag estimation and out-of-bag boosting are proposed to balance the accuracy and the computational efficiency.Fair experiments were conducted and demonstrated its comparable precision with much less computational time.For incremental unsupervised learning,traditional methods could only work in unsupervised classification,other than incremental learning,especially unsupervised incremental learning.To accomplish this goal,Incremental AutoEncoder(IAE)is proposed.IAE takes AutoEncoder as the basic model,which makes the original CNN as an encoder.Added with some constraints,IAE could enhance the classification ability of unsupervised tasks,and maintain the classification ability of original tasks.Besides,IAE do not need the original data to update the model,which reduces the storage memory a lot.Experiments were conducted to show that,the classification result of IAE is better than k-means,and make the incremental procedure to be end-to-end.

Keywords/Search Tags:

Incremental learning, Supervised Learning, Unsupervised Learning, large-scale data classification

PDF Full Text Request

Related items

1	Target Classification Of Synthetic Aperture Radar Based On Semi-supervised And Unsupervised Learning
2	Research And Application On Supervised Similarity Metric Learning Approaches
3	Fault Diagnosis For Industrial Processes Based On Machine Learning
4	Learning from partially labeled data: Unsupervised and semi-supervised learning on graphs and learning with distribution shifting
5	Research On Deep Unsupervised Learning Algorithm
6	Research On Network Traffic Classification Based On Machine Learning
7	Research And Parallel Application Of Supervised Learning Algorithms For Large-scale Data Classification Problems
8	Local Learning And Global Preserving Based Semi-supervised Algorithm For Large Scale Classification Problems
9	Research On Ensemble And Imbalanced Based Supervised/Unsupervised Learning Methods And Application
10	Design And Implementation Of Semi-Supervised Continuous Learning Framework