
Improved Stochastic Gradient Descent Algorithm For SVM

Posted on: 2018-07-17
Degree: Master
Type: Thesis
Country: China
Candidate: Z Jin
Full Text: PDF
GTID: 2348330539985815
Subject: Software engineering
Abstract/Summary:
Support vector machines (SVMs) are linear classifiers based on the margin maximization principle. They perform structural risk minimization, which controls the complexity of the classifier with the aim of achieving excellent generalization performance. An SVM accomplishes the classification task by constructing, in a higher-dimensional space, the hyperplane that optimally separates the data into two categories. Stochastic gradient descent (SGD) is a simple and effective training algorithm for SVMs. It is particularly fast for linear classification, and it also adapts to non-linear classification through Mercer kernels. Its running time scales linearly with the number of iterations and does not depend on the size of the training set. In this paper, we examine several variants of gradient descent and use them to optimize the linear SVM, in order to determine whether these algorithms improve linear SVMs. To improve the convergence rate and classification accuracy on large data sets, this paper also proposes a MapReduce-based SVM ensemble algorithm with SGD. We use the Hadoop Distributed File System to store the large training set and the MapReduce parallel computing model to train several SVMs as an ensemble. The results show that our methods achieve a faster convergence rate than Pegasos, a traditional SGD algorithm.
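For reference, below is a minimal sketch of the Pegasos-style SGD update the abstract refers to: one randomly chosen example per step, hinge loss on a linear SVM, and the decaying step size 1/(lambda*t). The function name and parameter values are illustrative assumptions, not the thesis implementation.

import numpy as np

def pegasos_sgd(X, y, lam=0.01, n_iters=1000, seed=0):
    # Pegasos-style SGD for a linear SVM with hinge loss.
    # X: (n_samples, n_features) array; y: labels in {-1, +1}.
    # Per-iteration cost is independent of the training set size,
    # matching the scaling property described in the abstract.
    rng = np.random.default_rng(seed)
    n, d = X.shape
    w = np.zeros(d)
    for t in range(1, n_iters + 1):
        i = rng.integers(n)            # sample one training example
        eta = 1.0 / (lam * t)          # decaying step size 1/(lambda*t)
        margin = y[i] * X[i].dot(w)
        if margin < 1:                 # hinge loss active: data term contributes
            w = (1 - eta * lam) * w + eta * y[i] * X[i]
        else:                          # only the regularizer contributes
            w = (1 - eta * lam) * w
    return w

# Example usage on a toy linearly separable problem:
rng = np.random.default_rng(1)
X = rng.normal(size=(200, 2))
y = np.where(X[:, 0] + X[:, 1] > 0, 1, -1)
w = pegasos_sgd(X, y, lam=0.1, n_iters=5000)
print("training accuracy:", np.mean(np.sign(X.dot(w)) == y))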
Keywords/Search Tags: SVM, SGD, Ensemble, MapReduce, Hadoop