
Sublinear Algorithms For Large-scale Kernel Learning

Posted on: 2019-06-11
Degree: Master
Type: Thesis
Country: China
Candidate: M Gu
Full Text: PDF
GTID: 2428330626952095
Subject: Software engineering
Abstract/Summary:
Support vector machines (SVM) and penalty logistic regression (PLR) are two important machine learning methods with a sound theoretical basis. For linearly inseparable problems, both SVM and PLR can use the kernel method to map the data from the original space into a high-dimensional space in which it becomes linearly separable; in this setting, linear SVM and PLR perform far worse than their nonlinear counterparts. However, the time complexity of nonlinear kernel learning is theoretically no lower than O(n^2), where n is the training set size, which prevents nonlinear SVM and PLR from being applied to large-scale datasets. To this end, this thesis designs two sublinear-time optimization algorithms for nonlinear SVM and one sublinear-time optimization algorithm for nonlinear PLR.

First, a sublinear algorithm for SVM is proposed based on gradient descent. Each iteration of the algorithm consists of a gradient descent update step and a projection step; a random Fourier feature map is used to map a sampled example into an explicit random feature space before performing these steps, so that the cost of each iteration is constant. We derive the convergence rate of the algorithm and prove that the time complexity of returning an ε-approximate solution to the corresponding problem is independent of the training set size. Furthermore, by replacing the loss function in SVM with the exponential loss function, a sublinear algorithm for nonlinear PLR can be designed by the same method.

Then, based on a subset selection technique and an improved primal-dual optimization algorithm, another sublinear-time optimization algorithm for nonlinear SVM is proposed, and its time complexity for solving SVM is likewise proved to be independent of the training set size. Finally, experimental results on several large-scale datasets demonstrate that the proposed algorithms are efficient and effective.
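To make the first algorithm's core idea concrete, the following is a minimal sketch of random Fourier features combined with per-example stochastic gradient descent and a projection step, in the style of Pegasos for the hinge loss. The toy dataset, the feature dimension D, and all hyperparameters are illustrative assumptions, not the thesis's actual experimental setup; the point is only that each iteration touches one sampled example and costs O(D), independent of n.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy binary classification data (hypothetical stand-in for a large dataset).
n, d = 2000, 5
X = rng.standard_normal((n, d))
y = np.sign(X[:, 0] * X[:, 1] + 0.1 * rng.standard_normal(n))
y[y == 0] = 1

# Random Fourier features approximating an RBF kernel
# k(x, z) = exp(-gamma * ||x - z||^2).
gamma, D = 0.5, 200
W = rng.normal(scale=np.sqrt(2 * gamma), size=(d, D))
b = rng.uniform(0, 2 * np.pi, size=D)

def phi(x):
    """Explicit feature map with phi(x) . phi(z) ~ k(x, z)."""
    return np.sqrt(2.0 / D) * np.cos(x @ W + b)

# Stochastic subgradient descent on the regularized hinge loss.
# Each iteration samples ONE example, so the per-iteration cost is O(D),
# independent of the training-set size n.
lam, T = 0.01, 5000
w = np.zeros(D)
for t in range(1, T + 1):
    i = rng.integers(n)
    z = phi(X[i])
    eta = 1.0 / (lam * t)
    if y[i] * (w @ z) < 1:                    # hinge loss is active: full step
        w = (1 - eta * lam) * w + eta * y[i] * z
    else:                                     # only the regularizer contributes
        w = (1 - eta * lam) * w
    radius = 1.0 / np.sqrt(lam)               # projection onto the L2 ball
    norm = np.linalg.norm(w)                  # containing the optimum
    if norm > radius:
        w *= radius / norm

acc = np.mean(np.sign(phi(X) @ w) == y)
print(f"training accuracy: {acc:.3f}")
```

Because the feature map is explicit, the learned model is a single D-dimensional vector rather than a kernel expansion over support vectors, which is what removes the dependence on n from each update.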
Keywords/Search Tags:Nonlinear kernel, Gradient descent, Subset selection technique, Random Fourier features, Support vector machine, Penalty logistic regression, Sublinear time