Optimization Of The Kernel Parameters Of The Support Vector Machine Based On The MAB Model

Posted on:2019-06-24

Degree:Master

Type:Thesis

Country:China

Candidate:X S Cao

Full Text:PDF

GTID:2438330551960710

Subject:Software engineering

Abstract/Summary:

PDF Full Text Request

Currently,it is undoubted that machine learning is the most popular topic,which is changing people's lives gradually,making it more intelligent and convenient.Support Vector Machine(SVM)is developed,which overcomes "the curse of dimensionality" and "overfitting",based on statistic learning theory and applied to regression and category.The parameters of Support Vector Machine have dramatic effects on the ultimate regression and category,but the appropriate theories have not been found out to instruct how to choose the parameters quickly and specifically.In statistic learning theory,Multi-Armed Bandit(MAB),short for Bandits Problem,is a problem that the arms of which bandits are pulled,how many times the arms are pulled and the choosing order when a gambler(or several gamblers)faces a line of multi-armed bandits.While pulling the arm,each bandit provides random praises with a certain probability.The goal of gamblers is to acquire the maximum reward in a series of pulling orders.The gamblers confront the important balance between "use" of maximum expected profit machine and"exploration" of mastering other machines' expected profit information in each experiment.The paper studies how to choose the parameters of SVM in terms of Multi-Armed Bandit(MAB),avoiding unsatisfactory category for inappropriate parameters in applying SUV.Primarily,the paper researches Multi-Armed Bandit(MAB)and Support Vector Machine(SVM),which improves kernel parameter of SVM and penalty term based on concluding the general principles,merits,drawbacks and relative studies systematically.Then,the paper studies theoretical foundation and development status of SVM,introduce its main idea and basic statistical theory,analyze ?-greedy,UCB1,Softmax and Pursuit and simulate astringency in different parameters.Other hypothetical conditions can be ignored for the characteristics of ?-greedy.In each situation,the next pulling is chosen by statistical calculation without any parameters,which means that any priori conditions are not required.Adequately understanding ?-greedy in MAB that is used to optimize initial kernel parameter ?,penalty term C and loss function parameter ?,a kind of SVM parameter optimization method based on MAB is proposed,which optimizes parameter C and parameter ?.Initial kernel parameter ?,penalty term C and loss function parameter ?,which are trained by UCI statistics,are regarded as the arms of multi-armed bandits.This idea is utilized to categorize and validate UCI data sets.Analyzing the result of this experiment,this method achieves great effects to a certain extent.

Keywords/Search Tags:

Support Vector Machine(SVM), Multi-Armed Bandit(MAB), Reinforcement learning, ?-greedy

PDF Full Text Request

Related items

1	Research On The Algorithm Of Personalized Learning System Based On Multi-armed Bandit
2	Research And Application Of Large-scale Multi-armed Bandit Algorithm
3	Research On Channel Selection Mechanism Based On Multi-armed Bandit In Cognitive Network
4	Research On Multi-armed Bandit Aided Online Learning Approach For Wireless Caching Strategy
5	Study On Relay Selection Algorithm Based On Multi-Armed Bandit In Underwater Acoustic Cooperative Communication Networks
6	Collaborative Deep Sensing By Dynamically Fusing Multiple Models
7	A Dynamic Pricing Algorithm Of Niche Goods Based On Multi-Armed Bandit Model
8	Research On Some Problesm Of Support Vector Machine Learing Algorithm
9	Research On Incentive Mechanism In Mobile Crowdsensing Based On Reinforcement Learning
10	Research Of Recommendation Algorithm Based On Contextual Restless Multi-armed Bandit Model