Research On Training Algorithm And Preprocessing Algorithm Of Support Vector Machine

Posted on:2010-01-30

Degree:Master

Type:Thesis

Country:China

Candidate:Y T He

GTID:2178360275958664

Subject:Computer software and theory

Abstract/Summary:

Support vector machines (SVMs) offer good generalization performance and a simple form, and they have been widely used in pattern recognition, signal processing, and image processing. Because training an SVM amounts to solving a quadratic programming problem, SVMs suffer from poor generalization on imbalanced data sets and a performance bottleneck on large data sets.

For imbalanced data sets (IDS), introducing a preprocessing algorithm eliminates redundant samples and shortens training time. However, existing preprocessing algorithms do not consider the size differences between the training sets, which results in low efficiency. For large data sets, decomposition algorithms use a working set strategy to reduce the complexity of SVM training. However, existing working set selection algorithms do not make full use of the information in the objective function, which leads to slow convergence.

This thesis investigates existing preprocessing and learning algorithms and gives solutions to these two problems. The main work is as follows:

1. Analyzing the causes of poor generalization on IDS and the difficulty of choosing the value of k for the preprocessing algorithm.
2. Introducing the distribution of the sample set into the preprocessing algorithm and improving the parameter selection algorithm, which eliminates redundant samples and improves generalization performance on IDS.
3. Comparing the working set selection strategies of two decomposition algorithms and analyzing the deficiencies of the working set selection algorithm used in svm-light.
4. Combining the working set selection method of libsvm with that of svm-light to propose a new working set selection algorithm based on second-order information. The effectiveness of the algorithm is demonstrated on UCI data sets.
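To make the idea of working set selection with second-order information more concrete, the following is a minimal Python sketch of the well-known second-order pair selection rule of the kind used in libsvm. It is not the combined algorithm proposed in the thesis, whose details are not given in this abstract. The function name `select_working_set`, its parameters, and the toy data are assumptions made for illustration only. The sketch assumes the usual SVM dual (minimize 0.5 a^T Q a - e^T a subject to 0 <= a_t <= C and y^T a = 0), with `grad` holding the gradient of that objective.

```python
def select_working_set(K, y, alpha, grad, C, tau=1e-12):
    """Pick a pair (i, j) of indices to optimize next (illustrative sketch).

    K     : kernel matrix as a list of lists, K[s][t] = k(x_s, x_t)
    y     : labels, each +1 or -1
    alpha : current dual variables
    grad  : gradient of the dual objective at alpha
    C     : upper bound on each alpha[t]
    Returns (i, j), or None if no violating pair exists.
    """
    n = len(y)

    def in_up(t):
        # Indices whose alpha can move in the +y direction.
        return (y[t] == 1 and alpha[t] < C) or (y[t] == -1 and alpha[t] > 0)

    def in_low(t):
        # Indices whose alpha can move in the -y direction.
        return (y[t] == 1 and alpha[t] > 0) or (y[t] == -1 and alpha[t] < C)

    up = [t for t in range(n) if in_up(t)]
    if not up:
        return None

    # First index: the maximal violator, chosen with first-order information.
    i = max(up, key=lambda t: -y[t] * grad[t])
    g_max = -y[i] * grad[i]

    # Second index: the candidate giving the largest estimated decrease
    # b^2 / a of the objective, which uses curvature (second-order) information.
    best_j, best_gain = None, 0.0
    for t in range(n):
        if not in_low(t):
            continue
        b = g_max + y[t] * grad[t]
        if b <= 0:
            continue  # (i, t) is not a violating pair
        a = K[i][i] + K[t][t] - 2.0 * K[i][t]
        if a <= 0:
            a = tau  # guard against non-positive curvature
        gain = b * b / a
        if gain > best_gain:
            best_gain, best_j = gain, t

    if best_j is None:
        return None
    return i, best_j


# Toy example with a linear kernel on three one-dimensional points.
x = [0.0, 1.0, 3.0]
K = [[xi * xj for xj in x] for xi in x]
y = [1, -1, -1]
alpha = [0.0, 0.0, 0.0]
grad = [-1.0, -1.0, -1.0]  # gradient of the dual objective at alpha = 0
pair = select_working_set(K, y, alpha, grad, C=1.0)
print(pair)  # (0, 1)
```

In this toy case, both negative samples violate the optimality condition by the same first-order amount, so a purely gradient-based rule could not tell them apart. The curvature term in the denominator favors the pair with the larger estimated decrease of the objective, which is the kind of objective-function information that a first-order selection rule leaves unused.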

Keywords/Search Tags:

support vector machine, k-nearest neighbor, imbalanced data set, feasible direction method, second-order information

Related items

1	Study On The Key Problem Of The Competing Learning Vector Quantization And Support Vector Machine
2	Support Vector Machine-based Data Mining Method
3	Research And Application Of Imbalance Data Classification Based On Support Vector Machine
4	Research On Improved Support Vector Machine Based On Category Imbalanced Dataset
5	Research On Some Issues In Support Vector Machines
6	Research On Network Intrusion Detection Based On Support Vector Machine Combine With K Nearest Neighbor Method
7	The Research Of Imbalanced Data Classification Algorithm Based On Support Vector Machine
8	The Research And Realization Of The Military Port Objects Classification Platform
9	Feasible Performance Modeling Of Analog Circuit Based On SVM
10	Research On Outlier Detection Method Based On Nearest Neighborhood