Study On Text Categorization Method Based On Support Vector Machine

Posted on:2007-08-02

Degree:Master

Type:Thesis

Country:China

Candidate:W Ying

Full Text:PDF

GTID:2178360212480626

Subject:Systems Engineering

Abstract/Summary:

PDF Full Text Request

Data mining is a new technology that is used to extract useful information and knowledge from large databases. Text classification is an important task of data mining. Facing the massive volume and high dimensional data how to build effective algorithm for text mining is one of research directions of data mining. Aiming at above issues, some problems of text classification with SVM (support vector machine) have been studied substantially in this paper. The main contents are listed as follows:Through analyzing the main reason that the training speed of SVM is slow, we employ a pre-extracting SVs(support vectors) algorithm and circulated iterative algorithm to improve the speed of training SVM. And based on it a new two classes text categorization algorithm is presented which includes pre-extracting support vectors as the initial working set and fuzzy circulated iterative algorithm as training method of SVM. Compared with the conventional support vector machines, the present method possesses much higher computation efficiency.To solve the problems and defections of existing methods of SVM multiclass classification, a new method of SVM multiclass classification based on binary tree is employed and applied it to multiclass text categorization. Several simulations demonstrate that compared with the existing methods,the new method prseessed the following advantages: the number of SVMs needed to be trained is less, the speed of training and decision is fast and the region that can not be classified does not exist again.

Keywords/Search Tags:

Text mining, support vector machines (SVM), two classes text categorization, multiclass text categorization

PDF Full Text Request

Related items

1	A Study On Text Categorization Based On Machine Learning
2	The Research And Implementation Of Automatic Text Categorization For Chinese Web Documents
3	Research On Support Vector Machines Classification Algorithm In Text Categorization
4	An examination of KSS for feature selection for text categorization using support vector machines
5	Research On Text Categorization Based On Support Vector Machines
6	Support vector machines for text categorization
7	Research On Chinese Text Categorization Based On Support Vector Machine
8	The Research And Implementation Of Chinese Text Categorization
9	Text Categorization Research Based On Support Vector Machine
10	Study On Text Category Oriented Chinese Text Mining And Its Implementation