Data Preprocessing And K-Means Clustering Based Support Vector Regression Model

Posted on:2013-04-11

Degree:Master

Type:Thesis

Country:China

Candidate:W G Zhao

Full Text:PDF

GTID:2248330371987458

Subject:Applied Mathematics

Abstract/Summary:

PDF Full Text Request

In the practice of people’s production and life, forecasting of something is a work which is very rich in practical significance, where the accuracy is its lifeblood. How to improve the forecasting accuracy has been the focus of the study researchers. They usually take the means of improving the fitting accuracy of the prediction model to the original series, but if the data itself is a problem and thus can not correctly reflect the trend of the series, no matter how good the fitting accuracy is, the model is also likely to have a poor forecasting accuracy.In view of this situation, this paper attempts to improve the forecasting accuracy through data preprocessing, specifically, that is pre-detection of data jumps, excluding the outliers or noise reduction for the original series prior to forecasting. For the choice of the forecasting model, since the training set with high internal similarity can be more effectively simulated, this paper introduces a new algorithm, that is K-means clustering based least squares support vector regression (denoted by K-LSSVR). It first divides the training set into several categories according to the Euclidean distance of the input vectors using K-means clustering. Then it uses them respectively to train the LSSVR model. In the phase of forecasting, according to what category each input vector belongs to, K-LSSVR selects the corresponding LSSVR model to predict.Through the inspection of three simulations, we can find the forecasting accuracy of K-LSSVR is generally improved compared with LSSVR (especially when the data contains data jumps or outliers). What’s more, preprocessing for the data can even further improve the forecasting accuracy.

Keywords/Search Tags:

Data preprocessing, Least squares support vector regression, K-meansclustering, Data jump, Outliers processing, EMD-based signal filtering

PDF Full Text Request

Related items

1	Study Of Least Squares Support Vector Regression
2	Regression Analysis And Application Of Support Vector Regression In Material Experimental Data
3	Research On Least Squares Twin Support Vector Regression
4	Support Vector Regression Based Single Image Super-resolution
5	Support Vector Regression Analysis Based On Complex Censored Data
6	The Comparation And Study On Least Square Method,?-Support Vector Regression And Least Square Support Vector Regression
7	Research On Robust Learning Models And Algorithms Of Support Vector Machine
8	Research On Twin Support Vector Regression
9	Research On Wideband Digital Predistortion Technology Based On Machine Learning
10	Research And Application On Regression Analysis Of Water Quality Data