Study On Sequential Prediction And Clustering Of Big Data

Posted on:2017-03-18

Degree:Master

Type:Thesis

Country:China

Candidate:Q T Zhang

Full Text:PDF

GTID:2180330485951783

Subject:Probability theory and mathematical statistics

Abstract/Summary:

PDF Full Text Request

As the advancement of modern science and technology, the volume of data used is expanding, so the research of big data become extremely urgent. This paper study sequential prediction and clustering of big data. Firstly, we propose a sequential linear regression (SLR) method for a large amount of sequential data. This method is not on-ly computationally efficient in speed and storage but also has higher accuracy than the method of mean predicting. Besides, a weighted strategy is introduced on the curren-t model to determine the impact of data from different periods. Secondly, we propose sparse autoencoder neural network method for reducing dimensions for the high dimen-sions unlabelled data, the solution algorithm is numerical optimization algorithm and a standard k-means algorithm is applied to form the clusters on the hidden layer. When compared with others clustering method, we demonstrate the advantage of our method from the simulation data and real data.

Keywords/Search Tags:

Big Data, Sequential Linear Regression Method, Weighted Sequential Lin- ear Regression Method, Sparse Autoencoder Neural Network Method, Clustering

PDF Full Text Request

Related items

1	Numerical Algorithms On Regression Parameter Estimation Based On Sparsity And Diversity
2	A Sequential Sampling Method Based On Support Vector Regression
3	Research On Quantitative Precipitation Estimation Algorithms Based On Sequential Regression Of Spatio-temporal Data
4	Sequential Monitoring Coefficient Change In Linear Regression Model
5	Weighted Linear Quantile Regression In LTRC Data Model
6	A Kind Of Sequential Decision Method Based On MCTS With Its Application In The Acupoints’ Ranking Schema
7	Research On Regression-based Sparse Matrix Decomposition Method And Its Application In Sequencing Data
8	Variants of multivariate adaptive regression splines (MARS): Convex vs. nonconvex, piecewise-linear vs. smooth and sequential algorithms
9	The Theory And Method Of Geographically And Temporally Neural Network Weighted Regression
10	Research On Sequential Test Methods Based On Multiple SPRTs