Research On The Prediction Of Students' Achievement In Educational Data Mining By Cooperative Filtering Algorithm

Posted on:2017-02-17

Degree:Master

Type:Thesis

Country:China

Candidate:C C Liu

Full Text:PDF

GTID:2278330488964793

Subject:Computer system architecture

Abstract/Summary:

PDF Full Text Request

Currently, the education of students in the school accumulated a lot more obvious variety of data, such as student enrollment, dropout rates and student achievement scores of data subjects, in particular the right to rate their classes to answer the question of knowledge mastery of information. Clearly, these various types of data in the field of education is constantly changing, as will the development of information and accumulation increased, how to extract these complex data burdensome useful information, with good research value.Combining collaborative filtering algorithm similarity in the field of e-commerce and other data analysis, collaborative filtering algorithm will be applied to the data in the field of education, focusing on student achievement prediction research on KDD Cup 2010 Competition selected from ITS Intelligent Tutor System 8.9 million pieces of data as the experimental data sets were student achievement data mining prediction education practice. Experimental data set features a large amount in the range is large, mostly text-type data, partial data sparse and so on. To solve these problems, this paper carried out the following work:(1) an incremental sampling method, to determine the optimal size of the training set, substantially reduce the amount of training set records; combined data set time characteristics, the training set to extract the latest data of N; remove the implicit answer large result sets null proportions feature, part of a complex structure of separate property.(2) a single K-nearest neighbor classification algorithm and singular value decomposition SVD model apply to educational data set to validate the test set Correct First Attempt (CFA) property prediction, and as evaluation of the content, while comparison of the two algorithms prediction.(3) This article is also based on two base algorithms complementary characteristics, the SVD dimensionality reduction and K-nearest neighbor algorithm combined forecast student achievement. Experiments can be analyzed, the algorithm enables data sparsity eased to some extent, but only to retain the basic characteristics of the data, partial data loss caused by dimensionality reduction evaluation results will cause little impact.

Keywords/Search Tags:

Educational data mining, K nearest neighbor, dimensionality reduction, handling characteristics, student performance prediction

PDF Full Text Request

Related items

1	Nonlinear Dimensionality Reduction Based On Stochastic Initialization
2	Research On Dimensionality Reduction And Quantification Methods Of Approximate Nearest Neighbor Query For Streaming Data
3	On the K-Nearest Neighbor approach to the generation of fuzzy rules for college student performance prediction
4	Research On K-nearest Neighbor Search Algorithm In High Dimensional Space
5	Study Of College Student Performance Prediction Based On Machine Learning
6	Research On Dimensionality Reduction And Prediction Methods In Time Series Data Ming
7	Student Interest Analysis And Prediction Based On Sequential Convolutional Network
8	Research Of Dimensionality Reduction And Its Appliacation On Data Mining Of Large-Scale Text
9	Study And Application Of Several Improved Methods Of Nonlinear Dimension Reduction For High Dimensional Data
10	Study On Generalized Nearest Neighbor Pattern Classification