Research On Predicting The Difficulty Of Test Paper Based On Title Attribute And Text

Posted on:2019-05-05

Degree:Master

Type:Thesis

Country:China

Candidate:Q M Wang

Full Text:PDF

GTID:2370330545468632

Subject:Applied Statistics

Abstract/Summary:

PDF Full Text Request

Since the selection examination has been widely concerned by the society,the research on the propositional quality of the test questions has a very important significance,and the adjustment of the difficulty of the test questions is the key to improve the quality of the proposition.Taking the mathematics test questions as an example,this paper analyzes the data of eight sets of tests for an important selection test in a province and city for four consecutive years,and estimates the difficulty parameters of the test questions by using the project response theory.Considering that different tests are composed of different subjects,the difficulty of different test estimates is not comparable.Through the equivalent experimental design,all tests are combined to establish a matrix of questions,and the scores on all different test forms are converted to the same score.On the scale,Achieve the goal of unified evaluation.In this paper,18 subject attributes are separated from the test questions by artificial coding.Starting from the four dimensions of base-development,problem type,knowledge,and ability,nine topic attributes that have a significant impact on the difficulty of the test questions have been selected through T tests for 160 topics.They are base-development,problem type,algorithm,sequence,arithmetic solution,data processing,abstract summary,analysis and problem solving,reasoning and innovation.Based on the extracted topic attributes,33 text word vectors,66 text word vectors,and combination of title attributes and text word vectors as independent variables,a multiple linear regression model was proposed.In order to reduce the risk of overfitting,the support vector regression model was further proposed and the penalty function was used.To reduce the complexity of the model has solved the problem of overfitting.The empirical results show that the vector SVM model based on topic attributes and 33 text words is superior.It has a smaller mean square error on the test set and the model fits well.In addition,because the difficulty itself is a vague concept,the difficulty of the questions is divided into four levels from easy to difficult,and the difficulty level is classified by the support vector machine for the title and answer text and the title attribute respectively.The results show that the prediction accuracy of the support vector machine model based on the topic attributes is higher in the two prediction models,which is 75%.It also confirmed the feasibility of pre-testing difficulty levels with support vector machines.

Keywords/Search Tags:

Item Response Theory, Support Vector Regression, Difficulty, K-fold cross validation

PDF Full Text Request

Related items

1	Logistic regression and item response theory: Estimation item and ability parameters by using logistic regression in IR
2	Bayesian and frequentist cross-validation methods for explanatory item response model
3	The Personality Test Item Response Model Containing The Time Variable Based On The Information Function-response Difficulty Assumption
4	Semiparametric Item Response Model
5	Research On Intertemporal Arbitrage Strategy Of Stock Index Futures Based On GA-SVR-GARCH
6	A New Support Vector Regression Model
7	A New Support Vector Regression Model
8	Research On Model Selection For Support Vector Regression And Application Of It
9	Geographically And Temporally Weighted Least-Square Support Vector Regression Machine And Its Application
10	The Differential Geometry Method Of Support Vector Regression Machines