Chi-square Test Neural Network

Posted on:2020-10-05

Degree:Master

Type:Thesis

Country:China

Candidate:L L Li

Full Text:PDF

GTID:2428330596487272

Subject:computer science and Technology

Abstract/Summary:

PDF Full Text Request

Artificial neural network(ANN)has been a research hotspot in the field of artificial intelligence since the 1980s.It abstracts the neural network of human brain from the perspective of information processing,builds some simple model,and forms different networks according to different connection modes.It is composed of a large number of nodes(or neurons)connected to each other.Chi-square test is a widely used hypothesis testing method,which is mainly used in statistical inference of classified data.It determines the chi-square value by calculating the deviation between the actual observed value and the theoretical inferred value.This paper introduces the chi-square test neural network:a single hidden layer backpropagation neural network using chi-square test theorem to redefine the cost function and the error function.The weights and thresholds are modified using standard backpropagation algorithm.In the algorithm,a small number of labeled samples are divided into some subinterval,and the theoretical probability of a certain type of samples within each subinterval is calculated as a reference,and the training model is trained with unlabeled data.Due to the homogeneity of chi-square test,the prediction probability that the classification result of the input samples fall in each subinterval during training should be approximate to the theoretical probability.The two probabilities are substituted into the chi-square test statistical formula and compared with the critical value to judge whether the training is over.If not,update the parameters according to the improved error formula and continue to train the model.Therefore,the algorithm proposed in this paper can use a small number of labeled samples as a reference and use a large number of unlabeled samples to train the model.The proposed approach was tested on five sets of public and data sets provided by UCI machine learning library and private apple near-infrared spectral data and lung CT images.The proposed approach has the advantage of making consistent data distribution over training and testing sets.It can be used for binary classification.The experimental results on real world data sets indicate that the proposed algorithm can significantly improve the classification accuracy comparing to related approaches.This proves that the algorithm is feasible.Moreover,only a small amount of labeled data is needed to calculate the theoretical probability as a reference.And to a certain extent,it solves the problem of difficult data acquisition with labels.

Keywords/Search Tags:

Semi-supervised learning, the chi-square test, artificial neural network, binary classifier

PDF Full Text Request

Related items

1	Research On Semi-supervised Classification Algorithm Based On Integrated Neural Network
2	Research And Application Of The Semi-supervised Algorithm Based On Classifier Combination
3	Comparison And Improvement Of Two Methods Based On Semi-supervised Learning
4	Comparison And Improvement Of Two Methods Based On Semi-Supervised Learning
5	Semi-Supervised Learning With Multiple Views
6	Research Of Reliable Semi-supervised Classification
7	Research On Semi-supervised Clustering And Classification Algorithm
8	Biomedical Entity Relation Extraction Based On Semi-supervised Learning And Deep Learning
9	The Analysis Of The Method For Modulation Recognition Based On Semi-supervised Learning
10	Research On The Semi-supervised Few-shot Classification Based On Siamese Network And GMM