Research On Software Defect Prediction Method Based On Adaptive Synthetic Sampling And Denoising Autoencoder

Posted on:2024-02-29

Degree:Master

Type:Thesis

Country:China

Candidate:Z J Li

Full Text:PDF

GTID:2568307151967509

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

With the rapid development of the internet,various kinds of software come out one after another.These software make life more convenient,but they also become more and more complex,and more prone to some defects.Software defect prediction can help developers and testers to find the problem early and ensure the reliable operation of the software system.However,there are two problems in software defect prediction,the first is the class imbalance of software defect data,the second is the high-dimensional feature of software defect data.These two problems affect the performance of software defect prediction.In order to solve these problems,this thesis proposes a software defect prediction method based on adaptive synthetic sampling and denoising autoencoder.The main contents of this thesis are as follows.Firstly,to solve the class imbalance problem of defective data sets,this thesis proposes an adaptive synthetic sampling method based on genetic algorithm.By calculating the proportion of the majority samples around the minority samples as the weight,the adaptive synthetic sampling can synthesize more new samples for the hard-to-learn samples in the minority classes,thus,the decision boundary is shifted to the hard-to-learn samples to reduce the bias caused by unbalanced learning.Then,using the individual evolution method in genetic algorithm,the selected samples are crossed and mutated into adaptive synthetic sampling to generate new samples,so as to balance the data set.Secondly,for the problem of feature high dimension,this thesis proposes a feature representation based on denoising autoencoder,which is realized by neural network.Firstly,the data is corrupted by noise,and then the corrupted data is input into the neural network,and it is required to reconstruct the original input through encoding and decoding.In this process,noise is introduced to the data to force the neural network to learn more robust coding and improve the generalization ability of the model.Thus,when the original data set is input,the hidden layer of the autoencoder can get more representation of the nature of the data,and solve the problem of feature high-dimension without reducing the number of data features.Finally,based on the adaptive synthetic sampling and denoising autoencoder,the support vector machine is selected as the classifier to construct the software defect prediction model.The validity of this method is verified by using the NASA MDP data set,and compared with other researchers proposed methods,and the experimental results are analyzed.

Keywords/Search Tags:

software security, software defect prediction, class imbalance, denoising autoencoder, neural network

PDF Full Text Request

Related items

1	Research And Implementation Of Software Defect Prediction Model Construction And Sharing Methods
2	Software Defect Prediction Strategy Design For Imbalanced Data
3	Research On Sampling Integration Algorithm Of Unbalanced Data In Software Defect Prediction
4	Research On Unbalanced Data Classification Algorithm In Software Defect Prediction
5	Wide Research Of Data Mining With Machine Learning On Software Defect Prediction
6	Software Defect Prediction Model Based On Deep Learning
7	Research On High-dimensional Data Processing In Software Defect Prediction
8	Research On Data Preprocessing Technology In Cross Project Software Defect Prediction
9	Research On Software Defect Prediction Model For High Dimensional And Imbalanced Data
10	Research On Software Defect Prediction Based On Learning Mechanism