Learning With Unlabeled Data Based Research On Software Quality Assurance

Posted on:2016-06-20

Degree:Master

Type:Thesis

Country:China

Candidate:Z X Yang

Full Text:PDF

GTID:2348330461460088

Subject:Computer software and theory

Abstract/Summary:

PDF Full Text Request

With the increasing integration level of software into daily life,people get higher requirements for software quality.In order to improve the software quality in the perspective of defects as the object and developers as the subject,thorough research and analysis on software defect detection and effort estimation have been conducted respectively,and new methods based on learning with unlabeled data have been utilized to tackle the issue of scarcity of valid labeled data,with some major results obtained and described as follows.First of all,as to the issue of biased label in software defect detection,by assuming these modules as unlabeled data,the positive and unlabeled learning framework is utilized to identify hidden defective modules and a new technique based on kernel density estimation is proposed to extract reliable negative instances.The proposed method is capable of identifying hidden defective modules from the originally labeled as defect-free ones effectively.Furthermore,a defect detection model in a semi-supervised manner is built by using the remaining unlabeled data to further improve the prediction performance to a large extent.Secondly,to tackle the effort estimation issue in software development with extremely small data sets,a novel technique based on the twice learning framework is proposed to generate a large amount of unlabeled virtual examples,and combine the models with strong generalization ability and high comprehensibility respectively to build the ultimate prediction model.As a result,the proposed model is able to achieve better performance as well as disclosing the key factors within effort estimation effectively.

Keywords/Search Tags:

Software Quality Assurance, Software Defect Detection, Effort Estimation, Positive and Unlabeled Learning, Twice Learning

PDF Full Text Request

Related items

1	Intrusion Detection Technology Research Based On Positive-unlabeled Learning
2	Research And Application Of Effort-aware Software Defect Prediction Based On Approximate Density
3	A Study On Learning From Positive And Unlabeled Examples
4	Software Defect Prediction Based On Spiking Neural Networks
5	Bayesian Classifier For Positive Unlabeled Learning With Uncertainty
6	FPA-Based Software Effort Estimation Research And Practice
7	Research On Positive And Unlabeled Learning By Random Forest
8	Early Software Effort Estimation Supported By Semantic Analysis Of Requirement Documents
9	Software Defect Prediction Research For Unlabeled Datasets
10	Research On Data Drought Key Techniques For Software Effort Data Based On Machine Learning