Software Bug Triaging Based On Text Classification And Developer Rating

Posted on:2019-05-30

Degree:Master

Type:Thesis

Country:China

Candidate:X W Shi

Full Text:PDF

GTID:2428330545499761

Subject:Computer software and theory

Abstract/Summary:

PDF Full Text Request

Over the past two decades,we have witnessed the growing effect of software on human society.This trend echoes the famous Marc Andreessen's 2011 proclamation that software is eating the world.Moreover,software-defined anything has been identified by Gartner as one of the top ten strategic technology trends for 2014,implying that we are entering a new era of the information society defined by software.Compared with proprietary software,open-source software(OSS)brings forth a fundamental shift in the manner in which software is developed and distributed.Due to lower cost,superior security,freedom from vendor lock-in,better quality and other primary reasons,an increasing number of enterprise-level companies have adopted OSS offerings as an essential part of their IT systems.To improve the efficiency of bug triage and reduce the cost of human and time,so far,many researchers have proposed a variety of automatic bug assignment methods.According to the data,algorithms,and tossing graph models,there are three categories of existing methods:text content-based approach,developer relationship-based approach,and hybrid approach.However,most of the existing methods assume that developers who distribute,discuss,or fix bugs are helpful to bug triaging.Therefore,they usually regard all the developers who are involved in the triaging process of a bug as the labels of the bug,so that they can get a higher recall rate when recommending possible developers.First of all,a prediction method based on text classification and developer rating is proposed in this paper.The core idea of building the prediction model is to consider both text classification based on machine learning and rating based on the source of bugs.According to the experiment on hundreds of thousands of bugs in the Eclipse and Mozilla projects,in the ten-fold incremental verification mode,the best average accuracies of our method reach 78.39%and 64.94%,respectively.Moreover,the accuracies of our method are increased by 17.34%and 10.82%,respectively,compared with the highest average accuracies of the baseline method(machine learning classification + tossing graphs).Therefore,the results indicate the effectiveness of our method.Also,we analyze the cooperation between developers based on the history of each bug report,calculate the tossing probability,and construct the tossing graph for bugs.We then combine the method mentioned above and the tossing graph to recommend potential developers related to the bug triaging process.Aiming at the defect data of the two open-source projects of Eclipse and Mozilla,the proposed approach obtains high recall rates which reach 78.20%and 71.98%,respectively.They are increased by 25.53%and 27.74%,respectively,compared with the FastText+developer rating+tossing graphs method.Also,the recall rates of our method are 14.17%and 13.25%higher than those of the ML-KNN+developer similarity method,which demonstrates the effectiveness of our method for recommending multiple possible developers.

Keywords/Search Tags:

Open-source software, Bug triage, Text classification, Rating, Tossing graph

PDF Full Text Request

Related items

1	Automatic Software Bug Triage Method Based On Multi-feature Bug Tossing Graphs
2	Research On Developer Recommendation Method For Automatic Bug Triage Of Open Source Software
3	Research On Open-Source Npm Packages Classification Based On Text Analysis
4	An Automatic Bug Triage Approach Based On Deep Learning
5	Research On Open-source Software Quality Evaluation And Recommendation Technology Based On A Graph Model
6	Open Source Software Vulnerability Mining Method Based On Knowledge Graph
7	On Intellectual Property Protection Of Open Source Software
8	Research On Relationship Between Code Quality And Software Defects For Open Source Software
9	Research On The Key Technologies Of Massive Open Source Resources Location For Software Reuse
10	Research And Implementation Of Open Source Software Systems Situation Analysis