Font Size: a A A

Partitioning filter approach to noise elimination: An empirical study in software quality classification

Posted on:2005-03-21Degree:M.SType:Thesis
University:Florida Atlantic UniversityCandidate:Rebours, PierreFull Text:PDF
GTID:2458390011951321Subject:Computer Science
Abstract/Summary:
This thesis presents two new noise filtering techniques which improve the quality of training datasets by removing noisy data. The training dataset is first split into subsets, and base learners are induced on each of these splits. The predictions are combined in such a way that an instance is identified as noisy if it is misclassified by a certain number of base learners. The Multiple-Partitioning Filter combines several classifiers on each split. The Iterative-Partitioning Filter only uses one base learner, but goes through multiple iterations. The amount of noise removed is varied by tuning the filtering level or the number of iterations. Empirical studies on a high assurance software project compare the effectiveness of our noise removal approaches with two other filters, the Cross-Validation Filter and the Ensemble Filter. Our studies suggest that using several base classifiers as well as performing several iterations with a conservative scheme may improve the efficiency of the filter.
Keywords/Search Tags:Filter, Noise, Base
Related items